A lightweight web-based Streamlit application that allows users to train a simple machine learning model and evaluate whether the model produces biased outcomes across different demographic groups. The goal of this project is to help users move beyond standard model accuracy and better understand fairness in AI systems.
- Data Input: Upload any tabular CSV dataset or use the built-in synthetic demo dataset.
- Model Configuration: Select the target variable to predict and the sensitive attribute (e.g., gender, age group).
- Automated Model Training: Trains a basic Logistic Regression model automatically.
- Fairness Metrics: Evaluates potential bias by calculating Accuracy, False Positive Rate (FPR), and False Negative Rate (FNR) per demographic group.
- Visualizations: Interactive and clear bar charts comparing performance metrics across groups.
- Insight Generation: Automated text summaries alerting you if performance disparities exceed a 10% fairness threshold.
- Modern UI: Clean and professional user interface powered by Streamlit and Ant Design components.
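To illustrate the per-group fairness metrics listed above, here is a minimal sketch of how Accuracy, FPR, and FNR can be computed for each demographic group. This is an illustration, not the app's actual implementation; the function name and toy data are hypothetical:

```python
import pandas as pd

def group_metrics(y_true, y_pred, groups):
    """Compute Accuracy, FPR, and FNR per demographic group."""
    df = pd.DataFrame({"y": y_true, "pred": y_pred, "group": groups})
    rows = {}
    for g, sub in df.groupby("group"):
        tp = ((sub.y == 1) & (sub.pred == 1)).sum()
        tn = ((sub.y == 0) & (sub.pred == 0)).sum()
        fp = ((sub.y == 0) & (sub.pred == 1)).sum()
        fn = ((sub.y == 1) & (sub.pred == 0)).sum()
        rows[g] = {
            "accuracy": (tp + tn) / len(sub),
            "fpr": fp / (fp + tn) if (fp + tn) else 0.0,  # false positive rate
            "fnr": fn / (fn + tp) if (fn + tp) else 0.0,  # false negative rate
        }
    return pd.DataFrame(rows).T  # one row per group

# Toy example: two groups with identical labels but different predictions.
y_true = [1, 0, 1, 0, 1, 0, 1, 0]
y_pred = [1, 0, 0, 1, 1, 0, 1, 1]
groups = ["A", "A", "A", "A", "B", "B", "B", "B"]
print(group_metrics(y_true, y_pred, groups))
```

The insight generation described above then reduces to comparing these rows: if, say, the FNR gap between any two groups exceeds the 10% threshold, a warning is emitted.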
- Python 3.8+
- pip (Python package installer)
- Clone or download this repository.
- Open a terminal/command prompt in the project directory (`ai-bias-detection`).
- Create a virtual environment:

```shell
# On Windows
python -m venv venv
.\venv\Scripts\activate

# On macOS/Linux
python3 -m venv venv
source venv/bin/activate
```
- Install the required dependencies:
```shell
pip install -r requirements.txt
```
Once your virtual environment is activated and dependencies are installed, start the Streamlit server:
```shell
streamlit run app.py
```

The dashboard will automatically open in your default web browser (typically at http://localhost:8501).
- Choose Data Source: On the sidebar, select Upload CSV or Demo Dataset.
  - If using the Demo Dataset, the app will automatically load a synthetic dataset with built-in gender bias for loan approvals.
- Configure Model Settings:
  - Select the Target Column you want the model to predict.
  - Select the Sensitive Attribute (the demographic group to analyze for bias).
- Train Model: Click the "Train Model" button.
- Review Results: Check the overall accuracy, compare the bias evaluation charts, and read the generated insight summaries at the bottom to see whether the model exhibits unfairness.
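Under the hood, the training step amounts to a standard scikit-learn workflow. The sketch below shows one plausible version (the column names and preprocessing choices here are illustrative assumptions, not the app's exact code):

```python
import pandas as pd
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split

# Hypothetical data; in the app, the user picks the target column.
df = pd.DataFrame({
    "income":       [30, 60, 45, 80, 25, 70, 50, 90],
    "credit_score": [600, 700, 650, 750, 580, 720, 660, 780],
    "gender":       ["F", "M", "F", "M", "F", "M", "F", "M"],
    "approved":     [0, 1, 0, 1, 0, 1, 1, 1],
})

# One-hot encode categoricals, split, and fit a basic Logistic Regression.
X = pd.get_dummies(df.drop(columns=["approved"]))
y = df["approved"]
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=42
)
model = LogisticRegression(max_iter=1000).fit(X_train, y_train)
print(f"Test accuracy: {model.score(X_test, y_test):.2f}")
```

The per-group fairness metrics are then computed by slicing the test-set predictions by the selected sensitive attribute.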
The provided demo_dataset.csv is synthetically generated to simulate loan approvals. It is intentionally biased: the 'Male' group has a significantly higher baseline chance of approval than the 'Female' group, independent of merit-based factors such as income and credit score. This demonstrates how a model inherits hidden biases from historical data, and how the dashboard detects them.
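A dataset with this kind of baked-in bias can be generated in a few lines of NumPy. The sketch below is one way to do it (column names and coefficients are illustrative, not taken from the actual demo generator): approval probability depends partly on merit and partly on a fixed group offset.

```python
import numpy as np
import pandas as pd

rng = np.random.default_rng(0)
n = 1000
gender = rng.choice(["Male", "Female"], size=n)
income = rng.normal(50_000, 15_000, size=n)
credit = rng.normal(650, 60, size=n)

# Merit signal from standardized income and credit score...
merit = 0.4 * (income - 50_000) / 15_000 + 0.4 * (credit - 650) / 60
# ...plus a fixed group offset: this is the injected bias.
logit = merit + np.where(gender == "Male", 1.0, -1.0)
p_approve = 1 / (1 + np.exp(-logit))
approved = (rng.random(n) < p_approve).astype(int)

df = pd.DataFrame({"gender": gender, "income": income,
                   "credit_score": credit, "approved": approved})
print(df.groupby("gender")["approved"].mean())
```

A model trained on `approved` will reproduce the group offset, which is exactly the disparity the dashboard's per-group metrics surface.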
- Frontend / UI: Streamlit, streamlit-antd-components
- Data Processing: Pandas, NumPy
- Machine Learning: Scikit-learn
- Visualizations: Matplotlib, Seaborn