Retail Credit Scoring Model – Predicting Loan Defaults

Project Overview

This project simulates the development of a consumer credit risk scoring system, closely mirroring the modeling practices of banks, credit unions, and fintech lenders.
The model predicts the Probability of Default (PD) of retail loan applicants and segments them into actionable risk bands.

Methodology

Data Source:
Kaggle: Give Me Some Credit Dataset
Data Preprocessing:
- Imputed missing values
- Handled outliers
- Created raw and scaled datasets
Exploratory Data Analysis (EDA):
- Identified correlations between financial behaviors and defaults
- Addressed class imbalance (~8% defaults)
Feature Engineering:
- Debt-to-Income Ratio (DTI)
- Late payment flags
- Credit activity scores
- Age and income bands
- Log-transformed monthly income
Modeling:
- Trained Logistic Regression and Random Forest classifiers
- Evaluated using AUC-ROC, Confusion Matrix, Precision, Recall
Probability Segmentation:
- Grouped applicants into Risk Bands A–E based on PD

Key Results

Risk Band	Default Rate (%)
A	~2%
E	50%+

Models displayed strong discriminatory power (AUC well above baseline).
Clear monotonicity observed between PD and actual default rates.

Business Interpretation

Risk Band	Recommended Action
A, B	Auto-approval
C	Manual review
D	High APR or secured product
E	Decline application

Deliverables

Credit Scoring Report (PDF):
Detailed methodology, visuals, and business segmentation strategy.
Jupyter Notebook containing preprocessing, EDA, modeling, and evaluation.

This final report and credit scoring system were built with support from ChatGPT (OpenAI), used as a co-pilot for idea structuring, Python logic validation, and formatting refinement. The project reflects my own analytical judgment, assumptions, and execution, but benefited from AI-based structuring and iterative feedback throughout.

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
data		data
notebooks		notebooks
Credit Scoring Report.pdf		Credit Scoring Report.pdf
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Retail Credit Scoring Model – Predicting Loan Defaults

Project Overview

Methodology

Key Results

Business Interpretation

Deliverables

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Retail Credit Scoring Model – Predicting Loan Defaults

Project Overview

Methodology

Key Results

Business Interpretation

Deliverables

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages