Credit Card Customer Segmentation

Project Overview

This project focuses on segmenting credit card customers based on their spending behavior and demographics. Using clustering techniques such as K-means, Agglomerative Clustering, and Gaussian Mixture Models, the analysis aims to uncover distinct customer groups. The insights generated help support targeted marketing strategies and personalized customer experiences.

Dataset

Dataset Source: Credit Card Dataset

Name	Description
`custID`	Identification of Credit Card holder (Categorical)
`remainBal`	Balance amount left in their account to make purchases
`balF`	How frequently the Balance is updated, score between 0 and 1 (1 = frequently updated, 0 = not frequently updated)
`purDone`	Amount of purchases made from account
`oneOffPur`	Maximum purchase amount done in one-go
`insPur`	Amount of purchase done in installment
`cashAdv`	Cash in advance given by the user
`purF`	How frequently the Purchases are being made, score between 0 and 1 (1 = frequently purchased, 0 = not frequently purchased)
`oneOffPurF`	How frequently Purchases are happening in one-go (1 = frequently purchased, 0 = not frequently purchased)
`insPurF`	How frequently purchases in installments are being done (1 = frequently done, 0 = not frequently done)
`cashAdvF`	How frequently the cash in advance being paid
`cashAdvTRX`	Number of Transactions made with "Cash in Advanced"
`purTRX`	Numbe of purchase transactions made
`creditLim`	Limit of Credit Card for user
`pymtDone`	Amount of Payment done by user
`minPymt`	Minimum amount of payments made by user
`fullPymtPCT`	Percentage of full payment paid by user
`tenure`	Tenure of credit card service for user

Project Objectives

Data Cleaning & Preprocessing
- Rename inconsistent feature names for clarity
- Handle missing values using cold deck imputation (mean substitution)
- Detect and remove outliers using interquartile range (IQR) and Z-score analysis
Exploratory Data Analysis (EDA)
- Compute key statistical summaries: mean, median, variance, and standard deviation
- Perform correlation analysis using heatmaps
- Visualize distributions with histograms and scatter plots
- Detect outliers using boxplots
Machine Learning Models for Customer Segmentation
- Data Normalization: Apply Standard Scaler to standardize feature distributions
- Clustering Algorithms: Implement and compare
  - K-means
  - Agglomerative Clustering
  - Gaussian Mixture Model (GMM)
- Dimensionality Reduction: Use Principal Component Analysis (PCA) for feature compression
- Cluster Validation & Model Evaluation:
  - Elbow Method: Determine optimal cluster count
  - Silhouette Score: Evaluate cluster compactness
  - Davies-Bouldin Index & Calinski-Harabasz Method: Assess clustering performance
- Predictive Modeling: Assign new customers to clusters based on trained segmentation model

Technologies Used

Programming Language: Python
Libraries: pandas, numpy, scikit-learn, matplotlib
Data Visualization Tools: seaborn, plotly

Project Workflow

Data Collection: Import and inspect datasets
Data Cleaning & Preprocessing: Handle missing values, normalize data, and remove outliers
Exploratory Data Analysis (EDA): Visualize distributions, correlations, and patterns
Feature Engineering: Transform variables for better clustering
Model Training: Implement and compare different clustering algorithms
Model Evaluation: Analyze clustering performance using multiple validation metrics
Results Interpretation: Identify customer segments and derive actionable insights

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
ClusteringDataModeling.py		ClusteringDataModeling.py
PreAnalysis.py		PreAnalysis.py
README.md		README.md
dataset.csv		dataset.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Credit Card Customer Segmentation

Project Overview

Dataset

Project Objectives

Technologies Used

Project Workflow

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Credit Card Customer Segmentation

Project Overview

Dataset

Project Objectives

Technologies Used

Project Workflow

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages