Employee Retention Analyzer is a machine-learning (ML) project that uses HR analytics data to predict the likelihood of employee attrition. The model achieves this through a classification task, using the 'left' attribute from the dataset as the target and employee characteristics as features. The project provides HR teams with actionable insights for improving retention strategies.
- Dataset Content
- Business Requirements
- Hypothesis
- Mapping Business Requirements to Data Visualisation and ML Tasks
- ML Business Case
- Epics and User Stories
- Dashboard Design
- Technologies Used
- Testing
- Issues
- Unfixed Bugs
- Deployment
- Credits
- Acknowledgements
## Dataset Content

The dataset is sourced from Kaggle and has been adjusted for this project. Each row represents an employee and each column contains employee attributes. The dataset includes information about:
- Employee satisfaction levels
- Performance evaluation scores
- Number of projects
- Average monthly hours
- Time at company
- Work accidents
- Promotions
- Department and salary level
- Whether they left the company
| Attribute | Information | Type/Units |
|---|---|---|
| satisfaction_level | Employee's satisfaction rating | Float (0-1) |
| last_evaluation | Last performance evaluation score | Float (0-1) |
| number_project | Number of projects assigned | Integer |
| average_monthly_hours | Average monthly work hours | Integer |
| time_spend_company | Years at the company | Integer |
| Work_accident | Whether they had a work accident | Binary (0/1) |
| promotion_last_5years | Whether promoted in last 5 years | Binary (0/1) |
| Departments | Department employee works in | Categorical |
| salary | Salary level | Categorical (low/medium/high) |
| left | Whether the employee left | Binary (0/1) |
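As a quick illustration of the schema above, the snippet below builds a tiny in-memory sample with the same columns and runs the kind of structural checks (shape, class balance) that open the analysis. The values are illustrative only, not rows from the dataset:

```python
import pandas as pd

# Tiny in-memory sample mirroring the dataset schema (illustrative values only)
df = pd.DataFrame({
    "satisfaction_level": [0.38, 0.80, 0.11],
    "last_evaluation": [0.53, 0.86, 0.88],
    "number_project": [2, 5, 7],
    "average_monthly_hours": [157, 262, 272],
    "time_spend_company": [3, 6, 4],
    "Work_accident": [0, 0, 0],
    "promotion_last_5years": [0, 0, 0],
    "Departments": ["sales", "sales", "hr"],
    "salary": ["low", "medium", "low"],
    "left": [1, 0, 1],
})

# Quick structural checks: overall shape and class balance of the target
print(df.shape)
print(df["left"].value_counts())
```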
## Business Requirements

Employee turnover is a significant challenge for organizations, with replacement costs estimated at 1.5-2x the departing employee's salary. Early identification of attrition risks can enable proactive retention strategies.
Business Requirement 1 - The client wants to identify which factors contribute most significantly to employee turnover, focusing on key predictors of departure.
Business Requirement 2 - The client needs a tool to predict whether current employees are at risk of leaving based on their characteristics and behavior patterns.
Business Requirement 3 - The client needs actionable insights and clear intervention triggers for HR to develop targeted retention strategies.
## Hypothesis

Hypothesis 1:
- We suspect that satisfaction level and workload (project count/monthly hours) are the strongest predictors of turnover.
- Validation: A correlation analysis that shows relationship strength between these features and the target 'left'.
Hypothesis 2:
- We suspect that employees with high workload and low satisfaction have the highest departure risk.
- Validation: Analysis of feature importance and interaction effects through ML model evaluation.
Hypothesis 3:
- We suspect that employees in years 1-3 with low salaries have higher attrition rates.
- Validation: Statistical analysis and visualization of departure rates across tenure and salary bands.
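The Hypothesis 3 check can be sketched as a group-by of departure rates across tenure and salary bands. This is a minimal illustration on made-up rows, not the real analysis:

```python
import pandas as pd

# Illustrative sample; the real validation runs on the full HR dataset
df = pd.DataFrame({
    "time_spend_company": [2, 2, 3, 5, 6, 2, 3, 8],
    "salary": ["low", "low", "medium", "high", "high", "low", "low", "medium"],
    "left": [1, 1, 0, 0, 0, 1, 1, 0],
})

# Bucket tenure into early (1-3 years) vs established (4+ years)
df["tenure_band"] = pd.cut(df["time_spend_company"], bins=[0, 3, 100],
                           labels=["1-3 yrs", "4+ yrs"])

# Mean of the binary target per group = departure rate per tenure/salary band
rates = df.groupby(["tenure_band", "salary"], observed=True)["left"].mean()
print(rates)
```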
## Mapping Business Requirements to Data Visualisation and ML Tasks

Business Requirement 1: Data Visualization and Correlation Study
- We need to perform a correlation study to identify key attrition factors
- Pearson correlation for linear relationships in numerical variables
- Spearman correlation for monotonic relationships
- PPS (Predictive Power Score) analysis for categorical variables
- Feature importance analysis from ML model
- This will be done in the Data Visualization and Preparation Epic
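The Pearson and Spearman parts of the study can be sketched with pandas alone; the frame and values below are toy stand-ins for the real dataset, and the PPS step would use the ppscore library:

```python
import pandas as pd

# Toy numeric sample; the study runs on the full dataset
df = pd.DataFrame({
    "satisfaction_level": [0.9, 0.8, 0.3, 0.2, 0.4, 0.85],
    "average_monthly_hours": [150, 160, 280, 300, 270, 155],
    "left": [0, 0, 1, 1, 1, 0],
})

# Pearson captures linear association, Spearman monotonic association
pearson = df.corr(method="pearson")["left"].drop("left")
spearman = df.corr(method="spearman")["left"].drop("left")
print(pearson.sort_values())
print(spearman.sort_values())
# The PPS heatmap for categorical predictors comes from ppscore.matrix(df)
```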
Business Requirement 2: Classification Model
- We need to predict binary outcome (stay/leave)
- Build supervised classification model
- Implement ML pipeline with preprocessing and prediction
- Optimize hyperparameters for best performance
- This will be executed in Model Training Epic
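The steps above can be sketched as a scikit-learn pipeline with preprocessing followed by a hyperparameter grid search. The synthetic data, chosen algorithm and parameter grid are illustrative placeholders, not the project's final configuration:

```python
import numpy as np
import pandas as pd
from sklearn.compose import ColumnTransformer
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import GridSearchCV, train_test_split
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import OneHotEncoder, StandardScaler

# Synthetic stand-in data; the real pipeline is fit on the HR dataset
rng = np.random.default_rng(0)
X = pd.DataFrame({
    "satisfaction_level": rng.random(200),
    "average_monthly_hours": rng.integers(120, 310, 200),
    "salary": rng.choice(["low", "medium", "high"], 200),
})
y = (X["satisfaction_level"] < 0.5).astype(int)  # toy target

# Preprocessing: scale numeric features, one-hot encode categoricals
preprocess = ColumnTransformer([
    ("num", StandardScaler(), ["satisfaction_level", "average_monthly_hours"]),
    ("cat", OneHotEncoder(handle_unknown="ignore"), ["salary"]),
])
pipe = Pipeline([("prep", preprocess),
                 ("model", RandomForestClassifier(random_state=0))])

# Split, then tune hyperparameters with F1 as the scoring metric
X_tr, X_te, y_tr, y_te = train_test_split(X, y, test_size=0.2, random_state=0)
search = GridSearchCV(pipe, {"model__n_estimators": [50, 100]},
                      scoring="f1", cv=3)
search.fit(X_tr, y_tr)
print(search.best_params_, search.score(X_te, y_te))
```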
Business Requirement 3: Actionable Insights
- Identify critical thresholds for key metrics
- Develop clear intervention triggers
- Create visualization dashboard for HR
- This spans both Analysis and Dashboard Development Epics
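One possible shape for the intervention triggers is a small rule-based helper. The threshold values below are hypothetical placeholders; the real triggers should come from the correlation study and the model's feature-importance results:

```python
def retention_flags(satisfaction, monthly_hours, years_at_company, salary):
    """Return a list of hypothetical intervention triggers for one employee.

    All cut-offs are illustrative placeholders, not findings from the study.
    """
    flags = []
    if satisfaction < 0.4:
        flags.append("low satisfaction: schedule a 1:1 check-in")
    if monthly_hours > 250:
        flags.append("overwork: review project load")
    if years_at_company <= 3 and salary == "low":
        flags.append("early-tenure low salary: review compensation")
    return flags

print(retention_flags(0.3, 270, 2, "low"))
```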
## ML Business Case

- We want an ML model to predict whether an employee is likely to leave based on their current attributes and behavior patterns. The target variable 'left' is binary (0: stayed, 1: left).
- We will build a classification model, a supervised model with two-class, single-label output matching the target.
- The model success metrics are:
- At least 95% F1 score on both train and test sets
- High precision to minimize false alarms
- High recall to catch actual flight risks
- The model will be considered a failure if:
- The model fails to achieve 90% F1 score
- False positive rate exceeds 15% (too many false alarms)
- Features aren't interpretable for HR use
- The model output is defined as a flag indicating if an employee is likely to leave and the associated probability.
- The training data contains:
- 4,998 employee records with 9 attributes + target
- Mix of numerical and categorical features
- Data from past employee records including both retained and departed staff
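The success and failure criteria above map directly onto standard scikit-learn metrics. The predictions below are hypothetical, purely to show how F1 and the false positive rate would be computed:

```python
from sklearn.metrics import confusion_matrix, f1_score, precision_score, recall_score

# Hypothetical predictions vs. ground truth (1 = employee left)
y_true = [0, 0, 0, 0, 1, 1, 1, 1, 0, 1]
y_pred = [0, 0, 0, 1, 1, 1, 1, 0, 0, 1]

f1 = f1_score(y_true, y_pred)
precision = precision_score(y_true, y_pred)
recall = recall_score(y_true, y_pred)

# False positive rate = FP / (FP + TN), checked against the 15% ceiling
tn, fp, fn, tp = confusion_matrix(y_true, y_pred).ravel()
fpr = fp / (fp + tn)
print(f"F1={f1:.2f} precision={precision:.2f} recall={recall:.2f} FPR={fpr:.2f}")
```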
## Epics and User Stories

The project was split into 5 Epics based on data analysis and ML tasks, with user stories to enable an agile development methodology.
- User Story - As a data analyst, I can load employee data from local directories to begin analysis.
- User Story - As a data analyst, I can examine data quality to determine necessary preprocessing steps.
- User Story - As a data scientist, I can analyze correlations between features and attrition (Business Requirement 1).
- User Story - As a data analyst, I can handle missing values and outliers to prepare data for modeling.
- User Story - As a data analyst, I can check class balance in the target variable.
- User Story - As a data scientist, I can engineer features to improve model performance.
- User Story - As a data scientist, I can prepare features for ML pipeline implementation.
- User Story - As a data scientist, I can split data appropriately for model training.
- User Story - As a data engineer, I can build ML pipeline with preprocessing steps.
- User Story - As a data engineer, I can select and optimize algorithms for prediction (Business Requirement 2).
- User Story - As a data scientist, I can tune hyperparameters for optimal performance.
- User Story - As a data scientist, I can validate model performance on test data.
- User Story - As a data scientist, I can analyze feature importance for insights (Business Requirement 1).
- User Story - As a user, I can view project overview and business requirements.
- User Story - As a user, I can see hypothesis validation results.
- User Story - As a user, I can input employee data for predictions (Business Requirement 2).
- User Story - As a technical user, I can examine correlation analysis (Business Requirement 1).
- User Story - As a technical user, I can review model performance metrics.
- User Story - As an HR user, I can get clear retention recommendations (Business Requirement 3).
- User Story - As a user, I can access the dashboard through a web interface.
- User Story - As a developer, I can deploy the project following documentation.
## Dashboard Design

- Section 1 - Summary
- Introduction to project goals
- Dataset description and source
- Link to readme
- Section 2 - Business Requirements
- Business context
- Specific requirements
- Expected outcomes
- Present the three project hypotheses
- Show validation results
- Visualize key findings
- Address Business Requirement 1
- Show dataset overview
- Present correlation analysis
- Display PPS heatmap
- Feature distribution analysis
- Key conclusions
- Address Business Requirement 2
- Input widgets for employee data
- Prediction interface
- Risk assessment display
- Performance metrics summary
- Pipeline description
- Feature importance analysis
- Train/test results documentation
## Technologies Used

The technologies used throughout the development are listed below:
- Pandas - Data manipulation and analysis
- Numpy - Numerical computing and array operations
- Matplotlib - Data visualization and plotting
- Seaborn - Statistical data visualization
- Scikit-learn - Machine learning algorithms and tools
- Feature-engine - Feature engineering and selection
- XGBoost - Gradient boosting framework
- Streamlit - Web application framework
- Plotly - Interactive visualizations
- ppscore - Predictive power score calculations
- Joblib - Pipeline persistence
- Git - Version control
- GitHub - Code repository and project management
- VS Code - IDE for development
- Heroku - Application deployment platform
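As an aside on pipeline persistence, a fitted pipeline can be saved with Joblib and reloaded by the dashboard. The toy pipeline, training values and file name below are illustrative:

```python
import joblib
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler

# A toy two-step pipeline standing in for the trained attrition pipeline
pipe = Pipeline([("scale", StandardScaler()),
                 ("model", LogisticRegression())])
# Features: [satisfaction_level, average_monthly_hours]; target: left (0/1)
pipe.fit([[0.2, 150], [0.9, 300], [0.3, 160], [0.8, 280]], [1, 0, 1, 0])

# Persist for reuse in the dashboard (file name is hypothetical)
joblib.dump(pipe, "attrition_pipeline.pkl")
loaded = joblib.load("attrition_pipeline.pkl")
print(loaded.predict([[0.25, 155]]))
```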
## Testing

- Dashboard was manually tested against user stories
- Each feature verified for functionality and usability
As a non-technical user, I can view a project summary that describes the project, dataset and business requirements.
| Feature | Action | Expected Result | Actual Result |
|---|---|---|---|
| Project summary page | View landing page | Clear project overview displayed | Works as expected |
| Navigation | Click through sections | Smooth section transitions | Works as expected |
| Business requirements | View requirements section | Requirements clearly listed | Works as expected |
As a technical user, I can access the correlation analysis and model performance metrics.
| Feature | Action | Expected Result | Actual Result |
|---|---|---|---|
| Correlation page | Navigate to analysis | Display correlation heatmaps | Works as expected |
| Feature importance | View importance plots | Show feature rankings | Works as expected |
| Performance metrics | Check model metrics | Display accuracy scores | Works as expected |
As an HR user, I can input employee data and receive attrition predictions.
| Feature | Action | Expected Result | Actual Result |
|---|---|---|---|
| Prediction interface | Enter employee data | All inputs accept values | Works as expected |
| Run prediction | Click predict button | Show prediction result | Works as expected |
| Risk assessment | View prediction details | Display risk factors | Works as expected |
- All Python code validated against the PEP8 style guide
- Frontend validated for responsiveness
- Data pipeline tested for consistency
## Issues

- Initial deployment of the model failed with error: "'DecisionTreeClassifier' object has no attribute 'monotonic_cst'"
- This occurred due to a version mismatch between local development (scikit-learn 1.5.0) and Heroku's default scikit-learn version
- The solution was to update requirements.txt to specify:
scikit-learn>=1.5.0
## Unfixed Bugs

- No known bugs at time of deployment
- All identified issues have been resolved
## Deployment

- The App live link is: Employee Retention Analyzer
The project was deployed to Heroku using the following steps:
- Within your working directory, ensure there is a setup.sh file containing the following:
mkdir -p ~/.streamlit/
echo "\
[server]\n\
headless = true\n\
port = $PORT\n\
enableCORS = false\n\
\n\
" > ~/.streamlit/config.toml
- Within your working directory, ensure there is a runtime.txt file containing a Heroku-20 stack supported version of Python.
python-3.10.12
- Within your working directory, ensure there is a Procfile file containing the following:
web: sh setup.sh && streamlit run app.py
- Ensure your requirements.txt file contains all the packages necessary to run the Streamlit dashboard.
- Update your .gitignore and .slugignore files with any files/directories that you do not want uploading to GitHub or are unnecessary for deployment.
- Log in to Heroku or create an account if you do not already have one.
- Click the New button on the dashboard and from the dropdown menu select "Create new app".
- Enter a suitable app name and select your region, then click the Create app button.
- Once the app has been created, navigate to the Deploy tab.
- At the Deploy tab, in the Deployment method section select GitHub.
- Enter your repository name and click Search. Once it is found, click Connect.
- Navigate to the bottom of the Deploy page to the Manual deploy section and select main from the branch dropdown menu.
- Click the Deploy Branch button to begin deployment.
- The deployment process should happen smoothly if all deployment files are fully functional. Click the button Open App at the top of the page to access your App.
- If the build fails, check the build log carefully to troubleshoot what went wrong.
If you wish to fork or clone this repository, please follow the instructions below:
- In the top right of the main repository page, click the Fork button.
- Under Owner, select the desired owner from the dropdown menu.
- OPTIONAL: Change the default name of the repository in order to distinguish it.
- OPTIONAL: In the Description field, enter a description for the forked repository.
- Ensure the 'Copy the main branch only' checkbox is selected.
- Click the Create fork button.
- On the main repository page, click the Code button.
- Copy the HTTPS URL from the resulting dropdown menu.
- In your IDE terminal, navigate to the directory you want the cloned repository to be created.
- In your IDE terminal, type `git clone` and paste the copied URL.
- Hit Enter to create the cloned repository.
WARNING: The packages listed in the requirements.txt file are limited to those necessary for the deployment of the dashboard to Heroku, due to the limit on the slug size.
In order to ensure all the correct dependencies are installed in your local environment, run the following command in the terminal:
pip install -r full-requirements.txt
## Credits

- The custom function for checking the effect of data cleaning on distribution was partially taken from the Code Institute "Data Analytics Packages - ML: feature-engine" module.
- David Langer Hyperparameter Tuning video was used to help define the hyperparameter values used for optimisation.
- The custom function for carrying out hyperparameter optimisation was taken from the Code Institute "Data Analytics Packages - ML: Scikit-learn" module.
- The custom function for displaying the confusion matrix and analysing model performance was taken from the Code Institute "Data Analytics Packages - ML: Scikit-learn" module.
- The multi-page class was taken from the Code Institute "Data Analysis & Machine Learning Toolkit" streamlit lessons.
## Acknowledgements

- Thanks to my mentor Mo Shami for his invaluable guidance and detailed feedback throughout this project, which greatly contributed to its successful development and implementation.