Skip to content

Chracker24/JEE_Dropout_ML_Project

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

82 Commits
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

JEET : AI Powered JEE and NEET Dropout After Class 12 Prediction app

Overview

JEET is a Machine Learning project designed to predict the likelihood of students dropping out after Class 12, focusing on those preparing for or appearing in the Join Entrance Examination (JEE) and/or National Entrance cum Eligibility Test (NEET) in India. BY analysing academic and socio-economic data, JEET helps educators, counselors, and students to identify at-risk indivuiduals for timely intervention.

Structure

JEE_Dropout_ML_Project/
β”œβ”€β”€ .gitignore
β”œβ”€β”€ LICENSE
β”œβ”€β”€ Plan.md
β”œβ”€β”€ README.md
β”œβ”€β”€ 01_Data/
β”‚   β”œβ”€β”€ 01_raw/
β”‚   β”‚   └── JEE_Dropout_After_Class_12.csv
β”‚   β”œβ”€β”€ 02_Cleaned and Engineered/
β”‚   β”‚   β”œβ”€β”€ Feature_Engineered.csv
β”‚   β”‚   └── JEE_Dropout_Cleaned.csv
β”‚   └── 03_final/
β”‚       └── JEE_Dropout_Final.csv
β”œβ”€β”€ 02_Data Analysis/
β”‚   β”œβ”€β”€ 01_Exploration.ipynb
β”‚   β”œβ”€β”€ 02_Cleaning.ipynb
β”‚   β”œβ”€β”€ 03_Feature_Engineering.ipynb
β”‚   └── 04_Merging_Final.ipynb
β”œβ”€β”€ 03_notebooks/
β”‚   └── Prototypes_models.ipynb
β”œβ”€β”€ 04_src/
β”‚   β”œβ”€β”€ evaluation.py
β”‚   └── jee_dropout_model.py
β”œβ”€β”€ 05_models/
β”‚   └── Model_JEET.pkl
β”œβ”€β”€ 06_deployment/
β”‚   β”œβ”€β”€ Home.py
β”‚   β”œβ”€β”€ requirements.txt
β”‚   β”œβ”€β”€ model/
β”‚   β”‚   └── JEET.pkl
β”‚   └── pages/
β”‚       └── Data.py
β”‚       └── JEET.py
β”œβ”€β”€ gitignore
β”œβ”€β”€ LICENSE
└── README.md

Features

  • Predicts dropout risk using student academic and socio-economic data
  • Handles imbalanced data with SMOTE for better accuracy
  • User-friendly web app built with Streamlit
  • Enables educators and students to take data-driiven actions

Dataset

includes features such as

  • Academic scores (Class 10, Class 12, JEE/NEET scores)
  • Attendance records
  • Socio-economic factors (family income, parent's education)
  • Target Labele: dropout status (Yes/No)

Installation

1. Clone the Repository

git clone https://github.com//Chracker24/JEE_Dropout_ML_Project.git
cd JEE_Dropout_ML_Project

2. (Optional) Create a virtual environment

python -m venv venv
source venv/Scripts/activate   #MacOS: venv/bin/activate

3. Install dependencies

pip install -r requirements.txt

Usage

  1. Navigate to Home.py in 06_deployment
  2. Run the Streamlit application using Bash
streamlit run Home.py
  1. Input stuudent details via the web interface to get dropout risk predictions
  2. Talk with Google Gemini powered JEET chatbot about your shortcomings and help

Methodology

  • Data Cleaning, encoding cateogorical variables, and feature scaling and engineering
  • Addressed class imbalance using SMOTE
  • Trained Random Forest Classifier as the main model
  • Evaluated with accuracy, precision, recall, F1-Score and ROC-AUC
  • used SHAP for model interpretability

Results

  • Achieved approximately 73% accuracy
  • This is due to presence of leaky data that got removed and will be fixed in later iterations with better data
  • Nonetheless, Improved recall with data balancing
  • Important predictors : socio-economic status, peer pressure and mental health

Future work and iterations

  • Incorporate psychological and motivational data
  • Explore advanced models (XGBoost, deep learning)
  • Add personalized intervention suggestions
  • Expand dataset for broader applicability and fix leaky data situation

Tech Stack

  • Python, scikit-learn
  • imbalanced-learn (SMOTE, Pipeline)
  • pandas, numpy, seaborn, matplotlib
  • SHAP (Interpretabiility)
  • Streamlit (deployment)

Contributors

Christy Chovalloor - Software Engineering Student, Queen's University Belfast Linkedin Github

Documentation

Documentation will be up soon and I will surely notify through my linkedin. Stay Tuned!

License

MIT License

Contact

For questions or collaboration, contact through email : Chr2412@hotmail.com Email me

About

πŸŽ“ Predicts JEE dropout risk using machine learning and provides GPT-powered, personalized advice β€” built for students, schools, and real-world impact.

Topics

Resources

License

Stars

Watchers

Forks

Contributors

Languages