This repository contains implementations of various machine learning algorithms as part of the ML Bootcamp at IIT (ISM) Dhanbad.
This project is my submission for WOC 5.0 in the Machine Learning division. The training data for the models is stored in the Train_data folder, and the data on which predictions are made is stored in the Test_data folder. The final trained parameters for all the models are saved in .npy and .npz formats under the Trained_Model_Parameters folder to save time during model loading.
## Linear Regression

- Model:
  $f_{w,b}(x^{(i)}) = wx^{(i)} + b$
- Cost Function: Mean Squared Error
  $J(w,b) = \frac{1}{2m} \sum_{i=1}^{m} (f_{w,b}(x^{(i)}) - y^{(i)})^2$
- Training: Gradient Descent
  $w = w - \alpha \frac{\partial J}{\partial w}, \quad b = b - \alpha \frac{\partial J}{\partial b}$
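A minimal NumPy sketch of what this training loop can look like; the function name, argument names, and defaults here are illustrative, not the repository's actual code:

```python
import numpy as np

def gradient_descent(X, y, alpha=0.01, epochs=1000):
    """Batch gradient descent for f(x) = w.x + b with the MSE cost."""
    m, n = X.shape
    w = np.zeros(n)
    b = 0.0
    for _ in range(epochs):
        preds = X @ w + b        # f_{w,b}(x^(i)) for every example
        err = preds - y          # residuals
        dw = (X.T @ err) / m     # dJ/dw for the MSE cost above
        db = err.sum() / m       # dJ/db
        w -= alpha * dw
        b -= alpha * db
    return w, b
```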
## Polynomial Regression

- Model:
  $f_{w,b}(x^{(i)}) = w_1 x^{(i)} + w_2 (x^{(i)})^2 + w_3 (x^{(i)})^3 + \cdots + b$
- Cost Function: Mean Squared Error, as in Linear Regression
- Training: Gradient Descent
  $w = w - \alpha \frac{\partial J}{\partial w}, \quad b = b - \alpha \frac{\partial J}{\partial b}$
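Since only the feature mapping changes, the same gradient-descent loop can be reused after expanding the input into powers. A sketch, with `polynomial_features` and the default degree as assumptions:

```python
import numpy as np

def polynomial_features(x, degree=3):
    """Map a 1-D input x to columns [x, x^2, ..., x^degree]."""
    return np.column_stack([x ** d for d in range(1, degree + 1)])

# Usage: expand the features, then train exactly as in linear regression.
# Normalizing the expanded columns first (see Techniques below) keeps the
# higher powers from dominating the gradients.
# X_poly = polynomial_features(x_train, degree=3)
# w, b = gradient_descent(X_poly, y_train)
```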
## Logistic Regression

- Model: Sigmoid function for binary classification, Softmax for multi-class
  $f_{w,b}(x^{(i)})_j = \frac{e^{w_j \cdot x^{(i)} + b_j}}{\sum_{k=1}^{n} e^{w_k \cdot x^{(i)} + b_k}}$
- Loss Function: Cross-Entropy
  $L(f_{w,b}(x^{(i)}), y^{(i)}) = -\ln(f_{w,b}(x^{(i)})_j) \quad \text{where} \quad j = y^{(i)}$
- Training: Gradient Descent
  $w = w - \alpha \frac{\partial J}{\partial w}, \quad b = b - \alpha \frac{\partial J}{\partial b}$
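A sketch of the softmax and cross-entropy computations above; the function names are assumptions, and subtracting the row maximum before exponentiating is a standard numerical-stability trick, not something the source specifies:

```python
import numpy as np

def softmax(z):
    """Row-wise softmax; shifting by the row max avoids overflow in exp."""
    z = z - z.max(axis=1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=1, keepdims=True)

def cross_entropy(probs, y):
    """Mean of -ln(p_j) where j = y[i] is the true class index."""
    m = probs.shape[0]
    return -np.log(probs[np.arange(m), y] + 1e-12).mean()

# Forward pass for multi-class logistic regression:
# probs = softmax(X @ W + b)   # W: (n_features, n_classes), b: (n_classes,)
# loss  = cross_entropy(probs, y)
```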
## Neural Networks

- Architecture: Multi-layer perceptron with ReLU activation for the hidden layers and a problem-specific activation for the output layer.
- Forward Propagation:
  $a_j^{[L]} = g(w_j^{[L]} \cdot a^{[L-1]} + b_j^{[L]})$
- Training: Backpropagation
  $w = w - \alpha \frac{\partial J}{\partial w}, \quad b = b - \alpha \frac{\partial J}{\partial b}$
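A sketch of the forward pass described above, assuming the parameters are stored as a list of `(W, b)` tuples (an assumption about layout, not the repository's format); backpropagation then applies the same update rule to every layer's `W` and `b`:

```python
import numpy as np

def relu(z):
    """Hidden-layer activation: max(0, z) elementwise."""
    return np.maximum(0.0, z)

def forward(X, params, out_activation):
    """Forward propagation: a[L] = g(W[L] @ a[L-1] + b[L]), layer by layer."""
    a = X
    *hidden, last = params            # params: list of (W, b) tuples
    for W, b in hidden:
        a = relu(a @ W + b)           # ReLU in every hidden layer
    W, b = last
    return out_activation(a @ W + b)  # e.g. identity, sigmoid, or softmax

# Usage (shapes assumed): out = forward(X, [(W1, b1), (W2, b2)], softmax)
```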
## K-Nearest Neighbors (KNN)

- Distance Metric: Squared Euclidean distance
  $d(x,y) = \sum_{i=1}^{m} (x_i - y_i)^2$
- Classification: Majority vote among the k nearest neighbors
- Regression: Average of the k nearest neighbors
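A sketch of prediction for a single query point under this metric; `knn_predict` and its arguments are illustrative names, and `X_train`/`y_train` are assumed to be NumPy arrays:

```python
import numpy as np
from collections import Counter

def knn_predict(X_train, y_train, x, k=5, task="classification"):
    """Predict for one query x using squared Euclidean distance."""
    d = ((X_train - x) ** 2).sum(axis=1)    # d(x, y) = sum (x_i - y_i)^2
    nearest = y_train[np.argsort(d)[:k]]    # labels of the k closest points
    if task == "classification":
        return Counter(nearest).most_common(1)[0][0]  # majority vote
    return nearest.mean()                             # average for regression
```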
## Libraries and Tools

- Python
- Numpy
- Pandas
- Matplotlib
- Jupyter/Google Colab Notebooks
## Techniques Used

- Data Splitting: The training data is split into training and cross-validation sets, and comparing the costs $J_{train}$ and $J_{cv}$ is used to diagnose underfitting and overfitting (see the sketch after this list).
- Feature Engineering: Normalization using Z-score
  $z = \frac{x - \mu}{\sigma}$
- Regularization: An L2 penalty is added to the cost function to prevent overfitting
  $J(w,b) = \frac{1}{2m} \sum_{i=1}^{m} (f_{w,b}(x^{(i)}) - y^{(i)})^2 + \frac{\lambda}{2m} \sum_{j=1}^{n} w_j^2$
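A sketch covering all three techniques; the helper names, split fraction, and random seed are assumptions for illustration, not the repository's code:

```python
import numpy as np

def train_cv_split(X, y, cv_fraction=0.2, seed=0):
    """Shuffle, then split into training and cross-validation sets."""
    rng = np.random.default_rng(seed)
    idx = rng.permutation(len(X))
    n_cv = int(len(X) * cv_fraction)
    cv, tr = idx[:n_cv], idx[n_cv:]
    return X[tr], y[tr], X[cv], y[cv]

def zscore_normalize(X):
    """z = (x - mu) / sigma, per feature; assumes no constant columns."""
    mu, sigma = X.mean(axis=0), X.std(axis=0)
    return (X - mu) / sigma, mu, sigma

def regularized_cost(X, y, w, b, lam):
    """MSE cost plus the L2 penalty (lambda / 2m) * sum(w_j^2)."""
    m = len(y)
    err = X @ w + b - y
    return (err @ err) / (2 * m) + lam * (w @ w) / (2 * m)
```

Evaluating `regularized_cost` on both splits gives $J_{train}$ and $J_{cv}$: a large gap between them suggests overfitting, while both being high suggests underfitting.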
## Model Accuracies

- Linear Regression: 84.29%
- Polynomial Regression: 99.99999999984027%
- Logistic Regression: 87.47%
- KNN: 83.33%
- Neural Network (Linear): 84.29%
- Neural Network (Polynomial): 99.995%
- Neural Network (Classification): 85.6%