Multivariate-Linear-Regression

This project implements a multivariate linear regression model from scratch using Python and NumPy, and compares it against scikit-learn implementations. This walks through the entire machine learning pipeline, including data exploration, feature scaling, gradient descent optimization, convergence analysis, prediction, and model evaluation.

Main Concepts Covered

This project focuses on building strong intuition for the mathematical and algorithmic foundations of linear regression:

Multivariate linear regression hypothesis
Feature scaling (mean normalization & standardization)
Cost function (Mean Squared Error)
Gradient computation (∂J/∂W, ∂J/∂b)
Gradient descent optimization
Learning rate and convergence behavior
Residual analysis and error interpretation
Model evaluation metrics (R², MSE, RMSE, MAE)
Comparison with scikit-learn implementations

Tech Stack

Python
NumPy
Pandas
Matplotlib & Seaborn
scikit-learn
Jupyter Notebook

Repository Structure

.
├── multiple_regression.ipynb   # Complete implementation and analysis
├── README.md              # Project documentation

How to Run

Clone the repository:

git clone https://github.com/<your-username>/<repo-name>.git
cd <repo-name>

Install dependencies:

pip install numpy pandas matplotlib seaborn scikit-learn

Open the notebook:
```
jupyter notebook enhance_linearR.ipynb
```
Run the notebook cells sequentially.

Model Workflow

Data creation & exploration
Visualization using pair plots and correlation heatmaps
Feature scaling to improve gradient descent convergence
Custom gradient descent training
Convergence analysis using cost vs iterations
Predictions on unseen inputs
Comparison with scikit-learn LinearRegression & SGDRegressor
Evaluation using standard regression metrics

Results & Observations

Feature scaling significantly improves gradient descent stability
Custom implementation closely matches scikit-learn results
Convergence curves provide insight into optimization behavior
Residual plots help validate linear model assumptions

License

This project is intended for educational purposes. A license can be added later if the repository is extended or shared for reuse.

Author

Abhi Learning-focused Machine Learning & Python projects

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
README.md		README.md
multiple_regression.ipynb		multiple_regression.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Multivariate-Linear-Regression

Main Concepts Covered

Tech Stack

Repository Structure

How to Run

Model Workflow

Results & Observations

License

Author

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Multivariate-Linear-Regression

Main Concepts Covered

Tech Stack

Repository Structure

How to Run

Model Workflow

Results & Observations

License

Author

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages