ScratchML

ScratchML is a Python library for implementing machine learning and deep learning models from scratch. The goal of this project is to provide a deeper understanding of how these algorithms work under the hood by building them without relying on high-level libraries like TensorFlow, PyTorch, or Scikit-learn.

Features

Supervised Learning Models (Classical ML)

Regression (Predicting Continuous Values)

Linear Regression (With Gradient Descent & Normal Equation)
Ridge Regression (L2 Regularization)
Lasso Regression (L1 Regularization)
Polynomial Regression

Classification (Predicting Discrete Labels)

Logistic Regression (Binary & Multi-class Classification)
K-Nearest Neighbors (KNN) (Distance-based classification)
Naïve Bayes (Gaussian & Multinomial)
Decision Tree (Entropy, Gini Index)
Random Forest (Ensemble Learning)
Support Vector Machine (SVM) (Hard & Soft Margin, Kernel Trick)

Boosting & Advanced ML

Gradient Boosting (AdaBoost, XGBoost, LightGBM)

Unsupervised Learning Models (No Labels)

Clustering Algorithms

K-Means Clustering (Centroid-based clustering)
Hierarchical Clustering (Agglomerative & Divisive)
DBSCAN (Density-Based Spatial Clustering)

Dimensionality Reduction & Feature Extraction

Principal Component Analysis (PCA)
Autoencoders (A neural network-based unsupervised model)

Deep Learning Models

Basic Neural Networks

Perceptron (The foundation of neural networks)
Multi-Layer Perceptron (MLP) (Backpropagation from scratch)

Computer Vision

Convolutional Neural Networks (CNNs) (With Conv, Pooling, Dropout)

Sequence Models

Recurrent Neural Networks (RNNs) (Basic RNN for time-series/text)
Long Short-Term Memory (LSTMs) (For NLP and sequential tasks)

Advanced Deep Learning

Transformers (Self-Attention, Positional Encoding)

Installation

Clone the repository to your local machine:

git clone https://github.com/your-username/ScratchML.git
cd ScratchML

Ensure you have Python 3.7+ installed along with the required dependencies:

pip install -r requirements.txt

Project Structure

ScratchML/
├── supervised_learning/
│   ├── regression.py       # Contains regression models (Linear, Ridge, Lasso, Polynomial)
│   └── __init__.py         # Package initialization
├── deep_learning/          # (Planned) Deep learning models and utilities
├── tests/                  # Unit tests for the library
├── requirements.txt        # Python dependencies
└── README.md               # Project documentation

Goals

Educational Purpose: This library is designed to help developers and students understand the inner workings of machine learning and deep learning algorithms.
Extendability: The library is modular, making it easy to add new models and features.
Performance: While the focus is on understanding, efforts are made to ensure the models are efficient and scalable.

Contributing

Contributions are welcome! If you'd like to add new models, improve existing ones, or fix bugs, feel free to open a pull request or submit an issue.

License

This project is licensed under the MIT License. See the LICENSE file for details.

Acknowledgments

This project is inspired by the desire to learn and teach the fundamentals of machine learning and deep learning by building models from scratch.

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
__pycache__		__pycache__
supervised_learning		supervised_learning
.gitignore		.gitignore
Linear Regression.ipynb		Linear Regression.ipynb
Neural Network.ipynb		Neural Network.ipynb
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ScratchML

Features

Supervised Learning Models (Classical ML)

Regression (Predicting Continuous Values)

Classification (Predicting Discrete Labels)

Boosting & Advanced ML

Unsupervised Learning Models (No Labels)

Clustering Algorithms

Dimensionality Reduction & Feature Extraction

Deep Learning Models

Basic Neural Networks

Computer Vision

Sequence Models

Advanced Deep Learning

Installation

Project Structure

Goals

Contributing

License

Acknowledgments

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

ScratchML

Features

Supervised Learning Models (Classical ML)

Regression (Predicting Continuous Values)

Classification (Predicting Discrete Labels)

Boosting & Advanced ML

Unsupervised Learning Models (No Labels)

Clustering Algorithms

Dimensionality Reduction & Feature Extraction

Deep Learning Models

Basic Neural Networks

Computer Vision

Sequence Models

Advanced Deep Learning

Installation

Project Structure

Goals

Contributing

License

Acknowledgments

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages