MNIST digit recognizer using Multilayer Perceptron in NumPy

A from-scratch NumPy implementation of a multilayer perceptron trained on MNIST, built to understand backpropagation at the mathematical level.

Accuracy: ~93% on the MNIST test set

Why This Exists

The neural network is trained using backpropagation and batch gradient descent, without relying on high-level frameworks that hide the math. TensorFlow is used only to load the MNIST dataset. Code is written to maximize readability over performance.

This series by 3b1b on neural networks was inspiration for this project.

Model Architecture

Layer	Details
Input	784 neurons (28×28 flattened image)
Hidden	3 hidden layers, 20 neurons each
Output	10 neurons, output values in range [0,1]

Screenshots

Sample digit prediction and confusion matrix after training.

Usage

Make a virtual environment and install requirements given in requirements.txt

python3 -m venv .venv
pip install -r requirements.txt

Run train.py. It will save the data (weights and biases) to brain.npz (database is downloaded automatically using TensorFlow) After training, run usage.py. This can be used to find accuracy, confusion matrix and sample predictions.

The math

Weights and biases are randomly initialised. The sigmoid function is used as the activation function for all layers. Data from MNIST database is loaded as NumPy arrays, flattened and normalized. Cost is calculated using Mean Squared Error (MSE).

Limitations

Uses sigmoid instead of ReLU.
Uses MSE instead of cross-entropy
Not optimized for performance

For notes on the calculus (derivatives, chain rule, and cost functions) used in this project, please refer to the docs/ folder.

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
docs		docs
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt
train.py		train.py
usage.py		usage.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

MNIST digit recognizer using Multilayer Perceptron in NumPy

Why This Exists

Model Architecture

Screenshots

Usage

The math

Limitations

Make sure to use a LaTeX compatible viewer (like obsidian or VS Code).

About

Uh oh!

Languages

License

ameyakakade/mnist-mlp-numpy

Folders and files

Latest commit

History

Repository files navigation

MNIST digit recognizer using Multilayer Perceptron in NumPy

Why This Exists

Model Architecture

Screenshots

Usage

The math

Limitations

Make sure to use a LaTeX compatible viewer (like obsidian or VS Code).

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Languages