GitHub - kylianmthr/OCR

A lightweight Optical Character Recognition (OCR) engine built from the ground up using only NumPy. This project was created to dive into the fundamentals of Artificial Intelligence without relying on heavy frameworks like PyTorch or TensorFlow.

🚀 Overview

This implementation focuses on the classic MNIST dataset (handwritten digits). It uses a custom-built neural network architecture with a Softmax activation layer for multi-class classification.

Key features:

Zero deep learning frameworks: Pure mathematical implementation using NumPy.
Softmax integration: For robust probability distribution over the 10 digit classes.
Fast environment: Managed with uv for ultra-fast dependency resolution.

🛠️ Installation

This project uses uv for Python package management.

# Clone the repository
git clone https://github.com/kylianmthr/OCR/
cd OCR

# Install dependencies and setup environment
make install

📂 Usage

Training the Model

You can train the model using the raw MNIST binary files:

python main.py --train <path_to_train_images> <path_to_train_labels>

Alternatively, if configured in your Makefile:

make train

Inference (Prediction)

To run the OCR on a specific image, ensure your input is a 28x28 PNG file.

make exec
or
python main.py --exec <path_to_weights.npy> <path_to_image.png>

📊 Performance

Note

This project was developed for educational purposes to understand the "math behind the magic." While functional, the accuracy is not meant to compete with SOTA (State-Of-The-Art) convolutional models, but rather to demonstrate the feasibility of a "from scratch" approach.

📝 To-Do / Improvement ideas

Add Convolutional layers (CNN) also from scratch.
Implement data augmentation to improve accuracy.
Add a visualization tool for the weight matrices.

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
.gitignore		.gitignore
.python-version		.python-version
Makefile		Makefile
README.md		README.md
main.py		main.py
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🚀 Overview

🛠️ Installation

📂 Usage

Training the Model

Inference (Prediction)

📊 Performance

📝 To-Do / Improvement ideas

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

🚀 Overview

🛠️ Installation

📂 Usage

Training the Model

Inference (Prediction)

📊 Performance

📝 To-Do / Improvement ideas

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages