This project implements a CLIP-style model for learning joint image-text embeddings on the MNIST dataset.
• Learns a joint embedding space over digit images and their labels for recognition and representation learning.
• Encodes images with a residual CNN and labels with a lightweight text encoder (a minimal sketch follows below).
• Supports both classification and image-similarity tasks.
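A minimal sketch of how these two encoders might look in PyTorch. The class names, layer sizes, and embedding dimension below are illustrative assumptions, not the repository's actual code:

```python
# Illustrative sketch only; names and dimensions are assumptions.
import torch
import torch.nn as nn
import torch.nn.functional as F

class ResidualBlock(nn.Module):
    """Basic residual block: two 3x3 convs with a skip connection."""
    def __init__(self, channels):
        super().__init__()
        self.conv1 = nn.Conv2d(channels, channels, 3, padding=1)
        self.conv2 = nn.Conv2d(channels, channels, 3, padding=1)

    def forward(self, x):
        return F.relu(x + self.conv2(F.relu(self.conv1(x))))

class ImageEncoder(nn.Module):
    """Residual CNN mapping a 1x28x28 MNIST image to an embedding."""
    def __init__(self, embed_dim=64):
        super().__init__()
        self.net = nn.Sequential(
            nn.Conv2d(1, 32, 3, padding=1), nn.ReLU(),
            ResidualBlock(32),
            nn.MaxPool2d(2),           # 28x28 -> 14x14
            ResidualBlock(32),
            nn.AdaptiveAvgPool2d(1),   # 32x1x1
            nn.Flatten(),
            nn.Linear(32, embed_dim),
        )

    def forward(self, x):
        return self.net(x)

class LabelEncoder(nn.Module):
    """Lightweight text encoder: one learned embedding per digit label."""
    def __init__(self, num_classes=10, embed_dim=64):
        super().__init__()
        self.embedding = nn.Embedding(num_classes, embed_dim)

    def forward(self, labels):
        return self.embedding(labels)
```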
Install dependencies:
pip install -r requirements.txt
Train the model:
python train.py
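Training pairs each image with its digit label and pulls matching embeddings together with a CLIP-style symmetric contrastive loss. The step below is a sketch under that assumption; the function name, temperature value, and batch handling are illustrative, and train.py may differ:

```python
# Sketch of one CLIP-style contrastive training step (assumed, not
# the repository's actual train.py).
import torch
import torch.nn.functional as F

def contrastive_step(image_encoder, label_encoder, images, labels,
                     temperature=0.07):
    # L2-normalize both embeddings so dot products are cosine similarities.
    img_emb = F.normalize(image_encoder(images), dim=-1)
    txt_emb = F.normalize(label_encoder(labels), dim=-1)

    # Pairwise similarity matrix: entry (i, j) compares image i to label j.
    logits = img_emb @ txt_emb.t() / temperature

    # Matching pairs lie on the diagonal; apply symmetric cross-entropy.
    targets = torch.arange(len(images), device=images.device)
    loss = (F.cross_entropy(logits, targets)
            + F.cross_entropy(logits.t(), targets)) / 2

    # Note: with only 10 distinct labels, a batch will contain duplicates,
    # so some off-diagonal "negatives" share the positive's label; a real
    # implementation would need to handle these multiple positives.
    return loss
```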
Run inference:
python inference.py
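At inference time, a digit can be predicted by embedding the image and picking the nearest of the ten label embeddings in the joint space. A sketch assuming the encoders above; inference.py's actual interface may differ:

```python
# Sketch of similarity-based digit classification (assumed interface).
import torch
import torch.nn.functional as F

@torch.no_grad()
def classify(image_encoder, label_encoder, images):
    """Predict digits by nearest label embedding in the joint space."""
    img_emb = F.normalize(image_encoder(images), dim=-1)
    all_labels = torch.arange(10, device=images.device)
    txt_emb = F.normalize(label_encoder(all_labels), dim=-1)
    # Cosine similarity of each image against all 10 label embeddings.
    sims = img_emb @ txt_emb.t()
    return sims.argmax(dim=-1)  # index == predicted digit
```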
Possible future directions:
• Explore larger text embeddings for richer semantic representations.
• Incorporate data augmentation to improve generalization.
• Evaluate on more complex datasets beyond MNIST.