AI Optical Character Recognition

This repository contains the Matlab code that uses 2 custom built machine learning algorithms (K-nearest Neighbour) to perform Optical Character Recognition. It also includes the EMNIST dataset of 26,000 images and labels on which to train and test the model. A 'summary.pdf' file has been provided to discuss my findings.

Running the Code -

The code loads the data in from the EMNIST dataset. It splits the data into 50% for training and 50% for testing to avoid overfitting. This means each model runs 13,000 Images so can take several minutes. The models used are -

Custom K-nearest Neighbour using Euclidean Distance.
Custom K-nearest Neighbour using Manhattan Distance.
MatLab K-nearest Neighbour
MatLab SVM for Multiclass

All that is needed to run the code is the main.m file and the dataset-letters file.

Files Created -

The code will create a folder called 'Results' in the current directory and create files 'Dataset.png' and 'Confusion.png'.

'Dataset.png' - Provides an image of a small sample of EMNIST Images and labels.

'Confusion.png' - Provides confusion charts of the 4 models once they have finished running

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
.gitignore		.gitignore
README.md		README.md
dataset-letters.mat		dataset-letters.mat
main.m		main.m
summary.pdf		summary.pdf

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

AI Optical Character Recognition

Running the Code -

Files Created -

About

Uh oh!

Languages

JamesSharrock/AI-OCR

Folders and files

Latest commit

History

Repository files navigation

AI Optical Character Recognition

Running the Code -

Files Created -

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Languages