OCR-APP

Application Architecture

Overview

This application is built using Flask for the backend and React for the frontend. It integrates with MongoDB for data storage and uses Google Cloud Vision API for OCR (Optical Character Recognition). The application allows users to upload images of Thai ID cards, extract relevant information using OCR, and manage the extracted data through a CRUD interface.

Application Demo Video - https://www.loom.com/share/551da3436b124ec89b23b49790ff8396?sid=295e08f9-5361-448d-b05f-573571881f5e

Code Explanation Video- https://www.loom.com/share/f0501da8a36848ef9ff52107da7ab9d3?sid=70ce2476-b4d5-4993-9d29-5a8b534a102e

Technologies Used

Backend: Flask
Frontend: React
Database: MongoDB
OCR: Google Cloud Vision API
Styling: Tailwind CSS (or Bootstrap, if preferred)
State Management: React Hook Form

Backend

Flask Application

The Flask application handles the following tasks:

OCR Processing:
- Uses Google Cloud Vision API to detect text in uploaded images.
- Extracts relevant information using regular expressions.
CRUD Operations:
- Provides endpoints to create, read, update, and delete OCR data in MongoDB.
CORS:
- Enabled using flask-cors to allow cross-origin requests from the React frontend.

Using this application you can Scan Thai ID cards using Google Cloud Vision API and get following information

Identification Number
Name
Last Name
Date of Birth
Date of Issue
Date of Expiry

To use this application, you will need following things

Cloud Vision Credentials to use the API

Here are the steps in which you can access the application after you have successfully added your credentials path in main.py

cd into the server folder and run - python main.py into the terminal which will start your development server
cd into the source folder and run - npm run dev into the terminal which will start your React client

On the main page, click on no file chosen and select the image, after the image is selected click on Upload. You will see the results on the right panel like this

Click on Save Button to save the details into the database.(MongoDB)

After Saving navigate to /data endpoint which is the OCR Management Console, where you can Create,Update,Delete and Read all the OCR Results available in the Database.

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
public		public
server		server
src		src
.env		.env
.eslintrc.cjs		.eslintrc.cjs
.gitignore		.gitignore
README.md		README.md
components.json		components.json
image-1.png		image-1.png
image.png		image.png
index.html		index.html
jsconfig.json		jsconfig.json
package-lock.json		package-lock.json
package.json		package.json
postcss.config.js		postcss.config.js
tailwind.config.js		tailwind.config.js
vite.config.js		vite.config.js

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

OCR-APP

Application Architecture

Overview

Technologies Used

Backend

Flask Application

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

OCR-APP

Application Architecture

Overview

Technologies Used

Backend

Flask Application

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages