OCR Scanner App

A Flutter mobile application that uses camera-based OCR (Optical Character Recognition) to extract numbers from images. The app provides a live camera preview with an overlay guide, captures images, processes them to extract numeric content, and stores the results with a complete history management system.

Features

Enhanced Camera Integration

Live camera preview with animated rectangle overlay guide
Optimized rectangular frame (85% width, 30% height) for better number scanning
Animated semi-transparent mask with gradient borders and pulse effects
Enhanced capture button with haptic feedback and smooth animations
Comprehensive camera permissions handling with guided user flow
Graceful handling of all permission states (granted, denied, permanently denied)
App lifecycle management with automatic permission re-checking
Refresh functionality for users returning from device settings

Advanced Image Processing & OCR

Crops captured images to exact rectangle bounds in pixels with precise coordinate mapping
OCR functionality to extract digits (0-9) from cropped images using Google ML Kit
Handles number formats including decimals (e.g., 123.45)
Returns extracted numbers as an array of strings in reading order (top-to-bottom, left-to-right)
Filters out non-numeric content from OCR results using regex patterns
Always saves cropped images regardless of OCR success for complete scan history

Enhanced Data Management & UI

Stores scan results with unique ID, timestamp, and extracted numbers array
Saves cropped image thumbnails for preview (always saved, even with no OCR results)
Modern, card-based history list screen with gradient icons and improved typography
Comprehensive detail screen with enhanced visual design and copy functionality
Beautiful empty states with call-to-action buttons
Improved error handling with user-friendly messages and retry options
Bulk operations (clear all scans) with confirmation dialogs

Premium User Experience

Intuitive camera permission flow with educational messaging
Comprehensive failure scenario handling with guided recovery options
Optimized app performance with smooth 60fps animations
Modern Material Design 3-inspired UI with gradient effects
Haptic feedback for better interaction feel
Enhanced copy-to-clipboard functionality with visual confirmation
Floating snackbars with action buttons for improved UX
App lifecycle management for seamless camera transitions

Technical Implementation

Libraries Used

camera: ^0.10.5+9 - Camera functionality and live preview
permission_handler: ^11.3.1 - Runtime permission management
google_mlkit_text_recognition: ^0.13.0 - OCR text recognition
path_provider: ^2.1.3 - File system path management
sqflite: ^2.3.3+1 - Local SQLite database storage
image: ^4.1.7 - Image processing and cropping
path: ^1.8.3 - Path manipulation utilities

Architecture

The app follows a clean architecture pattern with separation of concerns:

Models: Data structures for scan results
Services: Business logic for OCR processing and database operations
Screens: UI components for different app screens
Widgets: Reusable UI components like camera overlay

Rectangle-to-Image Coordinate Mapping

The app uses a fixed rectangle overlay that covers 85% of the screen width and 30% of the screen height, centered on the screen. The coordinate mapping has four parts:

Screen Coordinates: Rectangle position calculated as percentages of screen dimensions
Image Coordinates: Converted to pixel coordinates based on captured image dimensions
Cropping: Uses relative coordinates (0.0-1.0) that are then mapped to actual pixel values
Bounds Checking: Ensures crop coordinates stay within image boundaries

```
// Rectangle dimensions as fractions of the screen
const double rectWidth = 0.85;  // 85% of screen width
const double rectHeight = 0.30; // 30% of screen height
const double rectX = (1.0 - rectWidth) / 2;   // Centered horizontally
const double rectY = (1.0 - rectHeight) / 2;  // Centered vertically

// Convert to pixel coordinates.
// cropX, cropY, cropWidth and cropHeight are the relative values (0.0-1.0)
// passed to the crop step, i.e. rectX, rectY, rectWidth and rectHeight above.
int x = (cropX * image.width).round();
int y = (cropY * image.height).round();
int width = (cropWidth * image.width).round();
int height = (cropHeight * image.height).round();
```
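
The mapping and bounds check described above can be sketched in plain Dart as follows. This is a minimal illustration, not the app's actual code: `CropRect` and `mapToPixels` are hypothetical names, and the sketch assumes the captured image has the same orientation and aspect ratio as the preview, so that screen fractions carry over directly to image fractions. It also assumes positive image dimensions.

```dart
import 'dart:math' as math;

/// Hypothetical value type holding a crop region in image pixels.
class CropRect {
  final int x, y, width, height;
  const CropRect(this.x, this.y, this.width, this.height);
}

/// Restricts [value] to the inclusive range [low]..[high].
int clampInt(int value, int low, int high) =>
    math.max(low, math.min(value, high));

/// Maps a rectangle given as fractions (0.0-1.0) of the frame onto an image
/// of [imageWidth] x [imageHeight] pixels, clamping so the result stays
/// inside the image.
CropRect mapToPixels({
  required double relX,
  required double relY,
  required double relWidth,
  required double relHeight,
  required int imageWidth,
  required int imageHeight,
}) {
  // Scale the relative origin to pixels and keep it inside the image.
  final x = clampInt((relX * imageWidth).round(), 0, imageWidth - 1);
  final y = clampInt((relY * imageHeight).round(), 0, imageHeight - 1);

  // Scale the size, then shrink it if it would run past the right or
  // bottom edge of the image.
  final width = clampInt((relWidth * imageWidth).round(), 1, imageWidth - x);
  final height =
      clampInt((relHeight * imageHeight).round(), 1, imageHeight - y);

  return CropRect(x, y, width, height);
}
```

With the rectangle constants from this section (0.075, 0.35, 0.85, 0.30) and a 1000 x 2000 pixel image, the sketch yields a crop at (75, 700) with size 850 x 600. If the camera delivers a rotated image or a different aspect ratio than the preview, an extra transform would be needed before this step.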

OCR Implementation

The OCR implementation uses Google ML Kit's text recognition:

Text Recognition: Processes the cropped image to extract all text
Text Block Sorting: Sorts detected text blocks by position (top-to-bottom, left-to-right)
Number Extraction: Uses regex pattern \d+\.?\d* to extract numbers including decimals
Result Filtering: Filters out non-numeric content and empty matches
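
The sorting and extraction steps above can be illustrated without ML Kit. The sketch below is a stand-in built on assumptions: `RecognizedBlock` is a hypothetical class carrying only a block's text and the top-left corner of its bounding box, and `extractNumbers` is an illustrative name rather than the app's own function. It applies the pattern `\d+\.?\d*` quoted above.

```dart
/// Hypothetical stand-in for an OCR text block: its text plus the
/// top-left corner of its bounding box, in pixels.
class RecognizedBlock {
  final String text;
  final double left, top;
  const RecognizedBlock(this.text, this.left, this.top);
}

/// Pattern quoted in this README: one or more digits, an optional dot,
/// then zero or more digits.
final RegExp numberPattern = RegExp(r'\d+\.?\d*');

/// Returns the numbers found in [blocks] as strings, in reading order
/// (top-to-bottom, then left-to-right).
List<String> extractNumbers(List<RecognizedBlock> blocks) {
  // Sort a copy so the caller's list is left untouched.
  final sorted = [...blocks]..sort((a, b) {
      final byTop = a.top.compareTo(b.top);
      return byTop != 0 ? byTop : a.left.compareTo(b.left);
    });

  final numbers = <String>[];
  for (final block in sorted) {
    // Non-numeric text is never matched, so it is filtered out implicitly.
    for (final match in numberPattern.allMatches(block.text)) {
      final value = match.group(0);
      if (value != null && value.isNotEmpty) numbers.add(value);
    }
  }
  return numbers;
}
```

Two caveats apply. The pattern also accepts a trailing dot, so the text `123.` is returned as `123.`; a stricter alternative would be `\d+(?:\.\d+)?`. Comparing exact `top` values treats blocks on the same visual line as separate rows when their tops differ by even a pixel, so a real implementation may want a row tolerance.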

Database Schema

SQLite database with a single table for scan results:

```
CREATE TABLE scan_results(
  id TEXT PRIMARY KEY,
  timestamp INTEGER NOT NULL,
  extractedNumbers TEXT NOT NULL,  -- Comma-separated string
  imagePath TEXT
)
```
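
Because `extractedNumbers` is a single TEXT column, the list has to be joined on write and split on read. The following is a hedged sketch of that round trip in plain Dart. The `ScanResult` class shape and the `toMap`/`fromMap` names are assumptions chosen to line up with the columns above, not the app's confirmed model, and storing the timestamp as epoch milliseconds is likewise an assumption (the schema only says INTEGER).

```dart
/// Hypothetical model mirroring the scan_results columns.
class ScanResult {
  final String id;
  final DateTime timestamp;
  final List<String> extractedNumbers;
  final String? imagePath;

  const ScanResult({
    required this.id,
    required this.timestamp,
    required this.extractedNumbers,
    this.imagePath,
  });

  /// Converts to a row map keyed by column name. The timestamp is stored
  /// as epoch milliseconds (an assumption, see the note above).
  Map<String, Object?> toMap() => {
        'id': id,
        'timestamp': timestamp.millisecondsSinceEpoch,
        'extractedNumbers': extractedNumbers.join(','),
        'imagePath': imagePath,
      };

  /// Rebuilds a result from a row map of the same shape.
  factory ScanResult.fromMap(Map<String, Object?> row) {
    final joined = row['extractedNumbers'] as String;
    return ScanResult(
      id: row['id'] as String,
      timestamp: DateTime.fromMillisecondsSinceEpoch(row['timestamp'] as int),
      // ''.split(',') yields [''], so an empty column maps to an empty list.
      extractedNumbers: joined.isEmpty ? <String>[] : joined.split(','),
      imagePath: row['imagePath'] as String?,
    );
  }
}
```

A map of this shape is what `sqflite` accepts for inserts and returns from queries. Joining on a comma is safe here only because values matched by the number pattern never contain a comma; formats such as `1,234.5` would need a different separator or a JSON-encoded column.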

Setup Instructions

Prerequisites

Flutter SDK (>=3.4.0)
Android Studio / Xcode for device deployment
Physical device (camera functionality requires real device)

Android Setup

Clone the repository
```
git clone <repository-url>
cd ocr
```
Install dependencies
```
flutter pub get
```

Android permissions (already configured in android/app/src/main/AndroidManifest.xml):

```
<uses-permission android:name="android.permission.CAMERA" />
<uses-permission android:name="android.permission.WRITE_EXTERNAL_STORAGE" />
<uses-permission android:name="android.permission.READ_EXTERNAL_STORAGE" />
```

Run on Android device
```
flutter run
```

iOS Setup

iOS permissions (already configured in ios/Runner/Info.plist):

```
<key>NSCameraUsageDescription</key>
<string>This app needs camera access to scan numbers from images</string>
<key>NSPhotoLibraryUsageDescription</key>
<string>This app needs photo library access to save scanned images</string>
```

Run on iOS device
```
flutter run
```

Build for Release

Android APK:
```
flutter build apk --release
```

iOS IPA:
```
flutter build ipa --release
```

Known Limitations

OCR Accuracy: Recognition accuracy depends on image quality, lighting conditions, and text clarity
Number Format Support: Currently optimized for standard numeric formats; may have issues with unusual number representations
Language Support: Optimized for English numeric characters
Performance: OCR processing time varies based on image complexity and device performance
Storage: Cropped images are stored locally and may accumulate over time
Camera Resolution: Fixed to high resolution, which may impact performance on older devices

Potential Improvements

Enhanced OCR:
- Support for more number formats and currencies
- Confidence scoring for extracted numbers
- Multiple OCR engine fallbacks
User Experience:
- Manual crop adjustment interface
- Batch processing of multiple images
- Export functionality (CSV, JSON)
- Cloud backup and sync
Performance:
- Background processing for OCR
- Image compression optimization
- Caching mechanisms
Features:
- Text-to-speech for extracted numbers
- Calculator integration
- Custom rectangle size adjustment
- Multiple language support

Demo Link

YouTube: https://youtube.com/shorts/vApFy5k_seM?feature=share
