A Windows desktop application built with C# and .NET that automates the extraction of Terminal IDs (TIDs) from POS receipts in PDF documents.
- Batch PDF Processing: Select a folder and process multiple PDFs at once.
- Real-time Progress Tracking: A visual Progress Bar (0-100%) to keep you informed.
- Intelligent OCR: Powered by Tesseract to recognize text patterns within images.
- Direct Excel Export: Generates an organized report including Filename, Page Number, and TID Status.
- Ready-to-Download: Instant access to the results file upon completion.
Upon starting the app, you are greeted with a clean interface to select your source directory.

The system extracts pages, converts them to images, and runs the OCR engine. You can monitor the status via the progress bar.

Once processing finishes, you are notified and a download button appears so you can open the generated Excel report.

- Scanning: The app identifies all `.pdf` files in the selected folder.
- Conversion: Each PDF page is rendered into a high-resolution image.
- Extraction: The OCR engine scans for the keyword "TID" followed by numerical patterns.
- Logging: Results are stored in memory and then written to an Excel sheet using `ClosedXML`.
- Status Reporting: If a TID is unreadable, it is marked as `Not Found` for manual review.
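
The scanning, extraction, and status steps above can be sketched with the standard library alone. This is a minimal illustration, not the project's actual code: the class name `TidPipelineSketch`, its method names, and the exact regular expression are assumptions. In particular, the README does not say how many digits a TID has or which separators may follow the keyword, so the pattern below accepts any run of digits after an optional `:`, `#`, or `-`. The input is assumed to be the plain text already returned by the OCR engine.

```csharp
using System;
using System.Collections.Generic;
using System.IO;
using System.Text.RegularExpressions;

// Hypothetical helper for illustration; names are not taken from the project.
public static class TidPipelineSketch
{
    // Assumed pattern: the keyword "TID", an optional separator, then digits.
    // The digit count is left open because the README does not specify it.
    private static readonly Regex TidPattern =
        new Regex(@"\bTID\s*[:#\-]?\s*(\d+)", RegexOptions.IgnoreCase);

    // Status value used when no TID can be read from a page.
    public const string NotFound = "Not Found";

    // Scanning: list the .pdf files directly inside the selected folder.
    public static IEnumerable<string> FindPdfFiles(string folder) =>
        Directory.EnumerateFiles(folder, "*.pdf", SearchOption.TopDirectoryOnly);

    // Extraction: return the digits that follow "TID" in the OCR text,
    // or "Not Found" so the page can be flagged for manual review.
    public static string ExtractTid(string ocrText)
    {
        if (string.IsNullOrWhiteSpace(ocrText))
        {
            return NotFound;
        }

        Match match = TidPattern.Match(ocrText);
        return match.Success ? match.Groups[1].Value : NotFound;
    }
}
```

With this sketch, `TidPipelineSketch.ExtractTid("TID: 12345678")` returns `"12345678"`, while text that contains no TID returns `"Not Found"`. Real OCR output is noisier than this (for example, `0` read as `O`), so a production pattern would likely need to be more tolerant.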
- C# / Windows Forms - User Interface
- Tesseract OCR - Text Recognition Engine
- iText7 / Magick.NET - PDF Image Extraction
- ClosedXML - Excel Report Generation
Amer-css - GitHub Profile