A TensorFlow siamese network implementation. Illustrated using signature recognition/identification.
Updated Jun 25, 2019 - Python
A repository with anonymized invoices
~1000 book pages + OpenCV + Python = page regions identified as paragraphs, lines, images, captions, etc.
Creates synthetic degraded document images that can be used to train neural networks
A synthetic data generator for text recognition
Online-handwritten version of the George Washington Dataset.
Tools necessary to perform a multi-fold pretrained voting approach utilizing OCRopus.
A selection of test lines of several early printed books as well as the corresponding individual OCRopus models and mixed models.
Code and procedures for handwriting object detection and recognition
ScrabbleGAN: Semi-Supervised Varying Length Handwritten Text Generation (CVPR20)
Dataset for scene text removal
This web application crawls PDFs from government websites, performs table detection, and displays advanced statistics.
Generate text images for training deep learning OCR models
Total Text Dataset - ICDAR 2017. It consists of 1555 images with more than 3 different text orientations: Horizontal, Multi-Oriented, and Curved, one of a kind.
A TensorFlow reproduction of the paper “Editing Text in the Wild”
CORD: A Consolidated Receipt Dataset for Post-OCR Parsing
Distorted Document Images dataset (DDI-100).