An R package to generate ground truth data for Tesseract

License: Unknown (LICENSE), MIT (LICENSE.md)
arcruz0/tesseractgt

tesseractgt: Generate Ground Truth Data for Tesseract

This package is at an early stage (v0.0.4). See the guide for a fully developed example, and the package documentation for reference.

GIF showing how the tesseractgt GUI works

Step-by-step summary

(Go to the guide for more detailed explanations of each step.)

  1. Install the Tesseract engine and the R packages tesseract and tesseractgt.

  2. Take screenshots to serve as fine-tuning images, and save them in a folder.

  3. Use tesseractgt::create_gt_txt() to create and pre-fill files with the text in the images.

  4. Open the GUI from "Addins > Correct ground truth files" in RStudio, or call tesseractgt::correct_gt_txt(). Use it to correct the text files created in step 3.

  5. Fine-tune using tesstrain.
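
To make step 3 more concrete, here is a minimal sketch of what "creating and pre-filling" ground truth files amounts to. It is not the implementation of tesseractgt::create_gt_txt(), whose arguments are not shown here. The function name `prefill_gt_txt` and its parameters are hypothetical. The sketch assumes PNG images and follows the tesstrain convention that each transcription is a single-line text file with the image's name and a `.gt.txt` extension.

```r
# Illustrative sketch only; NOT tesseractgt's own code.
# `prefill_gt_txt`, `ocr_fun`, and `overwrite` are hypothetical names.
prefill_gt_txt <- function(folder, ocr_fun = tesseract::ocr, overwrite = FALSE) {
  images <- list.files(folder, pattern = "\\.png$", full.names = TRUE,
                       ignore.case = TRUE)
  if (length(images) == 0) return(invisible(character(0)))

  # image "line1.png" gets the transcription file "line1.gt.txt"
  gt_files <- paste0(tools::file_path_sans_ext(images), ".gt.txt")

  for (i in seq_along(images)) {
    # do not clobber files that may already have been corrected by hand
    if (file.exists(gt_files[i]) && !overwrite) next
    text <- paste(ocr_fun(images[i]), collapse = " ")
    # tesstrain expects single-line transcriptions, so collapse whitespace
    text <- trimws(gsub("\\s+", " ", text))
    writeLines(text, gt_files[i])
  }
  invisible(gt_files)
}
```

Calling `prefill_gt_txt("path/to/images")` would then leave one `.gt.txt` file next to each image, ready to be corrected in step 4. In practice, prefer tesseractgt::create_gt_txt() and consult its documentation for the actual arguments.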

Installation

install.packages("remotes") # if `remotes` is not installed
remotes::install_github("arcruz0/tesseractgt")
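
After installing, a quick check such as the following can confirm that the R packages load. This is a sketch, and the helper name `check_installed` is hypothetical. It only uses base R's requireNamespace(), so it does not verify the Tesseract engine itself.

```r
# Hypothetical helper: reports which of the given packages can be loaded.
check_installed <- function(pkgs) {
  vapply(pkgs, requireNamespace, logical(1), quietly = TRUE)
}

status <- check_installed(c("tesseract", "tesseractgt"))
print(status)  # named logical vector, TRUE where the package is available
```

If tesseract loads, `tesseract::tesseract_info()` additionally reports the engine version and the languages available for OCR.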
