docTR-wrapper

Description

Wrapper for docTR end-to-end text detection and recognition.

Input

The wrapper takes a VideoDocument with TimeFrame annotations with label property (for example, from SWT app that classifies scenes). See input section of the app metadata for more details.

docTR Structured Output

From the docTR documentation

The docTR model returns a Document object

Here is the typical Document layout:

Document(
  (pages): [Page(
    dimensions=(340, 600)
    (blocks): [Block(
      (lines): [Line(
        (words): [
          Word(value='No.', confidence=0.91),
          Word(value='RECEIPT', confidence=0.99),
          Word(value='DATE', confidence=0.96),
        ]
      )]
      (artefacts): []
    )]
  )]
)

The docTR wrapper preserves this structured information in the output MMIF by creating lapps Paragraph Sentence and Token annotations corresponding to the Block, Line, and Word from the docTR output.

User instruction

General user instructions for CLAMS apps are available at CLAMS Apps documentation.

Below is a list of additional information specific to this app.

System requirements

Requires mmif-python[cv] for the VideoDocument helper functions
Requires GPU to run at a reasonable speed

Configurable runtime parameter

For the full list of parameters, please refer to the app metadata from the CLAMS App Directory or the metadata.py file in this repository.

Name		Name	Last commit message	Last commit date
Latest commit History 35 Commits
.github/workflows		.github/workflows
.dockerignore		.dockerignore
.gitignore		.gitignore
Containerfile		Containerfile
LICENSE		LICENSE
README.md		README.md
app.py		app.py
cli.py		cli.py
metadata.py		metadata.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

docTR-wrapper

Description

Input

docTR Structured Output

User instruction

System requirements

Configurable runtime parameter

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors 3

Uh oh!

Languages

License

clamsproject/app-doctr-wrapper

Folders and files

Latest commit

History

Repository files navigation

docTR-wrapper

Description

Input

docTR Structured Output

User instruction

System requirements

Configurable runtime parameter

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors 3

Uh oh!

Languages

Packages