how to proceed?

Nice work!

First, a question:
Could you provide your trained model (model66.zip)? I do not have a decent GPU :-( Otherwise, how long would training take on Google Cloud with a GPU (T4, K80, P100, V100, P4)? My current val_loss at epoch 188 is 0.27689?!

Some missing links:
While going through the steps in the Python training script, I found that one needs JPG instead of BMP for TensorFlow, right?
For one XML file, "10.1.1.1.xml", no BMP (JPG) exists. That is why the script crashes at this point.
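
In case it helps others who hit the same crash, here is a rough sketch of how I would check for annotation files without a matching image before training, plus an optional BMP to JPG conversion. The function names and the directory layout are my own assumptions and are not taken from the repository; the conversion part assumes Pillow is installed.

```python
from pathlib import Path

# Image extensions that count as "an image exists for this annotation".
IMAGE_EXTENSIONS = (".bmp", ".jpg", ".jpeg")


def find_orphan_annotations(xml_dir, image_dir, extensions=IMAGE_EXTENSIONS):
    """Return the names of XML files that have no image with the same stem.

    Hypothetical helper: xml_dir and image_dir are whatever folders hold
    the annotations and the pictures in your copy of the dataset.
    """
    image_stems = {
        p.stem
        for p in Path(image_dir).iterdir()
        if p.is_file() and p.suffix.lower() in extensions
    }
    # Path.stem only drops the last suffix, so "10.1.1.1.xml" -> "10.1.1.1".
    return sorted(
        p.name for p in Path(xml_dir).glob("*.xml") if p.stem not in image_stems
    )


def convert_bmp_to_jpg(image_dir, quality=95):
    """Write a .jpg next to every .bmp in image_dir (requires Pillow)."""
    from PIL import Image  # imported lazily so the check above needs no extras

    written = []
    for bmp_path in sorted(Path(image_dir).glob("*.bmp")):
        jpg_path = bmp_path.with_suffix(".jpg")
        # JPEG has no alpha or palette mode, so convert to RGB first.
        Image.open(bmp_path).convert("RGB").save(jpg_path, "JPEG", quality=quality)
        written.append(jpg_path.name)
    return written
```

Calling `find_orphan_annotations(xml_dir, image_dir)` before training should list files like "10.1.1.1.xml", so they can be skipped or removed instead of crashing the script.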

Finally, another question:
As far as I can tell, the model produces a table mask and a column mask. In a comment on the original paper I found this:

_Outputs: After the documents are processed using the model, the masks of tables and columns are generated. These masks are used to filter out the table and its column regions from the image. **Now using the Tesseract OCR**, the information is extracted from the segmented regions. Below is an image showing the masks that are generated and later extracted from the tables_

Any hint on where I can find a good intro to extracting the final table content from a BMP/JPG picture and the table/column masks using Tesseract OCR?
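
To make the question concrete, this is roughly what I imagine the extraction step looks like. It is only a sketch under my own assumptions, not the method from the paper or from this repository: the masks are 2-D arrays with the same height and width as the image (resized beforehand if the model predicts at a lower resolution), values above 0.5 count as "inside", and columns are separated by gaps in the column mask. The helper names are hypothetical; the OCR part assumes Pillow, pytesseract and a Tesseract installation.

```python
import numpy as np


def mask_bbox(mask, threshold=0.5):
    """Return (left, top, right, bottom) around all pixels above threshold, or None."""
    ys, xs = np.nonzero(np.asarray(mask) > threshold)
    if ys.size == 0:
        return None
    return int(xs.min()), int(ys.min()), int(xs.max()) + 1, int(ys.max()) + 1


def column_spans(column_mask, threshold=0.5):
    """Return [(x_start, x_end), ...] for each contiguous run of masked x positions."""
    active = (np.asarray(column_mask) > threshold).any(axis=0)
    # Pad with False so runs touching the border still produce a start and an end.
    padded = np.concatenate(([False], active, [False]))
    changes = np.flatnonzero(padded[1:] != padded[:-1])
    return [(int(s), int(e)) for s, e in zip(changes[::2], changes[1::2])]


def ocr_table(image_path, table_mask, column_mask, lang="eng"):
    """OCR each column inside the table region; returns one list of lines per column."""
    from PIL import Image  # lazy imports: only needed for the OCR step
    import pytesseract

    image = Image.open(image_path).convert("RGB")
    box = mask_bbox(table_mask)
    if box is None:
        return []
    left, top, right, bottom = box

    columns = []
    # Only look at the part of the column mask that lies inside the table.
    for x0, x1 in column_spans(np.asarray(column_mask)[top:bottom, left:right]):
        crop = image.crop((left + x0, top, left + x1, bottom))
        # --psm 6 tells Tesseract to treat the crop as one uniform block of text.
        text = pytesseract.image_to_string(crop, lang=lang, config="--psm 6")
        columns.append([line for line in text.splitlines() if line.strip()])
    return columns
```

The per-column line lists would still have to be aligned into rows, for example by line position, which is the part I am least sure about.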
