Skip to content

Create a script for transforming PDF to text #1

@liadmagen

Description

@liadmagen

add in src/papers/data/ a script that transforms the papers (in data/papers/raw) from a pdf to a text.
The outcome should be saved in data/papers/interim/ folder.

Use pdfminer.six or any other service to do so.

Optional: use a pre-made docker container as a service for it.

Metadata

Metadata

Assignees

No one assigned

    Labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions