Self-hostable PDF parsing engine for structured text and table extraction. Works locally or via Docker.
- Extract text from PDFs
- Extract tables
- Supports password-protected PDFs (the password must be supplied; it is not recovered or bypassed)
- Runs locally or via Docker

Run with Docker:

    docker build -t open-pdf-parser .
    docker run -p 8000:8000 open-pdf-parser

Run locally:

    pip install -r requirements.txt
    uvicorn app.main:app --reload
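
Once the server is running, a client could upload a PDF over HTTP. The sketch below uses only the Python standard library and rests on several assumptions that this README does not confirm: the route `/parse`, the form field names `file` and `password`, and a JSON response are all hypothetical placeholders. Check the routes defined in `app.main` and adjust accordingly. Port 8000 matches the Docker command and is also uvicorn's default.

```python
import json
import os
import urllib.request
import uuid

BASE_URL = "http://localhost:8000"  # port 8000, as in the run commands
PARSE_PATH = "/parse"               # HYPOTHETICAL route, not confirmed by this README


def build_multipart(pdf_bytes, filename, password=None):
    """Encode a PDF and an optional password as multipart/form-data.

    The field names "file" and "password" are assumptions.
    Returns (body_bytes, content_type_header_value).
    """
    boundary = uuid.uuid4().hex
    parts = []
    if password is not None:
        header = (
            f"--{boundary}\r\n"
            'Content-Disposition: form-data; name="password"\r\n\r\n'
        )
        parts.append(header.encode() + password.encode() + b"\r\n")
    header = (
        f"--{boundary}\r\n"
        f'Content-Disposition: form-data; name="file"; filename="{filename}"\r\n'
        "Content-Type: application/pdf\r\n\r\n"
    )
    parts.append(header.encode() + pdf_bytes + b"\r\n")
    parts.append(f"--{boundary}--\r\n".encode())  # closing boundary
    return b"".join(parts), f"multipart/form-data; boundary={boundary}"


def parse_pdf(path, password=None):
    """POST a local PDF to the server and decode the (assumed) JSON reply."""
    with open(path, "rb") as fh:
        body, content_type = build_multipart(
            fh.read(), os.path.basename(path), password
        )
    request = urllib.request.Request(
        BASE_URL + PARSE_PATH,
        data=body,
        headers={"Content-Type": content_type},
        method="POST",
    )
    with urllib.request.urlopen(request) as response:
        return json.loads(response.read())
```

With the server up, a call such as `parse_pdf("report.pdf", password="secret")` would send the file; the shape of the returned data depends on the actual API.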