Notarius is a specialized data extraction service built at the Centre for Medieval Studies in Lublin.
Prerequisites: Python 3.12+, Docker, CUDA-capable GPU (for local OCR inference)
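A quick way to confirm these prerequisites before installing is a small check script. The following is only a sketch, not part of Notarius: the function name `check_prerequisites` is hypothetical, the presence of `nvidia-smi` on the PATH is used as a rough proxy for a CUDA-capable GPU (an assumption), and `uv` is included because the setup commands call it.

```python
import shutil
import sys


def check_prerequisites(min_python=(3, 12), tools=("docker", "nvidia-smi", "uv")):
    """Return a dict mapping each prerequisite name to True (found) or False.

    Hypothetical helper for illustration; not shipped with Notarius.
    """
    # Compare only (major, minor) of the running interpreter.
    results = {"python": sys.version_info[:2] >= tuple(min_python)}
    for tool in tools:
        # shutil.which returns None when the executable is not on the PATH.
        # Assumption: finding nvidia-smi suggests an NVIDIA driver is present;
        # it does not prove that CUDA inference will actually work.
        results[tool] = shutil.which(tool) is not None
    return results


if __name__ == "__main__":
    for name, ok in check_prerequisites().items():
        print(f"{name}: {'ok' if ok else 'MISSING'}")
```

A missing `nvidia-smi` only matters for local OCR inference; the other checks apply to every setup.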
# Install dependencies
uv sync
# Set up environment
cp .env.example .env
# Install pre-commit hooks
pre-commit install
# Run tests
uv run pytest
# Run dagster
dagster dev
- Installation Guide - Detailed setup instructions for different environments
- Technical Documentation - Explanation of the indexing and extraction strategies
- Contributing - How to contribute to the project
- Conventional Commits - Commit message conventions
- Architecture - System design and ETL pipeline structure
MIT License - see LICENSE file for details.