Curator is a migration tool developed to help The Center for the Study of New Testament Manuscripts work through their backlog of data that is not yet stored in their database.
Install the dependencies in the requirements.txt file:

    pip install -r requirements.txt

Run the Python script inside the directory with the PDFs:

    python curator.py <filename>.pdf

to process a specific PDF, or

    python curator.py

to process every PDF in the directory.
Running the script will produce a JSON file containing the data which can be loaded into the database.
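The two invocation modes above can be illustrated with a minimal sketch. This is not the actual curator.py: the function names `select_pdfs`, `extract_record`, and `main`, the output file name `output.json`, and the record layout are all hypothetical, and the real extraction logic is replaced by a placeholder.

```python
import json
from pathlib import Path


def select_pdfs(args, directory="."):
    """Return the named PDF if one was given, otherwise every PDF in the directory."""
    directory = Path(directory)
    if args:
        return [directory / args[0]]
    return sorted(directory.glob("*.pdf"))


def extract_record(pdf_path):
    # Placeholder only: the real tool reads data out of the PDF here.
    # The "source_file" field is an assumption made for this sketch.
    return {"source_file": pdf_path.name}


def main(argv, directory="."):
    """Process the selected PDFs and write their records to a JSON file."""
    records = [extract_record(p) for p in select_pdfs(argv[1:], directory)]
    # Hypothetical output name; the actual script may name its file differently.
    out_path = Path(directory) / "output.json"
    out_path.write_text(json.dumps(records, indent=2))
    return out_path
```

In this sketch, calling `main(["curator.py", "example.pdf"])` would handle a single file, while `main(["curator.py"])` would handle every `*.pdf` file in the current directory.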