LatinCy NER

Curated NER training data for LatinCy Latin language models.

Status

This dataset is currently in preparation and will be available shortly.

Entity types: PERSON, LOC, NORP
Sources: Universal Dependencies, biblical texts, Latin primers, Tesserae, and other annotated corpora
Format: spaCy-compatible JSON singles with character-offset span annotations
Splits: Train, dev, and held-out document-level test set

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
LICENSE		LICENSE
README.md		README.md