Skip to content

latincy/latincy-ner

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 

Repository files navigation

LatinCy NER

Curated NER training data for LatinCy Latin language models.

Status

This dataset is currently in preparation and will be available shortly.

Overview

  • Entity types: PERSON, LOC, NORP
  • Sources: Universal Dependencies, biblical texts, Latin primers, Tesserae, and other annotated corpora
  • Format: spaCy-compatible JSON singles with character-offset span annotations
  • Splits: Train, dev, and held-out document-level test set

License

CC BY 4.0

About

Curated NER training data for LatinCy Latin language models (PERSON, LOC, NORP)

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors