
Ancient Greek morphosyntactic analyzer and parser of declensions and conjugations. This version is a two-layer model with a sigmoid hidden layer, a softmax output layer, and a cross-entropy loss function. This version only predicts verb voice. The data set is the Opera Graeca Adnotata corpus, 40 million Ancient Greek word tokens in 1,999 texts, not included.


user12151985/opera_graeca_verb_parser


Info.

This is an Ancient Greek morphosyntactic analyzer and parser for declensions and conjugations. This version is a two-layer model with a sigmoid hidden layer, a linear output layer with softmax, and a cross-entropy loss function. It currently predicts only verb voice.

The data set is the Opera Graeca Adnotata (OGA) corpus: 40 million Ancient Greek word tokens across 1,999 texts. It is not included here; see below.
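The architecture described above (sigmoid hidden layer, linear output with softmax, cross-entropy loss) can be sketched in plain NumPy. This is a minimal illustration and not the project's actual code: the layer sizes, the three voice classes, the random toy features, and the names `init_params`, `forward`, and `train_step` are all assumptions made for the example.

```python
import numpy as np

# Hypothetical sizes: 8 input features, 16 hidden units, 3 voice classes.
N_FEATURES, N_HIDDEN, N_CLASSES = 8, 16, 3

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def softmax(z):
    # Subtract the row maximum for numerical stability.
    z = z - z.max(axis=1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=1, keepdims=True)

def init_params(rng):
    return {
        "W1": rng.normal(0.0, 0.1, (N_FEATURES, N_HIDDEN)),
        "b1": np.zeros(N_HIDDEN),
        "W2": rng.normal(0.0, 0.1, (N_HIDDEN, N_CLASSES)),
        "b2": np.zeros(N_CLASSES),
    }

def forward(params, X):
    # Hidden sigmoid layer, then a linear output layer passed through softmax.
    h = sigmoid(X @ params["W1"] + params["b1"])
    logits = h @ params["W2"] + params["b2"]
    return h, softmax(logits)

def cross_entropy(probs, y):
    # Mean negative log-probability of the correct class.
    n = y.shape[0]
    return -np.log(probs[np.arange(n), y] + 1e-12).mean()

def train_step(params, X, y, lr=0.1):
    # One full-batch gradient-descent step; returns the loss before the update.
    n = X.shape[0]
    h, probs = forward(params, X)
    d_logits = probs.copy()
    d_logits[np.arange(n), y] -= 1.0   # gradient of softmax + cross-entropy
    d_logits /= n
    d_h = d_logits @ params["W2"].T
    d_z1 = d_h * h * (1.0 - h)         # derivative of the sigmoid
    params["W2"] -= lr * (h.T @ d_logits)
    params["b2"] -= lr * d_logits.sum(axis=0)
    params["W1"] -= lr * (X.T @ d_z1)
    params["b1"] -= lr * d_z1.sum(axis=0)
    return cross_entropy(probs, y)

# Toy data (an assumption): random features, labels derived from feature 0.
rng = np.random.default_rng(0)
X = rng.normal(size=(64, N_FEATURES))
y = np.digitize(X[:, 0], [-0.5, 0.5])  # class ids 0, 1, 2

params = init_params(rng)
losses = [train_step(params, X, y) for _ in range(300)]
_, probs = forward(params, X)
print(f"first loss {losses[0]:.4f}, last loss {losses[-1]:.4f}")
```

In the real project the input features would be derived from OGA tokens rather than random numbers; how they are encoded is not stated here, so the sketch leaves that part out.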

Setup.

  • Unzip the main project folder.

  • Download the Opera Graeca Adnotata corpus from Zenodo or GitHub, then move it into the main project folder above and unzip it there. It consists of two main folders: OGA-main (5.9 MB) and opera_graeca_adnotata_v0.2.0 (66 GB). Links:

  • Create and activate a virtual environment.

    • Change into your new, cloned and unzipped project directory, for example: $ cd /Users/admin/Desktop/Opera_Graeca_Verb_Parser

    • Create the venv: $ python3 -m venv .venv (when returning to the project later, skip this step and just activate).

    • Activate it: $ source .venv/bin/activate

  • Install dependencies.

    • $ pip install -r requirements.txt
  • Set the environment variable OGA_ROOT to the directory path of the OGA corpus on your machine.

    • Make sure you're in the project directory: $ cd /Users/admin/Desktop/Opera_Graeca_Verb_Parser
    • $ export OGA_ROOT=/Users/admin/Desktop/Opera_Graeca_Verb_Parser/opera_graeca_adnotata/opera_graeca_adnotata_v0.2.0
    • Confirm with: $ python3 -c "import os; print(os.environ.get('OGA_ROOT'))"
  • Start Jupyter.

    • $ jupyter notebook
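Because the notebook depends on OGA_ROOT being set before Jupyter starts, a small check at the top of a notebook can catch a missing or mistyped path early. The helper below is a hedged sketch: the function name `resolve_oga_root` is hypothetical and not part of this repository, and it only verifies that the variable is set and points to an existing directory.

```python
import os
from pathlib import Path

def resolve_oga_root(env=None):
    """Return OGA_ROOT as a Path, or raise RuntimeError with a clear message.

    `env` defaults to os.environ; passing a dict makes the check easy to test.
    """
    env = os.environ if env is None else env
    raw = env.get("OGA_ROOT")
    if not raw:
        raise RuntimeError("OGA_ROOT is not set; export it before starting Jupyter.")
    root = Path(raw).expanduser()
    if not root.is_dir():
        raise RuntimeError(f"OGA_ROOT points to a missing directory: {root}")
    return root

if __name__ == "__main__":
    try:
        print("Using OGA corpus at:", resolve_oga_root())
    except RuntimeError as err:
        print("Setup problem:", err)
```

Note that a variable set with export is visible only to processes started from that same shell session, so Jupyter needs to be launched from the terminal where OGA_ROOT was exported.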

Shutdown.

  • Exit Jupyter.

    • Press Ctrl+C in the terminal where Jupyter is running.
  • Deactivate the venv.

    • $ deactivate
