Published May 22, 2025 | Version conference_presentation
Presentation Open

Morph-Tags and Authorship(s)

  • 1. Principal, Clavis Consulting, LLC, Kansas
  • 2. St Mary's University, Twickenham, London

Description

Morphological tagging of ancient Greek linguistic corpora is approaching 50 years. Notable projects include CCAT/CATSS (1977-), its commercial spinoffs (BibleWorks, 1992-; Logos, 1992-; Accordance, 1994-). TLG (2006-), PROIEL (2007-), AGDT 1.0 (2009-), AGDT 2.0 (2014-), Pedalion (2019-), GLAUx (2021-), and OGA (2023-). Together with the Diorisis Ancient Greek Corpus (2018-), the more recent projects have built corpora of 10s of millions of tokens.

In light of all of these developments, what should we do and where should we go? Continue building international collaborations, curate customized datasets, and make linguistic phenemena more easily searchable and citable! Two new corpora of 32M+ tokens each (AGDTmini and CATnaPS) are here released and a starter Jupyter Notebook provided to start to make use of each repository.

Files

mgbilby/NAPS25DH-conference_presentation.zip

Files (6.2 MB)

Name Size Download all
md5:f9ce22d039ec9881ac30c6fed997cc5d
6.2 MB Preview Download

Additional details

Dates

Available
2025-05-22

Software

Repository URL
https://github.com/mgbilby/NAPS25DH
Programming language
R , SCSS , JavaScript
Development Status
Active