Morph-Tags and Authorship(s)
Authors/Creators
- 1. Principal, Clavis Consulting, LLC, Kansas
- 2. St Mary's University, Twickenham, London
Description
Morphological tagging of ancient Greek linguistic corpora has a history approaching 50 years. Notable projects include CCAT/CATSS (1977-) and its commercial spinoffs (BibleWorks, 1992-; Logos, 1992-; Accordance, 1994-), TLG (2006-), PROIEL (2007-), AGDT 1.0 (2009-), AGDT 2.0 (2014-), Pedalion (2019-), GLAUx (2021-), and OGA (2023-). Together with the Diorisis Ancient Greek Corpus (2018-), the more recent projects have built corpora of tens of millions of tokens.
In light of all of these developments, what should we do and where should we go? Continue building international collaborations, curate customized datasets, and make linguistic phenomena more easily searchable and citable! Two new corpora of 32M+ tokens each (AGDTmini and CATnaPS) are released here, and a starter Jupyter Notebook is provided to help users begin working with each repository.
Files
mgbilby/NAPS25DH-conference_presentation.zip (6.2 MB)
md5:f9ce22d039ec9881ac30c6fed997cc5d
Additional details
Related works
- Compiles
- Software: https://github.com/mgbilby/NAPS25DH/tree/conference_presentation (URL)
- Describes
- Dataset: 10.5281/zenodo.15508450 (DOI)
- Dataset: 10.5281/zenodo.15508287 (DOI)
Dates
- Available
- 2025-05-22
Software
- Repository URL
- https://github.com/mgbilby/NAPS25DH
- Programming language
- R, SCSS, JavaScript
- Development Status
- Active