Conference paper Open Access

Challenges in Converting the Index Thomisticus Treebank into Universal Dependencies

Cecchini, Flavio Massimiliano; Passarotti, Marco; Marongiu, Paola; Zeman, Daniel

This  paper  describes  the  changes  applied  to the  original  process  used  to  convert  the Index Thomisticus Treebank, a corpus including texts in Medieval Latin by Thomas Aquinas, into the annotation style of Universal Dependencies.   The  changes  are  made  both  to  harmonise  the  Universal  Dependencies  version of  the Index  Thomisticus Treebank  with  the two other available Latin treebanks and to fix errors  and  inconsistencies  resulting  from  the original process. The paper details the treatment of different issues in PoS tagging, lemmatisation and assignment of dependency relations. Finally, it assesses the quality of the new conversion process by providing an evaluation against a gold standard.

Files (239.3 kB)
Name Size
239.3 kB Download
All versions This version
Views 1414
Downloads 1313
Data volume 3.1 MB3.1 MB
Unique views 88
Unique downloads 99


Cite as