Conference paper Open Access
Cecchini, Flavio Massimiliano;
Sprugnoli, Rachele;
Moretti, Giovanni;
Passarotti, Marco
This paper presents the early stages of the development of a new treebank containing all of Dante Alighieri’s Latin works. In particular, it describes the conversion of the original TEI-XML files to CoNLL-U, the creation of a gold standard, the process of training four annotators and the evaluation of the syntactic annotation in terms of inter-annotator agreement and LA, UAS and LAS. The aim is to release a new resource, in view of the celebrations for the 700th anniversary of Dante’s death, which can support the development of the Vocabolario Dantesco.
Name | Size | |
---|---|---|
2020_Cecchini-et-alii_UDante_CLiC-it.pdf
md5:6b62711759ce48d95b83bf306e731a77 |
289.9 kB | Download |
All versions | This version | |
---|---|---|
Views | 145 | 145 |
Downloads | 80 | 80 |
Data volume | 23.2 MB | 23.2 MB |
Unique views | 130 | 130 |
Unique downloads | 75 | 75 |