Published December 12, 2020 | Version v1
Conference paper Open

UDante: First Steps Towards the Universal Dependencies Treebank of Dante's Latin Works

  • 1. Università Cattolica del Sacro Cuore, Milan, Italy

Description

This paper presents the early stages of the development of a new treebank containing all of Dante Alighieri’s Latin works. In particular, it describes the conversion of the original TEI-XML files to CoNLL-U, the creation of a gold standard, the process of training four annotators and the evaluation of the syntactic annotation in terms of inter-annotator agreement and LA, UAS and LAS. The aim is to release a new resource, in view of the celebrations for the 700th anniversary of Dante’s death, which can support the development of the Vocabolario Dantesco.

Files

2020_Cecchini-et-alii_UDante_CLiC-it.pdf

Files (289.9 kB)

Name Size Download all
md5:6b62711759ce48d95b83bf306e731a77
289.9 kB Preview Download

Additional details

Funding

LiLa – Linking Latin. Building a Knowledge Base of Linguistic Resources for Latin 769994
European Commission