There is a newer version of the record available.

Published May 16, 2020 | Version v1
Conference paper Open

A New Latin Treebank for Universal Dependencies: Charters between Ancient Latin and Romance Languages

  • 1. Università Cattolica del Sacro Cuore, Milan, Italy
  • 2. University of Helsinki, Finland

Description

obtained from the automated conversion of the Late Latin Charter Treebank 2 (LLCT2), originally in the Prague Dependency Treebank (PDT) style. As this treebank consists of Early Medieval legal documents, its language variety differs considerably from both the Classical and Medieval learned varieties prevalent in the other currently available UD Latin treebanks. Consequently, besides significant phenomena from the perspective of diachronic linguistics, this treebank also poses several challenging technical issues for the current and future syntactic annotation of Latin in the UD framework. Some of the most relevant cases are discussed in depth, with comparisons between the original PDT and the resulting UD annotations. Additionally, an overview of the UD-style structure of the treebank is given, and some diachronic aspects of the transition from Latin to Romance languages are highlighted.

Files

2020_Cecchini-et-alii_LREC_LLCT_UD.pdf

Files (305.5 kB)

Name Size Download all
md5:2220999fdf1ee7778272e2bfdf3fffc6
305.5 kB Preview Download

Additional details

Funding

LiLa – Linking Latin. Building a Knowledge Base of Linguistic Resources for Latin 769994
European Commission