Published February 1, 2020 | Version 1.2
Dataset Open

Late Latin Charter Treebank 2 (LLCT2), version 1.2

  • 1. University of Helsinki

Description

Version 1.2 of the Late Latin Charter Treebank 2 (LLCT2). Contains a number of minor corrections and replaces the version 1.0 published at Zenodo in 2019. Early Medieval Latin documentary texts from Italy between AD 774-897 with morphological and syntactic annotation. Latin Dependency Treebank (LDT) compatible linguistic annotation, CoNLL treebank format. Note that LLCT2 is also available open-access in the Universal Dependencies format at the website of the Universal Dependencies consortium. For a detailed description of the Late Latin Charter Treebanks, see the pre-print of the paper 'Late Latin Charter Treebank: contents and annotation', to be published in Corpora, 16:2 (2021), at the institutional repository of the University of Helsinki. See also Korkiakangas, T. and Lassila, M. (2013), Abbreviations, fragmentary words, formulaic language: treebanking medieval charter material, in Mambrini, F., Passarotti, M. and Sporleder, C., Proceedings of the third workshop on annotation of corpora for research in the humanities, pp. 61–72, and Korkiakangas, T. and Passarotti, M. (2011), Challenges in Annotating Medieval Latin Charters, in «Journal of Language Technology and Computational Linguistics», 26, pp. 103–114.

Files

Files (12.7 MB)

Name Size Download all
md5:6977adfbaea43c137ed7c5deb000c340
12.7 MB Download