Info: Zenodo’s user support line is staffed on regular business days between Dec 23 and Jan 5. Response times may be slightly longer than normal.

Published September 10, 2020 | Version v1.0.0
Dataset Open

Dataset of EvaLatin 2020

  • 1. Università Cattolica del Sacro Cuore
  • 2. Università di Bergamo

Description

This repository contains training and test data of EvaLatin 2020, the first campaign devoted to the evaluation of Natural Language Processing Tools for Latin. It also includes the evaluation script.

EvaLatin first edition have 2 tasks (i.e. Lemmatization and PoS tagging) each with 3 sub-tasks (i.e. Classical, Cross-Genre, Cross-Time). 

The EvaLatin 2020 dataset is licensed under the Creative Commons Attribution-NonCommercial-ShareAlike (CC BY-NC-SA) 4.0 International.

Files

EvaLatin-2020.zip

Files (4.6 MB)

Name Size Download all
md5:c80d46e0b971259576cf872de9edda09
4.6 MB Preview Download

Additional details

Funding

LiLa – Linking Latin. Building a Knowledge Base of Linguistic Resources for Latin 769994
European Commission

References

  • Sprugnoli, R., Passarotti, M., Cecchini, F. M., & Pellegrini, M. (2020, May). Overview of the evalatin 2020 evaluation campaign. In Proceedings of LT4HALA 2020-1st Workshop on Language Technologies for Historical and Ancient Languages (pp. 105-110).