Published September 10, 2020 | Version v1.0.0
Dataset
Open
Dataset of EvaLatin 2020
- 1. Università Cattolica del Sacro Cuore
- 2. Università di Bergamo
Description
This repository contains training and test data of EvaLatin 2020, the first campaign devoted to the evaluation of Natural Language Processing Tools for Latin. It also includes the evaluation script.
The first edition of EvaLatin has 2 tasks (Lemmatization and PoS tagging), each with 3 sub-tasks (Classical, Cross-Genre, Cross-Time).
The EvaLatin 2020 dataset is licensed under the Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0) license.
Files
Name | Size | Checksum |
---|---|---|
EvaLatin-2020.zip | 4.6 MB | md5:c80d46e0b971259576cf872de9edda09 |
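The MD5 checksum listed for the archive can be used to confirm that a download is intact. The following is a minimal Python sketch, assuming the archive was saved as `EvaLatin-2020.zip` in the current working directory; the function names `md5_of_file` and `verify_archive` are illustrative and are not part of the EvaLatin distribution.

```python
import hashlib
from pathlib import Path

# Checksum as listed on the record page for EvaLatin-2020.zip.
EXPECTED_MD5 = "c80d46e0b971259576cf872de9edda09"


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, read in chunks to limit memory use."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_archive(path, expected=EXPECTED_MD5):
    """Return True if the file's MD5 digest matches the expected value."""
    return md5_of_file(path) == expected.lower()


if __name__ == "__main__":
    # Assumed local file name; adjust to wherever the archive was saved.
    archive = Path("EvaLatin-2020.zip")
    if not archive.exists():
        print(f"{archive} not found")
    elif verify_archive(archive):
        print("Checksum matches")
    else:
        print("Checksum mismatch: the download may be corrupted")
```

A mismatch usually points to an incomplete or corrupted download, in which case fetching the archive again is the simplest remedy.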
Additional details
Related works
- References
- https://github.com/CIRCSE/LT4HALA/tree/master/data_and_doc (URL)
References
- Sprugnoli, R., Passarotti, M., Cecchini, F. M., & Pellegrini, M. (2020, May). Overview of the EvaLatin 2020 evaluation campaign. In Proceedings of LT4HALA 2020 - 1st Workshop on Language Technologies for Historical and Ancient Languages (pp. 105-110).