Published June 17, 2022 | Version v1
Conference paper Open

Overview of the EvaLatin 2022 Evaluation Campaign

  • 1. Università di Parma, Italy
  • 2. Università Cattolica del Sacro Cuore, Milan, Italy
  • 3. KU Leuven

Description

This paper describes the organization and the results of the second edition of EvaLatin, the campaign for the evaluation of Natural Language Processing tools for Latin. The three shared tasks proposed in EvaLatin 2022, i.e., Lemmatization, Part-of-Speech Tagging, and Features Identification, aim to foster research in the field of language technologies for Classical languages. The shared dataset consists of texts taken mainly from the LASLA corpus. More specifically, the training set includes only prose texts of the Classical period, whereas the test set is organized into three sub-tasks: a Classical sub-task on a prose text by an author not included in the training data, a Cross-genre sub-task on poetic and scientific texts, and a Cross-time sub-task on a text of the 15th century. The results obtained by the participants for each task and sub-task are presented and discussed.

Files

2022_Sprugnoli-et-al_Overview-Evalatin2.pdf

Files (595.6 kB)

md5:b3dad23dd2fe3d12224c126329c0893e

Additional details

Funding

LiLa – Linking Latin. Building a Knowledge Base of Linguistic Resources for Latin (grant no. 769994)
European Commission