Published November 18, 2019 | Version v1
Presentation Open

Vir is to Moderatus as Mulier is to Intemperans - Lemma Embeddings for Latin

  • 1. Università Cattolica del Sacro Cuore

Description

Presentation at CLiC-it 2019 - Sixth Italian Conference on Computational Linguistics.

This paper presents a new set of lemma embeddings for Latin. The embeddings are trained on a manually annotated corpus of Classical-era texts; different models, architectures, and dimensions are tested and evaluated using a novel benchmark for the synonym selection task. A qualitative evaluation is also performed on the embeddings of rare lemmas. In addition, we release vectors pre-trained on the “Opera Maiora” of Thomas Aquinas, providing a resource for analyzing Latin from a diachronic perspective.
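As a rough illustration of how a synonym selection task can score lemma embeddings, the sketch below picks, for each target lemma, the candidate whose vector has the highest cosine similarity to the target, and reports accuracy over all questions. The question format (one target, several candidates, one gold synonym), the function names, and the three-dimensional toy vectors are assumptions made up for this example; they are not the benchmark or the vectors released with the presentation.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0  # treat zero vectors as dissimilar to everything
    return dot / (norm_u * norm_v)


def select_synonym(target, candidates, vectors):
    """Return the candidate lemma closest to the target in vector space."""
    return max(candidates, key=lambda c: cosine(vectors[target], vectors[c]))


def accuracy(questions, vectors):
    """Fraction of questions where the selected candidate is the gold synonym."""
    correct = sum(
        1
        for target, candidates, gold in questions
        if select_synonym(target, candidates, vectors) == gold
    )
    return correct / len(questions)


# Toy vectors invented for illustration only (hypothetical values).
toy_vectors = {
    "gladius": [0.9, 0.1, 0.0],
    "ensis": [0.8, 0.2, 0.1],
    "mare": [0.0, 0.9, 0.3],
    "pelagus": [0.1, 0.8, 0.4],
    "liber": [0.1, 0.0, 0.9],
}

# Each question: (target lemma, candidate lemmas, gold synonym). Hypothetical format.
toy_questions = [
    ("gladius", ["ensis", "mare", "liber"], "ensis"),
    ("mare", ["gladius", "pelagus", "liber"], "pelagus"),
]

if __name__ == "__main__":
    print(accuracy(toy_questions, toy_vectors))
```

With real embeddings, the `vectors` dictionary would be filled from the released model files, and lemmas missing from the vocabulary would need explicit handling, which this sketch omits.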

Files

CLiC_it2019_Sprugnoli_et_al_Slide.pdf (852.3 kB)
md5:4779903ed68ea502a5a29529a7889351

Additional details

Funding

LiLa – Linking Latin. Building a Knowledge Base of Linguistic Resources for Latin (grant no. 769994)
European Commission