Published July 24, 2019 | Version v1
Conference paper | Open access

Harmonizing Different Lemmatization Strategies for Building a Knowledge Base of Linguistic Resources for Latin

  • 1. Università Cattolica del Sacro Cuore

Description

Interoperability between lemmatized corpora of Latin and other resources that use the lemma as an indexing key is hampered by the different lemmatization strategies that projects adopt. In this paper we discuss how we tackle the challenges of harmonizing different lemmatization criteria in a project that aims to connect linguistic resources for Latin using the Linked Data paradigm. The paper introduces the architecture supporting an open-ended, lemma-based Knowledge Base, built to make textual and lexical resources for Latin interoperable. In particular, it describes the inclusion in the Knowledge Base of three components: its lexical basis, a word formation lexicon, and a lemmatized and syntactically annotated corpus.
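To make the idea of a lemma-based index concrete, here is a minimal sketch in Python. It is not the project's implementation and makes no claim about the actual harmonization rules: the class `LemmaIndex`, the identifier `lemma:1`, the resource names, and the normalization rule (folding case, v/u and j/i) are all assumptions made for illustration. It only shows how entries from resources that write the same lemma differently could be attached to one shared lemma record.

```python
from collections import defaultdict


def normalize(lemma):
    """Fold case and the v/u and j/i spellings into one lookup key.

    This rule is an illustrative assumption, not the paper's criteria.
    """
    return lemma.lower().replace("v", "u").replace("j", "i")


class LemmaIndex:
    """Toy lemma-keyed index; all identifiers here are hypothetical."""

    def __init__(self):
        self._ids = {}                  # normalized written form -> lemma id
        self._links = defaultdict(set)  # lemma id -> {(resource, entry_ref)}

    def add_lemma(self, lemma_id, written_forms):
        """Register a lemma record together with its known written forms."""
        for form in written_forms:
            self._ids[normalize(form)] = lemma_id

    def link(self, resource, entry_ref, lemma):
        """Attach a resource entry to its lemma; return the lemma id or None."""
        lemma_id = self._ids.get(normalize(lemma))
        if lemma_id is not None:
            self._links[lemma_id].add((resource, entry_ref))
        return lemma_id

    def resources_for(self, lemma):
        """Return every (resource, entry_ref) pair linked to the lemma."""
        lemma_id = self._ids.get(normalize(lemma))
        return set(self._links[lemma_id]) if lemma_id is not None else set()


index = LemmaIndex()
index.add_lemma("lemma:1", ["volo"])
index.link("corpusA", "token-17", "Volo")   # a corpus that writes v
index.link("lexiconB", "entry-9", "uolo")   # a lexicon that writes u
```

Both entries end up under the same lemma record even though the two hypothetical resources spell the lemma differently, which is the property a lemma-keyed knowledge base relies on. A lemma that matches no registered form returns `None` rather than being linked by guesswork.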

Files

W19-4009.pdf (252.1 kB)
md5:518c1b88a3c9ee18d06baaabab9c2878

Additional details

Funding

European Commission: LiLa – Linking Latin. Building a Knowledge Base of Linguistic Resources for Latin (grant 769994)