Published September 7, 2019 | Version pre-print
Conference paper Open

CogNet: a Large-Scale Cognate Database

  • 1. University of Trento

Description

This paper introduces CogNet, a new, large-scale lexical database that provides cognates—words of common origin and meaning—across languages. The database currently contains 3.1 million cognate pairs across 338 languages using 35 writing systems. The paper also describes the automated method by which cognates were computed from publicly available wordnets, with an accuracy evaluated to 94%. Finally, statistics and early insights about the cognate data are presented, hinting at a possible future exploitation of the resource by various fields of lingustics.

Files

Huygaa-P19-1302.pdf

Files (2.2 MB)

Name Size Download all
md5:2a746f3549a11f1b2da9e991e5de9a2b
2.2 MB Preview Download

Additional details

Funding

CyCAT – Cyprus Center for Algorithmic Transparency 810105
European Commission