Dataset Open Access

lexibank/uralex: UraLex basic vocabulary dataset

Syrjänen, Kaj; Lehtinen, Jyri; Vesakoski, Outi; de Heer, Mervi; Suutari, Toni; Dunn, Michael; Määttä, Urho; Leino, Unni-Päivä

The UraLex basic vocabulary dataset has its origins in the basic vocabulary cognacy dataset collected by the research initiative BEDLAN (Biological Evolution and the Diversification of Languages), funded by the Kone Foundation from 2009 to 2013. The data has since been revised and expanded in follow-up research projects, including SumuraSyyni (2014-2016), UraLex (2014-2016) and AikaSyyni (2017-2020). The dataset has been compiled especially for quantitative approaches to language classification and historical linguistics, such as Bayesian inference of phylogeny.

Files (1.4 MB)
Name Size
lexibank/uralex-v1.0.zip 1.4 MB
md5:703203fa3a4f2180b9c1cbe81059b78d
                   All versions   This version
Views              122            122
Downloads          25             25
Data volume        34.7 MB        34.7 MB
Unique views       93             93
Unique downloads   22             22

Cite as