Dataset Open Access

lexibank/uralex: UraLex basic vocabulary dataset

Syrjänen, Kaj; Lehtinen, Jyri; Vesakoski, Outi; de Heer, Mervi; Suutari, Toni; Dunn, Michael; Määttä, Urho; Leino, Unni-Päivä

The UraLex basic vocabulary dataset has its origins in the basic vocabulary cognacy dataset collected by the research initiative BEDLAN (Biological Evolution and the Diversification of Languages), funded by the Kone Foundation from 2009 to 2013. The data has since been revised and expanded in follow-up research projects, including SumuraSyyni (2014-2016), UraLex (2014-2016) and AikaSyyni (2017-2020). The dataset has been compiled especially for quantitative work in language classification and historical linguistics, such as Bayesian inference of phylogeny.

Files (1.4 MB)
Name: lexibank/uralex-v1.0.zip
Size: 1.4 MB
md5: 703203fa3a4f2180b9c1cbe81059b78d
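
The md5 value listed above can be used to check that a downloaded copy of the archive is intact. The following is a minimal sketch, not part of the dataset itself; the local filename `uralex-v1.0.zip` and the helper names `md5_of_file` and `verify_archive` are assumptions made for illustration.

```python
import hashlib
from pathlib import Path

# Checksum as listed on this record page.
EXPECTED_MD5 = "703203fa3a4f2180b9c1cbe81059b78d"


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex md5 digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_archive(path, expected=EXPECTED_MD5):
    """Return True if the file's md5 digest matches the expected value."""
    return md5_of_file(path) == expected.lower()


if __name__ == "__main__":
    # Assumed local filename; adjust to wherever the archive was saved.
    archive = Path("uralex-v1.0.zip")
    if archive.exists():
        print("checksum OK" if verify_archive(archive) else "checksum MISMATCH")
    else:
        print(f"{archive} not found; download it first")
```

A mismatch usually indicates an incomplete or corrupted download, in which case the archive should be fetched again.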
                  All versions   This version
Views             85             85
Downloads         19             19
Data volume       26.4 MB        26.4 MB
Unique views      63             63
Unique downloads  16             16

Cite as