Dataset Open Access

lexibank/uralex: UraLex basic vocabulary dataset

Syrjänen, Kaj; Lehtinen, Jyri; Vesakoski, Outi; de Heer, Mervi; Suutari, Toni; Dunn, Michael; Määttä, Urho; Leino, Unni-Päivä

The UraLex basic vocabulary dataset originates in the basic vocabulary cognacy dataset collected by the research initiative BEDLAN (Biological Evolution and the Diversification of Languages), funded by the Kone Foundation from 2009 to 2013. The data has since been revised and expanded in follow-up research projects, including SumuraSyyni (2014-2016), UraLex (2014-2016) and AikaSyyni (2017-2020). The dataset has been compiled especially for quantitative language classification and historical linguistics, for example Bayesian inference of phylogeny.

Files (1.4 MB)
Name: lexibank/uralex-v1.0.zip
Size: 1.4 MB
md5:703203fa3a4f2180b9c1cbe81059b78d
