Dataset Open Access

lexibank/uralex: UraLex basic vocabulary dataset

Syrjänen, Kaj; Lehtinen, Jyri; Vesakoski, Outi; de Heer, Mervi; Suutari, Toni; Dunn, Michael; Määttä, Urho; Leino, Unni-Päivä

The UraLex basic vocabulary dataset has its origins in the basic vocabulary cognacy dataset collected by the research initiative BEDLAN (Biological Evolution and the Diversification of Languages), funded by the Kone Foundation between 2009-2013. The data has since been revised and expanded in follow-up research projects, including SumuraSyyni (2014-2016), UraLex (2014-2016) and AikaSyyni (2017-2020). The dataset has been compiled especially for the purposes of quantitative language classification/historical linguistics, such as Bayesian Inference of phylogeny.

Files (1.4 MB)
Name Size
lexibank/uralex-v1.0.zip
md5:703203fa3a4f2180b9c1cbe81059b78d
1.4 MB Download
151
27
views
downloads
All versions This version
Views 151151
Downloads 2727
Data volume 37.5 MB37.5 MB
Unique views 118118
Unique downloads 2424

Share

Cite as