Dataset Open Access

SemUr - Semantic Databases for Uralic Languages

Hämäläinen, Mika

These databases are translated from SemFi by using Giellatekno XML dictionaries. The included python script can be used to update these databases or to create new ones for other languages.

Currently, SemUr has the following languages

  • SemSms - Skolt Sami
  • SemKpv - Komi Zyrian
  • SemMyv - Erzya
  • SemMdf - Moksha

 

Cite as

Hämäläinen, Mika. (2018). Extracting a Semantic Database with Syntactic Relations for Finnish to Boost Resources for Endangered Uralic Languages. In The Proceedings of Logic and Engineering of Natural Language Semantics 15 (LENLS15)

 

Files (2.4 GB)
Name Size
semfi_translate.py
md5:53dace2e45136935316fabe3f8f2f0d2
8.0 kB Download
semkpv.db
md5:529e5aa351c863deeafe2c5fd9e77725
547.2 MB Download
semmdf.db
md5:500b8539a17e438f94d4a36697f8586c
400.1 MB Download
semmyv.db
md5:e141d27378c77bc7320d8d06094733ac
539.9 MB Download
semsms.db
md5:4904a566df9699d9d9af87a18d58ac1a
884.1 MB Download
122
196
views
downloads
All versions This version
Views 12254
Downloads 196175
Data volume 99.2 GB91.8 GB
Unique views 10650
Unique downloads 155149

Share

Cite as