Dataset Open Access

SemUr - Semantic Databases for Uralic Languages

Hämäläinen, Mika

These databases are translated from SemFi by using Giellatekno XML dictionaries. The included python script can be used to update these databases or to create new ones for other languages.

Currently, SemUr has the following languages

  • SemSms - Skolt Sami
  • SemKpv - Komi Zyrian
  • SemMyv - Erzya
  • SemMdf - Moksha

 

Cite as

Hämäläinen, Mika. (2018). Extracting a Semantic Database with Syntactic Relations for Finnish to Boost Resources for Endangered Uralic Languages. In The Proceedings of Logic and Engineering of Natural Language Semantics 15 (LENLS15)

 

Files (2.4 GB)
Name Size
semfi_translate.py
md5:53dace2e45136935316fabe3f8f2f0d2
8.0 kB Download
semkpv.db
md5:529e5aa351c863deeafe2c5fd9e77725
547.2 MB Download
semmdf.db
md5:500b8539a17e438f94d4a36697f8586c
400.1 MB Download
semmyv.db
md5:e141d27378c77bc7320d8d06094733ac
539.9 MB Download
semsms.db
md5:4904a566df9699d9d9af87a18d58ac1a
884.1 MB Download
85
165
views
downloads
All versions This version
Views 8535
Downloads 165145
Data volume 84.9 GB77.6 GB
Unique views 7331
Unique downloads 126120

Share

Cite as