There is a newer version of the record available.

Published August 4, 2021 | Version v1.0
Software Open

nikopartanen/komi-path-dataset: Komi Path Dataset

Creators

Description

This is an open access dataset that contains examples of via-cases in Komi varieties. Zyrian Komi is typologically unusual as it has two distinct cases to express this relatively rare function. Those cases are commonly called as prolative and transitive.

There is currently one dataset that represents standard written Komi, primarily from the early 20th century. The file that contains individual examples is located in data/observations.tsv, and the motion class classification done for individual predicates is stored in data/motion_classes.tsv. The materials can also be browsed in an accompanying website. In the future also a dataset that contains dialectal examples will be added. Authors of this dataset are Niko Partanen and Riku Erkkilä.

These materials are licensed under CC-BY license. The original data source is the Fenno-Ugrica collection of the National Library of Finland, and the majority of the texts are also available in Komi Online Library created and proofread by FU-Lab. We acknowledge that the materials used in the study are in Public Domain, and thereby can be used with no constraints. However, we appreciate citations to the dataset and publications where it has been used whenever that is reasonable. Please also acknowledge different authors and sources, and refer to them as appropriate.

Files

nikopartanen/komi-path-dataset-v1.0.zip

Files (1.6 MB)

Name Size Download all
md5:8b57a6d2bf5853bee7bbec1ec502c7b3
1.6 MB Preview Download

Additional details