Published September 14, 2020 | Version 1.0
Dataset Open

Nganasan and Kamas Speech Recognition Models

  • 1. University of Helsinki
  • 2. Luua Forestry School

Description

These are the models trained in our paper:

Partanen, N., Hämäläinen, M. and Klooster, T. (2020) Speech Recognition for Endangered and Extinct Samoyedic languages. In Proceedings of the 34th Pacific Asia Conference on Language, Information and Computation.

See the readme for more information.

Based on corpora from:

Gusev, Valentin; Klooster, Tiina; Wagner-Nagy, Beáta. 2019. "INEL Kamas Corpus." Version 1.0. Publication date 2019-12-15. http://hdl.handle.net/11022/0000-0007-DA6E-9. Archived in Hamburger Zentrum für Sprachkorpora. In: Wagner-Nagy, Beáta; Arkhipov, Alexandre; Ferger, Anne; Jettka, Daniel; Lehmberg, Timm (eds.). The INEL corpora of indigenous Northern Eurasian languages.

Maria Brykina, Valentin Gusev, Sandor Szeverényi, and Beáta Wagner-Nagy. 2018. Nganasan spoken language corpus (nslc). Archived in Hamburger Zentrum für Sprachkorpora. Version 0.2. Publication date, 12.

Files

experiment_01_data.zip

Files (4.8 GB)

MD5 checksum                              Size
md5:1098c8cd6ea53dbc59e75b8c10091ab4      12.8 kB
md5:5998b3e977d68227666a910a5e453c81      5.4 kB
md5:c9d9c48d44513461c629eee85516aa1a      741.3 MB
md5:fd2edafd69f70bec128e6aeca1602ed4      335.1 MB
md5:3b23815a42f18f97a3282032ce8a58c2      391.1 MB
md5:096dd12ab7cfc32605462a859929440f      2.6 GB
md5:1cb081c4b5866523f9b10e68e3bbf6b4      707.6 MB
md5:161f36b4fd60edc96cc284499d60d541      4.4 kB
md5:f7e495f0939bbc59666e72beea98fa9c      6.4 kB
md5:ad377a95cd3ba5979713c4d8d1b7145e      3.5 kB
md5:2c7b8131d54e89ad2d4f12c56c6577be      50.9 kB
md5:acd31c7719220e51118bab3d359d8db3      318 Bytes
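
The MD5 checksums in the file listing can be used to confirm that a download completed without corruption. The following is a minimal sketch, not part of the original record: the function names `md5_of_file` and `matches_listed_checksum` are hypothetical, and because the listing here does not show which checksum belongs to which file name, the pairing of a downloaded file with its checksum has to be taken from the record page itself.

```python
import hashlib


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in chunks.

    Chunked reading keeps memory use low for the multi-gigabyte archives.
    """
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listed_checksum(path, listed):
    """Compare a file against a checksum written in the 'md5:<hex>' form."""
    expected = listed.strip().lower()
    if expected.startswith("md5:"):
        expected = expected[len("md5:"):]
    return md5_of_file(path) == expected


# Hypothetical usage; which checksum belongs to this file is an assumption
# that must be checked against the record page:
# matches_listed_checksum("experiment_01_data.zip", "md5:<checksum from the listing>")
```

A result of `False` suggests either a damaged download or that the checksum was paired with the wrong file.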