Dataset Open Access

Nganasan and Kamas Speech Recognition Models

Partanen, Niko; Hämäläinen, Mika; Klooster, Tiina


MARC21 XML Export

<?xml version='1.0' encoding='UTF-8'?>
<record xmlns="http://www.loc.gov/MARC21/slim">
  <leader>00000nmm##2200000uu#4500</leader>
  <datafield tag="041" ind1=" " ind2=" ">
    <subfield code="a">nio</subfield>
  </datafield>
  <controlfield tag="005">20200926122653.0</controlfield>
  <controlfield tag="001">4029494</controlfield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">University of Helsinki</subfield>
    <subfield code="0">(orcid)0000-0001-9315-1278</subfield>
    <subfield code="a">Hämäläinen, Mika</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">Luua Forestry School</subfield>
    <subfield code="a">Klooster, Tiina</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">12753</subfield>
    <subfield code="z">md5:1098c8cd6ea53dbc59e75b8c10091ab4</subfield>
    <subfield code="u">https://zenodo.org/record/4029494/files/emur_kamas_batches.R</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">5384</subfield>
    <subfield code="z">md5:5998b3e977d68227666a910a5e453c81</subfield>
    <subfield code="u">https://zenodo.org/record/4029494/files/emur_kamas_restructure.R</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">741298485</subfield>
    <subfield code="z">md5:c9d9c48d44513461c629eee85516aa1a</subfield>
    <subfield code="u">https://zenodo.org/record/4029494/files/experiment_01_data.zip</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">335134339</subfield>
    <subfield code="z">md5:fd2edafd69f70bec128e6aeca1602ed4</subfield>
    <subfield code="u">https://zenodo.org/record/4029494/files/experiment_02_data.zip</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">391067762</subfield>
    <subfield code="z">md5:3b23815a42f18f97a3282032ce8a58c2</subfield>
    <subfield code="u">https://zenodo.org/record/4029494/files/experiment_03_data.zip</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">2582530402</subfield>
    <subfield code="z">md5:096dd12ab7cfc32605462a859929440f</subfield>
    <subfield code="u">https://zenodo.org/record/4029494/files/experiment_07_10_data.zip</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">707589843</subfield>
    <subfield code="z">md5:1cb081c4b5866523f9b10e68e3bbf6b4</subfield>
    <subfield code="u">https://zenodo.org/record/4029494/files/models_and_scripts.zip</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">4404</subfield>
    <subfield code="z">md5:161f36b4fd60edc96cc284499d60d541</subfield>
    <subfield code="u">https://zenodo.org/record/4029494/files/preprocess_kamas.py</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">6412</subfield>
    <subfield code="z">md5:f7e495f0939bbc59666e72beea98fa9c</subfield>
    <subfield code="u">https://zenodo.org/record/4029494/files/preprocess_nganasan.py</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">3510</subfield>
    <subfield code="z">md5:ad377a95cd3ba5979713c4d8d1b7145e</subfield>
    <subfield code="u">https://zenodo.org/record/4029494/files/README.md</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">50924</subfield>
    <subfield code="z">md5:2c7b8131d54e89ad2d4f12c56c6577be</subfield>
    <subfield code="u">https://zenodo.org/record/4029494/files/sampa_inventory.txt</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">318</subfield>
    <subfield code="z">md5:acd31c7719220e51118bab3d359d8db3</subfield>
    <subfield code="u">https://zenodo.org/record/4029494/files/xas-sampa.txt</subfield>
  </datafield>
  <datafield tag="542" ind1=" " ind2=" ">
    <subfield code="l">open</subfield>
  </datafield>
  <datafield tag="260" ind1=" " ind2=" ">
    <subfield code="c">2020-09-14</subfield>
  </datafield>
  <datafield tag="909" ind1="C" ind2="O">
    <subfield code="p">openaire_data</subfield>
    <subfield code="o">oai:zenodo.org:4029494</subfield>
  </datafield>
  <datafield tag="100" ind1=" " ind2=" ">
    <subfield code="u">University of Helsinki</subfield>
    <subfield code="0">(orcid)0000-0001-8584-3880</subfield>
    <subfield code="a">Partanen, Niko</subfield>
  </datafield>
  <datafield tag="245" ind1=" " ind2=" ">
    <subfield code="a">Nganasan and Kamas Speech Recognition Models</subfield>
  </datafield>
  <datafield tag="540" ind1=" " ind2=" ">
    <subfield code="u">https://creativecommons.org/licenses/by-nc-sa/4.0/legalcode</subfield>
    <subfield code="a">Creative Commons Attribution Non Commercial Share Alike 4.0 International</subfield>
  </datafield>
  <datafield tag="650" ind1="1" ind2="7">
    <subfield code="a">cc-by</subfield>
    <subfield code="2">opendefinition.org</subfield>
  </datafield>
  <datafield tag="520" ind1=" " ind2=" ">
    <subfield code="a">&lt;p&gt;These are the models trained in our paper&lt;/p&gt;

&lt;p&gt;Partanen, N., H&amp;auml;m&amp;auml;l&amp;auml;inen, M. and Klooster, T. (2020)&amp;nbsp;Speech Recognition for Endangered and Extinct Samoyedic languages. In&amp;nbsp;&lt;em&gt;Proceedings of&amp;nbsp;the 34th Pacific Asia Conference on Language, Information and Computation&lt;/em&gt;.&lt;/p&gt;

&lt;p&gt;See the readme for more&lt;/p&gt;

&lt;p&gt;&lt;sub&gt;Based on corpora from&lt;/sub&gt;&lt;/p&gt;

&lt;p&gt;&lt;sub&gt;Gusev, Valentin; Klooster, Tiina; Wagner-Nagy, Be&amp;aacute;ta. 2019. &amp;quot;INEL Kamas Corpus.&amp;quot; Version 1.0. Publication date 2019-12-15. http://hdl.handle.net/11022/0000-0007-DA6E-9. Archived in Hamburger Zentrum f&amp;uuml;r Sprachkorpora. In: Wagner-Nagy, Be&amp;aacute;ta; Arkhipov, Alexandre; Ferger, Anne; Jettka, Daniel; Lehmberg, Timm (eds.). The INEL corpora of indigenous Northern Eurasian languages.&lt;/sub&gt;&lt;/p&gt;

&lt;p&gt;&lt;sub&gt;Maria Brykina, Valentin Gusev, Sandor Szever&amp;eacute;nyi, and Be&amp;aacute;ta Wagner-Nagy. 2018. Nganasan spoken language corpus (nslc). Archived in Hamburger Zentrumf&amp;uuml;r Sprachkorpora. Version 0.2. Publication date, 12.&lt;/sub&gt;&lt;/p&gt;</subfield>
  </datafield>
  <datafield tag="773" ind1=" " ind2=" ">
    <subfield code="n">doi</subfield>
    <subfield code="i">isVersionOf</subfield>
    <subfield code="a">10.5281/zenodo.4029493</subfield>
  </datafield>
  <datafield tag="024" ind1=" " ind2=" ">
    <subfield code="a">10.5281/zenodo.4029494</subfield>
    <subfield code="2">doi</subfield>
  </datafield>
  <datafield tag="980" ind1=" " ind2=" ">
    <subfield code="a">dataset</subfield>
  </datafield>
</record>
46
58
views
downloads
All versions This version
Views 4646
Downloads 5858
Data volume 24.5 GB24.5 GB
Unique views 3838
Unique downloads 2222

Share

Cite as