Dataset Open Access

Nganasan and Kamas Speech Recognition Models

Partanen, Niko; Hämäläinen, Mika; Klooster, Tiina


JSON-LD (schema.org) Export

{
  "inLanguage": {
    "alternateName": "nio", 
    "@type": "Language", 
    "name": "Nganasan"
  }, 
  "description": "<p>These are the models trained in our paper</p>\n\n<p>Partanen, N., H&auml;m&auml;l&auml;inen, M. and Klooster, T. (2020)&nbsp;Speech Recognition for Endangered and Extinct Samoyedic languages. In&nbsp;<em>Proceedings of&nbsp;the 34th Pacific Asia Conference on Language, Information and Computation</em>.</p>\n\n<p>See the readme for more</p>\n\n<p><sub>Based on corpora from</sub></p>\n\n<p><sub>Gusev, Valentin; Klooster, Tiina; Wagner-Nagy, Be&aacute;ta. 2019. &quot;INEL Kamas Corpus.&quot; Version 1.0. Publication date 2019-12-15. http://hdl.handle.net/11022/0000-0007-DA6E-9. Archived in Hamburger Zentrum f&uuml;r Sprachkorpora. In: Wagner-Nagy, Be&aacute;ta; Arkhipov, Alexandre; Ferger, Anne; Jettka, Daniel; Lehmberg, Timm (eds.). The INEL corpora of indigenous Northern Eurasian languages.</sub></p>\n\n<p><sub>Maria Brykina, Valentin Gusev, Sandor Szever&eacute;nyi, and Be&aacute;ta Wagner-Nagy. 2018. Nganasan spoken language corpus (nslc). Archived in Hamburger Zentrumf&uuml;r Sprachkorpora. Version 0.2. Publication date, 12.</sub></p>", 
  "license": "https://creativecommons.org/licenses/by-nc-sa/4.0/legalcode", 
  "creator": [
    {
      "affiliation": "University of Helsinki", 
      "@id": "https://orcid.org/0000-0001-8584-3880", 
      "@type": "Person", 
      "name": "Partanen, Niko"
    }, 
    {
      "affiliation": "University of Helsinki", 
      "@id": "https://orcid.org/0000-0001-9315-1278", 
      "@type": "Person", 
      "name": "H\u00e4m\u00e4l\u00e4inen, Mika"
    }, 
    {
      "affiliation": "Luua Forestry School", 
      "@type": "Person", 
      "name": "Klooster, Tiina"
    }
  ], 
  "url": "https://zenodo.org/record/4029494", 
  "datePublished": "2020-09-14", 
  "version": "1.0", 
  "@context": "https://schema.org/", 
  "distribution": [
    {
      "contentUrl": "https://zenodo.org/api/files/e87ce363-b7aa-4387-87fc-2042e4f4ad28/emur_kamas_batches.R", 
      "encodingFormat": "r", 
      "@type": "DataDownload"
    }, 
    {
      "contentUrl": "https://zenodo.org/api/files/e87ce363-b7aa-4387-87fc-2042e4f4ad28/emur_kamas_restructure.R", 
      "encodingFormat": "r", 
      "@type": "DataDownload"
    }, 
    {
      "contentUrl": "https://zenodo.org/api/files/e87ce363-b7aa-4387-87fc-2042e4f4ad28/experiment_01_data.zip", 
      "encodingFormat": "zip", 
      "@type": "DataDownload"
    }, 
    {
      "contentUrl": "https://zenodo.org/api/files/e87ce363-b7aa-4387-87fc-2042e4f4ad28/experiment_02_data.zip", 
      "encodingFormat": "zip", 
      "@type": "DataDownload"
    }, 
    {
      "contentUrl": "https://zenodo.org/api/files/e87ce363-b7aa-4387-87fc-2042e4f4ad28/experiment_03_data.zip", 
      "encodingFormat": "zip", 
      "@type": "DataDownload"
    }, 
    {
      "contentUrl": "https://zenodo.org/api/files/e87ce363-b7aa-4387-87fc-2042e4f4ad28/experiment_07_10_data.zip", 
      "encodingFormat": "zip", 
      "@type": "DataDownload"
    }, 
    {
      "contentUrl": "https://zenodo.org/api/files/e87ce363-b7aa-4387-87fc-2042e4f4ad28/models_and_scripts.zip", 
      "encodingFormat": "zip", 
      "@type": "DataDownload"
    }, 
    {
      "contentUrl": "https://zenodo.org/api/files/e87ce363-b7aa-4387-87fc-2042e4f4ad28/preprocess_kamas.py", 
      "encodingFormat": "py", 
      "@type": "DataDownload"
    }, 
    {
      "contentUrl": "https://zenodo.org/api/files/e87ce363-b7aa-4387-87fc-2042e4f4ad28/preprocess_nganasan.py", 
      "encodingFormat": "py", 
      "@type": "DataDownload"
    }, 
    {
      "contentUrl": "https://zenodo.org/api/files/e87ce363-b7aa-4387-87fc-2042e4f4ad28/README.md", 
      "encodingFormat": "md", 
      "@type": "DataDownload"
    }, 
    {
      "contentUrl": "https://zenodo.org/api/files/e87ce363-b7aa-4387-87fc-2042e4f4ad28/sampa_inventory.txt", 
      "encodingFormat": "txt", 
      "@type": "DataDownload"
    }, 
    {
      "contentUrl": "https://zenodo.org/api/files/e87ce363-b7aa-4387-87fc-2042e4f4ad28/xas-sampa.txt", 
      "encodingFormat": "txt", 
      "@type": "DataDownload"
    }
  ], 
  "identifier": "https://doi.org/10.5281/zenodo.4029494", 
  "@id": "https://doi.org/10.5281/zenodo.4029494", 
  "@type": "Dataset", 
  "name": "Nganasan and Kamas Speech Recognition Models"
}
33
19
views
downloads
All versions This version
Views 3333
Downloads 1919
Data volume 4.8 GB4.8 GB
Unique views 2727
Unique downloads 66

Share

Cite as