Dataset Open Access

Baule speech dataset

Dougban Monsia


JSON Export

{
  "files": [
    {
      "links": {
        "self": "https://zenodo.org/api/files/a0d9715c-4c09-4008-a865-270b52812cfb/bci-datasets.zip"
      }, 
      "checksum": "md5:0af939f1f4672ec887792ba1715ae3c1", 
      "bucket": "a0d9715c-4c09-4008-a865-270b52812cfb", 
      "key": "bci-datasets.zip", 
      "type": "zip", 
      "size": 46191343
    }
  ], 
  "owners": [
    354044
  ], 
  "doi": "10.5281/zenodo.6705861", 
  "stats": {
    "version_unique_downloads": 3.0, 
    "unique_views": 45.0, 
    "views": 50.0, 
    "version_views": 50.0, 
    "unique_downloads": 3.0, 
    "version_unique_views": 45.0, 
    "volume": 138574029.0, 
    "version_downloads": 3.0, 
    "downloads": 3.0, 
    "version_volume": 138574029.0
  }, 
  "links": {
    "doi": "https://doi.org/10.5281/zenodo.6705861", 
    "conceptdoi": "https://doi.org/10.5281/zenodo.6705860", 
    "bucket": "https://zenodo.org/api/files/a0d9715c-4c09-4008-a865-270b52812cfb", 
    "conceptbadge": "https://zenodo.org/badge/doi/10.5281/zenodo.6705860.svg", 
    "html": "https://zenodo.org/record/6705861", 
    "latest_html": "https://zenodo.org/record/6705861", 
    "badge": "https://zenodo.org/badge/doi/10.5281/zenodo.6705861.svg", 
    "latest": "https://zenodo.org/api/records/6705861"
  }, 
  "conceptdoi": "10.5281/zenodo.6705860", 
  "created": "2022-06-23T17:09:55.347150+00:00", 
  "updated": "2022-06-24T01:51:39.077705+00:00", 
  "conceptrecid": "6705860", 
  "revision": 2, 
  "id": 6705861, 
  "metadata": {
    "access_right_category": "success", 
    "doi": "10.5281/zenodo.6705861", 
    "description": "<p>The dataset was created to enable research on automatic speech recognition in Boul&eacute; (Baule) language. The dataset was intentionally created with this task in mind, in order to participate in the Google NLP Hack Series: Intro to ASR Africa Challenge hosted on the Zindi Africa platform. It contains about 565 recordings of participants reading a transcription in Baule as spoken in C&ocirc;te d&rsquo;Ivoire, one sentence at a time. Each example contains the audio files and the associated text. The audio is recorded in a less noisy environment by the speakers using their android phone. The<br>\ndataset is multi-speaker, containing recordings from 4 volunteers (2 males and 2 females), where each volunteer contributed up to 141 recordings. The recordings took place in Abidjan, C&ocirc;te d&rsquo;Ivoire in April 2022.</p>", 
    "contributors": [], 
    "title": "Baule  speech dataset", 
    "license": {
      "id": "CC-BY-4.0"
    }, 
    "relations": {
      "version": [
        {
          "count": 1, 
          "index": 0, 
          "parent": {
            "pid_type": "recid", 
            "pid_value": "6705860"
          }, 
          "is_last": true, 
          "last_child": {
            "pid_type": "recid", 
            "pid_value": "6705861"
          }
        }
      ]
    }, 
    "version": "1.0", 
    "keywords": [
      "bci", 
      "speech", 
      "audio", 
      "baule", 
      "baoul\u00e9", 
      "asr"
    ], 
    "publication_date": "2022-06-23", 
    "creators": [
      {
        "affiliation": "data354", 
        "name": "Dougban Monsia"
      }
    ], 
    "access_right": "open", 
    "resource_type": {
      "type": "dataset", 
      "title": "Dataset"
    }, 
    "related_identifiers": [
      {
        "scheme": "doi", 
        "identifier": "10.5281/zenodo.6705860", 
        "relation": "isVersionOf"
      }
    ]
  }
}
50
3
views
downloads
All versions This version
Views 5050
Downloads 33
Data volume 138.6 MB138.6 MB
Unique views 4545
Unique downloads 33

Share

Cite as