Dataset Open Access

Baule speech dataset

Dougban Monsia

Citation Style Language JSON Export

  "publisher": "Zenodo", 
  "DOI": "10.5281/zenodo.6705861", 
  "title": "Baule speech dataset", 
  "issued": {
    "date-parts": [
  "abstract": "<p>The dataset was created to enable research on automatic speech recognition in the Boulé (Baule) language. It was built specifically for this task, for participation in the Google NLP Hack Series: Intro to ASR Africa Challenge hosted on the Zindi Africa platform. It contains about 565 recordings of participants reading transcriptions in Baule as spoken in Côte d’Ivoire, one sentence at a time. Each example consists of an audio file and its associated text. The speakers recorded the audio on their Android phones in a relatively quiet environment. The dataset is multi-speaker, containing recordings from 4 volunteers (2 male and 2 female), each of whom contributed up to 141 recordings. The recordings were made in Abidjan, Côte d’Ivoire, in April 2022.</p>", 
  "author": [
      "family": "Dougban Monsia"
  "version": "1.0", 
  "type": "dataset", 
  "id": "6705861"
Statistics (all versions / this version)
Views: 50 / 50
Downloads: 3 / 3
Data volume: 138.6 MB / 138.6 MB
Unique views: 45 / 45
Unique downloads: 3 / 3


Cite as