Info: Zenodo’s user support line is staffed on regular business days between Dec 23 and Jan 5. Response times may be slightly longer than normal.

Published September 5, 2023 | Version v1
Dataset Open

Vocal data in the Dioula language related to the numbers 1, 2, 3, and 4

  • 1. UNB/ESI
  • 1. University of Bordeaux
  • 2. UNB/ESI

Description

The dataset we have compiled for our research on "Setting up a speech recognition model for under-resourced languages" consists of audio recordings of Dioula speakers pronouncing the numbers 1, 2, 3, and 4. These recordings were collected under various conditions, featuring variability in speakers, accents, and environmental contexts. The data has been categorized into four distinct classes, each corresponding to one of the numbers (1, 2, 3, or 4), enabling the training and evaluation of a machine learning-based speech recognition model.

Files

AudiosDioula.zip

Files (296.3 MB)

Name Size Download all
md5:c71906caefb8507dc03de87ab49ae77c
296.3 MB Preview Download