Published September 5, 2023
| Version v1
Dataset
Open
Vocal data in the Dioula language related to the numbers 1, 2, 3, and 4
Contributors
Data collector:
Project managers:
- 1. University of Bordeaux
- 2. UNB/ESI
Description
The dataset we have compiled for our research on "Setting up a speech recognition model for under-resourced languages" consists of audio recordings of Dioula speakers pronouncing the numbers 1, 2, 3, and 4. These recordings were collected under various conditions, featuring variability in speakers, accents, and environmental contexts. The data has been categorized into four distinct classes, each corresponding to one of the numbers (1, 2, 3, or 4), enabling the training and evaluation of a machine learning-based speech recognition model.
Files
AudiosDioula.zip
Files
(296.3 MB)
Name | Size | Download all |
---|---|---|
md5:c71906caefb8507dc03de87ab49ae77c
|
296.3 MB | Preview Download |