AISHELL-Stammertalk 中文口吃数据库 A Mandarin stuttered speech dataset
Contributors
Research group:
Description
Dataset official website: https://aishelltech.com/aishell_6A
This Zenodo page contains dataset samples. To access and download the full dataset, please send an application here https://opendata.aishelltech.com/stammertalk
The AISHELL-Stammertalk datasets consists of recordings from 70 native mardarin AWS (Adults who stutter), including 46 males and 24 females. The total duration is 48.8 hours. Each participant engaged in a recording session lasting up to one hour, comprising two parts: conversation and voice command reading. Conversations were conducted through online interviews using platforms like Zoom or Tencent Meet, aiming to capture spontaneous speech on diverse topics. The interviewer, one of the two authors, posed questions based on a prepared list, with the flexibility to introduce impromptu questions as needed.
In the voice command reading part, participants were tasked with reading a set of 200 commands, categorized into car navigation and smart home device interaction. To ensure variety, a new set of 200 commands was introduced for every 25 participants, resulting in a dataset featuring a total of 600 unique commands. Participants were encouraged to employ the Voluntary Stuttering technique, deliberately introducing stuttering.
Five types of stuttering were specified by the annotation guidelines, including:
[]: Word/phrase repetition. Designated for marking entire repeated character or phrase.
/b: block. Gasps for air or stuttered pauses.
/p: prolongation. Elongated phoneme.
/r: sound repetition. Repeated phoneme that do not constitute an entire character.
/i: interjections. Filler characters due to stuttering e.g., ‘嗯’, ‘啊’, or ‘呃’. Notably, naturally occurring interjections that don't disrupt the speech flow are excluded.
Files
Samples_AS-70.zip
Files
(1.2 MB)
Name | Size | Download all |
---|---|---|
md5:418b24c4ecc81b021382a8d2cf476d46
|
1.2 MB | Preview Download |
Additional details
Related works
- Is published in
- Conference paper: arXiv:2406.07256 (arXiv)
Dates
- Available
-
2024-07