Published July 13, 2024 | Version 1.0.0
Dataset Open

AISHELL-Stammertalk 中文口吃数据库 A Mandarin stuttered speech dataset

Description

Dataset official website: https://aishelltech.com/aishell_6A

This Zenodo page contains dataset samples. To access and download the full dataset, please send an application here https://opendata.aishelltech.com/stammertalk

The AISHELL-Stammertalk datasets consists of recordings from 70 native mardarin AWS (Adults who stutter), including 46 males and 24 females. The total duration is 48.8 hours. Each participant engaged in a recording session lasting up to one hour, comprising two parts: conversation and voice command reading. Conversations were conducted through online interviews using platforms like Zoom or Tencent Meet, aiming to capture spontaneous speech on diverse topics. The interviewer, one of the two authors, posed questions based on a prepared list, with the flexibility to introduce impromptu questions as needed.

In the voice command reading part, participants were tasked with reading a set of 200 commands, categorized into car navigation and smart home device interaction. To ensure variety, a new set of 200 commands was introduced for every 25 participants, resulting in a dataset featuring a total of 600 unique commands. Participants were encouraged to employ the Voluntary Stuttering technique, deliberately introducing stuttering.

Five types of stuttering were specified by the annotation guidelines, including:
[]: Word/phrase repetition. Designated for marking entire repeated character or phrase.
/b: block. Gasps for air or stuttered pauses.
/p: prolongation. Elongated phoneme.
/r: sound repetition. Repeated phoneme that do not constitute an entire character.
/i: interjections. Filler characters due to stuttering e.g., ‘嗯’, ‘啊’, or ‘呃’. Notably, naturally occurring interjections that don't disrupt the speech flow are excluded.

Files

Samples_AS-70.zip

Files (1.2 MB)

Name Size Download all
md5:418b24c4ecc81b021382a8d2cf476d46
1.2 MB Preview Download

Additional details

Related works

Is published in
Conference paper: arXiv:2406.07256 (arXiv)

Dates

Available
2024-07