Children speech recording (English, spontaneous speech + pre-defined sentences)

Published December 13, 2016 | Version v1

Dataset Open

The dataset contains audio recordings (lossless WAV) of 11 young children (age M=4.9 years old; 5 females, 6 males).

Recordings include:

The recordings are in English and the participants include both native and non-native speakers.

Each sample is recorded from 3 sources:

(note that, due to technical issues, a few (sample/microphone) combinations are missing).

For the free-speech recording, a manual segmentation of the utterances is provided as well.

Files

Name	Size	Download all
english_children.zip md5:1a4fd6116554593324a0a493e44a1eea	607.5 MB	Preview Download

DoRoThy – Donating Robots a Theory of Mind 657227: European Commission
DREAM – Development of Robot-Enhanced therapy for children with AutisM spectrum disorders 611391: European Commission
L2TOR – Second Language Tutoring using Social Robots 688014: European Commission

J. Kennedy, S. Lemaignan, C. Montassier, P. Lavalade, B. Irfan, F. Papadopoulos, E. Senft, T. Belpaeme (2017) Child Speech Recognition in Human-Robot Interaction: Evaluations and Recommendations