Published February 28, 2015 | Version 1
Dataset Open

Corpus of Spanish Word-in-Noise Confusions

  • 1. Ikerbasque and University of the Basque Country, Spain
  • 2. University of the Basque Country, Spain
  • 3. University of Illinois, USA

Description

This dataset is a large-scale corpus of robust noise-induced misperceptions in Spanish. The corpus contains 3235 consistent misperceptions; a misperception was included if at least 6 listeners in a group of 15 reported the same response. The dataset consists of a metadata table, separate audio waveforms for the speech and noise signals that led to each confusion, and masker waveforms.
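The inclusion rule can be illustrated with a minimal Python sketch. This is not the authors' selection code: the function name, the example words, and the assumption that a qualifying response must differ from the target word are all hypothetical, and the actual layout of the metadata table is not described here.

```python
from collections import Counter


def consistent_misperception(responses, target, min_agree=6):
    """Return the most frequent incorrect response if at least `min_agree`
    listeners reported it, otherwise None.

    Assumption (not stated in the record): responses equal to the target
    word are treated as correct and are not counted.
    """
    errors = Counter(r for r in responses if r != target)
    if not errors:
        return None
    response, count = errors.most_common(1)[0]
    return response if count >= min_agree else None


# Hypothetical responses from a group of 15 listeners for one stimulus.
responses = ["pasa"] * 7 + ["casa"] * 5 + ["masa"] * 3
print(consistent_misperception(responses, target="casa"))  # pasa
```

With 7 of 15 listeners agreeing on the same incorrect response, the stimulus in this made-up example would meet the threshold of 6.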

The corpus is described in the following journal article: http://dx.doi.org/10.1121/1.4905877

Notes

Corpus collection was funded by the European Community 7th Framework Programme Marie Curie Initial Training Network INSPIRE (Grant Agreement No. FP7-PEOPLE-2011-290000). We thank Dr. Jon Barker for valuable discussions.

Files

SpanishConfusionsCorpus.zip (286.1 MB)
md5:c9367123ebc2d6e290b7bc97f526a274
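
After downloading, the archive can be checked against the MD5 checksum listed above. The following is a minimal sketch using Python's standard `hashlib` module; the helper names `md5_of_file` and `verify` are illustrative, and the local path is assumed to be wherever the file was saved.

```python
import hashlib

# Checksum taken from the file listing of this record.
EXPECTED_MD5 = "c9367123ebc2d6e290b7bc97f526a274"


def md5_of_file(path, chunk_size=1 << 20):
    """Compute the MD5 hex digest of a file, reading it in 1 MiB chunks
    so the whole archive does not have to fit in memory."""
    digest = hashlib.md5()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify(path, expected=EXPECTED_MD5):
    """Return True if the file's MD5 digest matches the expected value."""
    return md5_of_file(path) == expected.lower()


# Example usage (assumes the archive is in the current directory):
# print("OK" if verify("SpanishConfusionsCorpus.zip") else "MISMATCH")
```

A mismatch usually indicates an incomplete or corrupted download, in which case the file should be downloaded again.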