Clear Speech Data for Syllable-Rate-Adjusted-Modulation (SRAM)
University of California, Irvine
Description
This dataset is associated with the paper "Syllable-Rate-Adjusted-Modulation (SRAM) Predicts Clear and Conversational Speech Intelligibility". It contains 144 sentences recorded from two talkers (one female, one male) in both clear and conversational styles (72 sentences in each style). The sample rate is 16,000 Hz. Leading and trailing silence was removed from each recording. The speech scripts for each speaking style and the human performance results are included in each sub-folder.
SSN.wav is the steady-state noise used to create the noisy speech.
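As an illustration of how a noise file such as SSN.wav might be combined with a speech recording, here is a minimal sketch that mixes two signals at a target signal-to-noise ratio. The record does not state the SNRs or the exact mixing procedure used for the paper, so the RMS-based scaling, the function names `rms` and `mix_at_snr`, and the `snr_db` parameter are all assumptions made for this example.

```python
import math

def rms(samples):
    """Root-mean-square level of a sequence of samples."""
    return math.sqrt(sum(s * s for s in samples) / len(samples))

def mix_at_snr(speech, noise, snr_db):
    """Add noise to speech at a target SNR in dB (assumed RMS-based).

    The noise is truncated to the speech length and scaled so that
    20*log10(rms(speech) / rms(scaled_noise)) equals snr_db.
    Assumes the noise is at least as long as the speech.
    """
    noise = noise[:len(speech)]
    gain = rms(speech) / (rms(noise) * 10 ** (snr_db / 20))
    return [s + gain * n for s, n in zip(speech, noise)]
```

Both inputs are plain sequences of float samples at the same sample rate; reading the WAV files into such sequences is left to whichever audio library is at hand.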
File Structure:
- Female
  - Clear
    - 1.wav
    - ...
    - 72.wav
  - Convo
    - 1.wav
    - ...
    - 72.wav
  - human_results.csv
  - key_words_clear.txt
  - key_words_conv.txt
- Male
- SSN.wav
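The layout above can be walked programmatically. The sketch below builds the expected recording paths for one talker and style, and reads a file's sample rate with the standard-library `wave` module. The root folder name, the helper names `expected_wavs` and `sample_rate`, and the idea that the Male folder mirrors the Female folder are assumptions for illustration, not details confirmed by the record.

```python
import wave
from pathlib import Path

def expected_wavs(root, talker, style, n=72):
    """Paths 1.wav .. n.wav under root/talker/style.

    talker is "Female" or "Male"; style is "Clear" or "Convo".
    Assumes both talker folders share the same layout.
    """
    return [Path(root) / talker / style / f"{i}.wav" for i in range(1, n + 1)]

def sample_rate(path):
    """Return the sample rate stored in a WAV file's header."""
    with wave.open(str(path), "rb") as wav:
        return wav.getframerate()
```

For example, `expected_wavs("dataset", "Female", "Clear")` yields 72 paths, and calling `sample_rate` on any of them should return 16000 if the files match the description.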
Files
Size: 14.2 MB
md5:b45ffccc5dff1c98b00e04f430ee5ad8
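The MD5 checksum listed for the download can be used to confirm that the file arrived intact. The following is a minimal sketch using Python's `hashlib`; the archive file name is not given in this record, so any path passed to `file_chunks` is a placeholder, and the helper names are chosen for this example only.

```python
import hashlib

# Checksum as listed on the record.
EXPECTED_MD5 = "b45ffccc5dff1c98b00e04f430ee5ad8"

def file_chunks(path, chunk_size=1 << 20):
    """Yield the contents of a file in chunks of chunk_size bytes."""
    with open(path, "rb") as f:
        while True:
            chunk = f.read(chunk_size)
            if not chunk:
                break
            yield chunk

def md5_of_chunks(chunks):
    """Return the hex MD5 digest of an iterable of byte chunks."""
    digest = hashlib.md5()
    for chunk in chunks:
        digest.update(chunk)
    return digest.hexdigest()
```

A check would then look like `md5_of_chunks(file_chunks(path_to_download)) == EXPECTED_MD5`, where `path_to_download` stands for wherever the file was saved.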
Additional details
References
- Liu, Sheng, Elsa Del Rio, Ann R. Bradlow, and Fan-Gang Zeng. 2004. "Clear Speech Perception in Acoustic and Electric Hearing." The Journal of the Acoustical Society of America 116 (4): 2374–83. https://doi.org/10.1121/1.1787528.