Published October 11, 2023 | Version v1
Video/Audio Open

Clear Speech Data for Syllable-Rate-Adjusted-Modulation (SRAM)

Creators

  • 1. University of California, Irvine

Contributors

Researcher:

Supervisor:

  • 1. University of California, Irvine

Description

This Dataset is associated with the paper "Syllable-Rate-Adjusted-Modulation (SRAM) Predicts Clear and Conversational Speech Intelligibility". It contains 144 sentences recorded from two talkers (one female, one male) in both clear and conversational styles (72 sentences in each style). The sample rate was 16000Hz. The silence periods before and after the speech were removed. The speech scripts for each speech style and the human performance are included in each sub-folder.
SSN.wav is the steady-state noise used to create the noisy speeches.

File Structure:
- Female
    - Clear
        - 1.wav
        - ...
        - 72.wav
    - Convo
        - 1.wav
        - ...
        - 72.wav
    - human_results.csv
    - key_words_clear.txt
    - key_words_conv.txt
- Male
- SSN.wav

 

Files

Files (14.2 MB)

Name Size Download all
md5:b45ffccc5dff1c98b00e04f430ee5ad8
14.2 MB Download

Additional details

References

  • Liu, Sheng, Elsa Del Rio, Ann R. Bradlow, and Fan-Gang Zeng. 2004. "Clear Speech Perception in Acoustic and Electric Hearing." The Journal of the Acoustical Society of America 116 (4): 2374–83. https://doi.org/10.1121/1.1787528.