Clear Speech Data for Syllable-Rate-Adjusted-Modulation (SRAM)

doi:10.5281/zenodo.8432843

Published October 11, 2023 | Version v1

Video/Audio Open

Yang, Ye¹

1. University of California, Irvine

Researcher:

Liu, Sheng¹

Supervisor:

Zeng, Fan-Gang¹

1. University of California, Irvine

This dataset accompanies the paper "Syllable-Rate-Adjusted-Modulation (SRAM) Predicts Clear and Conversational Speech Intelligibility". It contains 144 sentences recorded from two talkers (one female, one male) in both clear and conversational styles (72 sentences in each style). The sampling rate is 16,000 Hz. Silent periods before and after the speech were removed. The speech scripts for each speaking style and the human performance data are included in each sub-folder.
SSN.wav is the steady-state noise used to create the noisy speech stimuli.
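
As an illustration of how a noisy stimulus could be built from a clean sentence and SSN.wav, the sketch below scales a noise segment to a target signal-to-noise ratio and adds it to the speech. It is a minimal sketch, not the authors' procedure: the function names (rms, mix_at_snr), the SNR definition (ratio of whole-utterance RMS levels), and the choice to take the noise from the start of the noise signal are all assumptions, and the record does not state which SNR values were used. The inputs are plain sequences of samples; reading the WAV files is left to whichever audio library you prefer.

```python
import math


def rms(samples):
    """Root-mean-square level of a sequence of samples."""
    return math.sqrt(sum(x * x for x in samples) / len(samples))


def mix_at_snr(speech, noise, snr_db):
    """Return speech plus noise scaled so that the speech-to-noise
    RMS ratio equals snr_db (in dB).

    Assumption: SNR is computed over the whole utterance, and the
    first len(speech) samples of the noise are used.
    """
    if len(noise) < len(speech):
        raise ValueError("noise must be at least as long as speech")
    noise = noise[:len(speech)]
    noise_level = rms(noise)
    if noise_level == 0:
        raise ValueError("noise segment is silent")
    # Gain that brings the noise to the level implied by the target SNR.
    gain = rms(speech) / (noise_level * 10 ** (snr_db / 20))
    return [s + gain * n for s, n in zip(speech, noise)]
```

The mixed signal can exceed the original amplitude range, so it may need rescaling before being written back to a fixed-point WAV file.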

File Structure:
- Female
    - Clear
        - 1.wav
        - ...
        - 72.wav
    - Convo
        - 1.wav
        - ...
        - 72.wav
    - human_results.csv
    - key_words_clear.txt
    - key_words_conv.txt
- Male
- SSN.wav
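
The layout above can be turned into a quick completeness check after extracting the archive. The following is a minimal sketch under stated assumptions: expected_files and missing_files are hypothetical helper names, the root argument is wherever the archive was unpacked (the top-level folder name is not given in this record), and using it with talker="Male" assumes the Male folder mirrors the Female one, which the listing does not spell out.

```python
from pathlib import Path


def expected_files(root, talker="Female", n_sentences=72):
    """List the paths the documented layout implies for one talker."""
    base = Path(root) / talker
    paths = [
        base / style / f"{i}.wav"
        for style in ("Clear", "Convo")
        for i in range(1, n_sentences + 1)
    ]
    # Per-talker result and script files shown in the listing.
    paths += [
        base / name
        for name in ("human_results.csv", "key_words_clear.txt", "key_words_conv.txt")
    ]
    return paths


def missing_files(root, talker="Female"):
    """Return the expected paths that do not exist on disk."""
    return [p for p in expected_files(root, talker) if not p.exists()]
```

An empty list from missing_files means every file named in the listing was found for that talker.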

Files (14.2 MB)

Name	Size
ClearSpeechToShare.tar.gz (md5:b45ffccc5dff1c98b00e04f430ee5ad8)	14.2 MB
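
The MD5 checksum listed for the archive can be used to confirm that a download is intact. Below is a minimal sketch using Python's standard hashlib module; the helper name md5_of_file and the local file path are assumptions, while the expected digest is the one given above.

```python
import hashlib

# Digest published with the archive on this record.
EXPECTED_MD5 = "b45ffccc5dff1c98b00e04f430ee5ad8"


def md5_of_file(path, chunk_size=1 << 20):
    """Compute the MD5 hex digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()
```

For example, md5_of_file("ClearSpeechToShare.tar.gz") == EXPECTED_MD5 should be True for an uncorrupted download, assuming the file was saved under that name in the working directory.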

Additional details

Liu, Sheng, Elsa Del Rio, Ann R. Bradlow, and Fan-Gang Zeng. 2004. "Clear Speech Perception in Acoustic and Electric Hearing." The Journal of the Acoustical Society of America 116 (4): 2374–83. https://doi.org/10.1121/1.1787528.
