EEG Dataset for 'Decoding of selective attention to continuous speech from the human auditory brainstem response' and 'Neural Speech Tracking in the Theta and in the Delta Frequency Band Differentially Encode Clarity and Comprehension of Speech in Noise'.
Description
The repository contains the unprocessed EEG data recorded for the publications [1, 2]. For convenience, the onsets of the EEG data provided here are time-aligned with the onsets of the audio books in the 'audiobooks' folder, and the EEG data are provided in HDF5 format. Please refer to the original version of this dataset for more details.
More details, as well as the original data files, are available at the original repository here.
Examples of using these data (preprocessing, fitting linear models) can be found here.
The English conditions (clean, lb, mb, hb, fM, fW) comprised a single recording session. The Dutch conditions (cleanDutch, lbDutch, mbDutch, hbDutch) comprised a separate recording session. You see which participants took part in each session in session_info.json.
Please note some details about the stimulus presentation for the various listening conditions:
- English speech-in-babble-noise (lb, mb, hb): babble noise was played by itself for one second before the audiobook track began. The babble noise was also played for one second after the audiobook track ended. Therefore, you should discard the first second and the last second from these trial during your analysis.
- Dutch speech-in-babble-noise (lbDutch, mbDutch, hbDutch): the story (narrated in Dutch) was played by itself for one second before the babble noise track began. Then, the babble noise was increased linearly in amplitude for one second. Therefore, you should discard the first two seconds from these trials during your analysis.
- Dutch in quiet, and Dutch-in-babble-noise (cleanDutch, lbDutch, mbDutch, hbDutch): some English sentences were embedded in the Dutch narratives in order to encourage attention. You should crop these from your analysis. The onsets and offsets of the English sentences (in samples, at 44100Hz) are provided in the audiobooks/*Dutch/english_onsets_info.json files.
- Competing-speakers conditions (fM, fW): sometimes the attended track is longer than the unattended track, or vice-versa. The onsets of both tracks are aligned. You should crop the trial to the length of the shortest track for your analysis.
If you use this data, please cite the original publications, as well as this repository [1,2,3].
[1] Etard O, Kegler M, Braiman C, Forte A E and Reichenbach T. “Decoding of selective attention to continuous speech from the human auditory brainstem response” 2019. NeuroImage 200 1–11
[2] Etard O and Reichenbach T. “Neural speech tracking in the theta and in the delta frequency band differentially encode clarity and comprehension of speech in noise” 2019. J. Neurosci. 39 5750–9
[3] Etard O and Reichenbach T. "EEG Dataset for 'Decoding of selective attention to continuous speech from the human auditory brainstem response' and 'Neural Speech Tracking in the Theta and in the Delta Frequency Band Differentially Encode Clarity and Comprehension of Speech in Noise". Doi: 10.5281/zenodo.7086208