Published September 16, 2022 | Version 2.0.0
Dataset Open

EEG Dataset for 'Decoding of selective attention to continuous speech from the human auditory brainstem response' and 'Neural Speech Tracking in the Theta and in the Delta Frequency Band Differentially Encode Clarity and Comprehension of Speech in Noise'.

  • 1. Imperial College London
  • 2. FAU-Erlangen

Description

The repository contains the unprocessed EEG data recorded for the publications [1, 2]. For convenience, the EEG recordings provided here are time-aligned so that their onsets coincide with the onsets of the audiobooks in the 'audiobooks' folder, and the EEG data are provided in HDF5 format.

More details, as well as the original data files, are available at the original repository here.

Examples of using these data (preprocessing, fitting linear models) can be found here.

The English conditions (clean, lb, mb, hb, fM, fW) were recorded in a single session. The Dutch conditions (cleanDutch, lbDutch, mbDutch, hbDutch) were recorded in a separate session. The file session_info.json lists which participants took part in each session.

Please note some details about the stimulus presentation for the various listening conditions:

  • English speech-in-babble-noise (lb, mb, hb): babble noise was played by itself for one second before the audiobook track began. The babble noise was also played for one second after the audiobook track ended. Therefore, you should discard the first second and the last second from these trials during your analysis.
  • Dutch speech-in-babble-noise (lbDutch, mbDutch, hbDutch): the story (narrated in Dutch) was played by itself for one second before the babble noise track began. Then, the babble noise was increased linearly in amplitude for one second. Therefore, you should discard the first two seconds from these trials during your analysis.
  • Dutch in quiet and Dutch speech-in-babble-noise (cleanDutch, lbDutch, mbDutch, hbDutch): some English sentences were embedded in the Dutch narratives in order to encourage attention. You should exclude these segments from your analysis. The onsets and offsets of the English sentences (in samples, at 44100 Hz) are provided in the audiobooks/*Dutch/english_onsets_info.json files.
  • Competing-speakers conditions (fM, fW): sometimes the attended track is longer than the unattended track, or vice versa. The onsets of both tracks are aligned. You should crop the trial to the length of the shorter track for your analysis.
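
The trimming rules above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not code from the original repository: the names `analysis_window` and `keep_mask` are hypothetical, the EEG sampling rate `fs` must be supplied by the caller (it is not stated here), and the English-sentence boundaries are assumed to have already been read from english_onsets_info.json into a list of (onset, offset) pairs in 44100 Hz samples. For the English babble conditions, `duration_s` is assumed to be the full trial length, including the noise-only second at each end.

```python
import math
from typing import Iterable, List, Optional, Tuple

AUDIO_FS = 44100  # Hz; rate at which the English-sentence onsets/offsets are given

NO_TRIM = {"clean", "cleanDutch"}
ENGLISH_BABBLE = {"lb", "mb", "hb"}
DUTCH_BABBLE = {"lbDutch", "mbDutch", "hbDutch"}
COMPETING = {"fM", "fW"}


def analysis_window(condition: str, duration_s: float,
                    other_duration_s: Optional[float] = None) -> Tuple[float, float]:
    """Return (start, stop) in seconds of the part of a trial to keep."""
    if condition in ENGLISH_BABBLE:
        # Noise-only second at the start and at the end of the trial.
        return 1.0, duration_s - 1.0
    if condition in DUTCH_BABBLE:
        # One second of speech alone, then a one-second linear noise ramp.
        return 2.0, duration_s
    if condition in COMPETING:
        if other_duration_s is None:
            raise ValueError("competing-speaker trials need both track durations")
        # Onsets are aligned, so crop to the shorter of the two tracks.
        return 0.0, min(duration_s, other_duration_s)
    if condition in NO_TRIM:
        return 0.0, duration_s
    raise ValueError(f"unknown condition: {condition!r}")


def keep_mask(n_samples: int, fs: float, window: Tuple[float, float],
              excluded: Iterable[Tuple[int, int]] = ()) -> List[bool]:
    """Boolean mask over EEG samples: True where a sample should be analysed.

    `excluded` holds (onset, offset) pairs in AUDIO_FS samples, e.g. the
    embedded English sentences in the Dutch conditions.
    """
    first = math.ceil(window[0] * fs)
    last = min(n_samples, math.floor(window[1] * fs))
    mask = [first <= i < last for i in range(n_samples)]
    for onset, offset in excluded:
        # Round outwards so no part of an excluded segment is kept.
        lo = max(0, math.floor(onset * fs / AUDIO_FS))
        hi = min(n_samples, math.ceil(offset * fs / AUDIO_FS))
        for i in range(lo, hi):
            mask[i] = False
    return mask
```

For example, `analysis_window("fM", 10.0, 8.5)` keeps the first 8.5 seconds of a competing-speakers trial, and the resulting window can be passed to `keep_mask` together with the EEG sampling rate to obtain a per-sample selection.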

If you use these data, please cite the original publications, as well as this repository [1, 2, 3].

[1] Etard O, Kegler M, Braiman C, Forte A E and Reichenbach T. “Decoding of selective attention to continuous speech from the human auditory brainstem response” 2019. NeuroImage 200 1–11

[2] Etard O and Reichenbach T. “Neural speech tracking in the theta and in the delta frequency band differentially encode clarity and comprehension of speech in noise” 2019. J. Neurosci. 39 5750–9

[3] Etard O and Reichenbach T. "EEG Dataset for 'Decoding of selective attention to continuous speech from the human auditory brainstem response' and 'Neural Speech Tracking in the Theta and in the Delta Frequency Band Differentially Encode Clarity and Comprehension of Speech in Noise'". DOI: 10.5281/zenodo.7086208

Files

audiobooks.zip

Files (21.1 GB)

  • 9.7 kB (md5:1d5d1d80fa0e72514633b2585c45d479)
  • 1.9 GB (md5:40131b6b54c49aa98f197653dbe7f543)
  • 19.2 GB (md5:c47c1261c9ec82c7aa0a4bef3d7450c9)
  • 297 Bytes (md5:e653e9a95c624086e64d5d9393eac92c)
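
The MD5 checksums listed above can be used to confirm that a download completed without corruption. The following is a minimal sketch using Python's standard `hashlib` module; the helper names `md5_of_file` and `matches_listing` are hypothetical, and the file is read in chunks so that the multi-gigabyte archives do not need to fit in memory.

```python
import hashlib


def md5_of_file(path: str, chunk_size: int = 1 << 20) -> str:
    """Return the hexadecimal MD5 digest of the file at `path`."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # Read fixed-size chunks until the file is exhausted.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listing(path: str, listed: str) -> bool:
    """Compare a file against a checksum written as 'md5:<hex digest>'."""
    expected = listed.strip().lower()
    if expected.startswith("md5:"):
        expected = expected[len("md5:"):]
    return md5_of_file(path) == expected
```

A call such as `matches_listing(local_path, "md5:40131b6b54c49aa98f197653dbe7f543")` returns True only if the local file has that digest; `local_path` stands for wherever the corresponding file was saved.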