Published July 2, 2024 | Version 1
Preprint Open

Robust decoding of the speech envelope from EEG recordings through deep neural networks

  • 1. Imperial College London, UK
  • 2. Friedrich-Alexander-University Erlangen-Nürnberg, Germany
  • 1. Lyon Neuroscience Research Center, CNRS UMR5292, Inserm U1028, Université Claude Bernard Lyon 1, Université Jean Monnet Saint-Étienne, Lyon, France
  • 2. ENTPE, Laboratoire Génie Civil et Bâtiment, Vaulx-en-Velin, France
  • 3. Starkey France, Créteil, France

Description

During speech perception, a listener's brain activity tracks amplitude modulations in the speech signal, which are encoded in the speech envelope. The neural tracking of the speech envelope is modulated by cognitive factors such as attention to one of several competing speakers. Decoding the speech envelope from noninvasive electroencephalography (EEG) may be useful in future auditory prosthesis that could restore speech comprehension in noisy environments. Such applications require, however, a robust decoding of the speech envelope that functions in different acoustic conditions and that generalizes between different participants. Here we show that deep neural networks (DNNs) can lead to an enhanced decoding that has around 38% higher performance than the standard method, linear regression. The advantage of the DNNs persists across different acoustic scenarios and also when listener-independent decoders are used. We also show how an improved envelope decoding performance translates into a higher auditory attention decoding accuracy for the DNNs, in comparison to the method of linear modelling. Our work therefore demonstrates that DNNs show promise for data efficient, real-world auditory attention decoding.

Files

ISH2022_Thornton_etal.pdf

Files (386.3 kB)

Name Size Download all
md5:966151e34921fefa39a4f6e86eb220a7
386.3 kB Preview Download

Additional details

Funding

UKRI Centre for Doctoral Training in Artificial Intelligence for Healthcare EP/S023283/1
UK Research and Innovation