Modeling perception with hierarchical prediction: Auditory segmentation with deep predictive coding locates candidate evoked potentials in EEG

André Ofner; Sebastian Stober

doi:10.5281/zenodo.4245496

Published October 11, 2020 | Version v1

Conference paper Open

Modeling perception with hierarchical prediction: Auditory segmentation with deep predictive coding locates candidate evoked potentials in EEG

The human response to music combines low-level expectations that are driven by the perceptual characteristics of audio with high-level expectations from the context and the listener's expertise. This paper discusses surprisal based music representation learning with a hierarchical predictive neural network. In order to inspect the cognitive validity of the network's predictions along their time-scales, we use the network's prediction error to segment electroencephalograms (EEG) based on the audio signal. Using the NMED-T dataset on passive natural music listening we explore the automatic segmentation of audio and EEG into events using the suggested model. By averaging only the EEG signal at predicted locations, we were able to visualize auditory evoked potentials connected to local and global musical structures. This indicates the potential of unsupervised predictive learning with deep neural networks as means to retrieve musical structure from audio and as a basis to uncover the corresponding cognitive processes in the human brain.

Files

219.pdf

Files (2.1 MB)

Name	Size	Download all
219.pdf md5:ace3a07d8c3ee04833270c1129c51a54	2.1 MB	Preview Download

115

Views

Downloads

Show more details

	All versions	This version
Views	115	115
Downloads	91	91
Data volume	209.3 MB	209.3 MB

More info on how stats are collected....

DOI

Resource type

Conference paper

Publisher

ISMIR

Imprint

Proceedings of the 21st International Society for Music Information Retrieval Conference, 566-573. Montreal, Canada.

Conference

International Society for Music Information Retrieval Conference (ISMIR 2020) , Montreal, Canada, October 11-16, 2020

Creative Commons Attribution 4.0 International

The Creative Commons Attribution license allows re-distribution and re-use of a licensed work on the condition that the creator is appropriately credited. Read more

Technical metadata

Created: November 5, 2020
Modified: July 19, 2024

Modeling perception with hierarchical prediction: Auditory segmentation with deep predictive coding locates candidate evoked potentials in EEG

Creators

Description

Files

219.pdf

Files (2.1 MB)