Making sense of periodicity glimpses in a prediction-update-loop - a computational model of attentive voice tracking

Joanna Luberadzka; Hendrik Kayser; Volker Hohmann

doi:10.5281/zenodo.6674400

Published February 3, 2022 | Version v4

Journal article Open

Making sense of periodicity glimpses in a prediction-update-loop - a computational model of attentive voice tracking

1. University of Oldenburg

Humans are able to follow a given speaker even in challenging acoustic conditions. The perceptual mechanisms underlying this ability remain unclear. In this study, we present a computational model of attentive voice tracking, consisting of four main computational blocks: A) sparse periodicity-based auditory feature extraction, B) foreground-background segregation, C) state estimation and D) top-down knowledge. Conceptually, the model brings together ideas related to auditory glimpses, foreground-background segregation and Bayesian inference. Algorithmically, it combines sparse periodicity features, sequential Monte Carlo sampling and probabilistic voice models. We evaluate the model by comparing it with the data obtained by listeners in the study of Woods and McDermott (2015), which measured the ability to track of one of two competing voices with time-varying parameters (fundamental frequency (F 0) and first two formants (F 1, F 2)). We simulate two experiments: Stream Segregation of Sources Varying in Just One Feature and Effect of Source Proximity. In both experiments, we test three model versions, which differ in the type of information used in the segregation stage: version 1 uses oracle F0, version 2 uses estimated F0 and version 3 uses spectral shape derived from estimated F0 and oracle F1 and F2. Version 1 simulates optimal human performance in the conditions with the largest separation between the voices, version 2 simulates conditions where the separation between the voices in not sufficient for humans to follow the voices, and version 3 is closest to human performance for moderate separation between the voices.

Files

Final.pdf

Files (8.7 MB)

Name	Size	Download all
Final.pdf md5:d8f2491168e1f69ae69345118a3791ca	8.7 MB	Preview Download

Additional details

National Institutes of Health
Open community platform for hearing aid research 1R01DC015429-01

	All versions	This version
Views	506	59
Downloads	1,078	264
Data volume	8.6 GB	2.3 GB

Making sense of periodicity glimpses in a prediction-update-loop - a computational model of attentive voice tracking

Authors/Creators

Description

Files

Final.pdf

Files (8.7 MB)

Additional details

Funding