Replicating Human Sound Localization with a Multi-Layer Perceptron

Sumner, Eric Michael; Unnthorsson, Runar; Riedel, Morris

doi:10.5281/zenodo.6797854

Published June 7, 2022 | Version v2

Conference paper Open

Replicating Human Sound Localization with a Multi-Layer Perceptron

1. University of Iceland
2. University of Iceland & Jülich Supercomputing Centre

One of the key capabilities of the human sense of hearing is to determine the direction from which a sound is emanating, a task known as localization. This paper describes the derivation of a machine learning model which performs the same localization task: Given an audio waveform which arrives at the listener’s eardrum, determine the direction of the audio source. Head-related transfer functions (HRTFs) from the ITA-HRTF database of 48 individuals are used to train and validate this model. A series of waveforms is generated from each HRTF, representing the sound pressure level at the listener’s eardrums for various source directions. A feature vector is calculated for each waveform from acoustical properties motivated by prior literature on sound localization; these feature vectors are used to train multi-layer perceptrons (MLPs), a form of artificial neural network, to replicate the behavior of single individuals. Data from three individuals are used to optimize hyperparameters of both the feature extraction and MLP stages for model accuracy. These hyperparameters are then validated by training and analyzing models for all 48 individuals in the database. The errors produced by each model fall in a log-normal distribution. The median model is capable of identifying, with 95% confidence, the sound source direction to within 20 degrees. This result is comparable to previously-reported human capabilities and thus shows that an MLP can successfully replicate the human sense of sound localization.

Files

35.pdf

Files (993.8 kB)

Name	Size	Download all
35.pdf md5:88bec812883221876a4c9385542e0e2b	993.8 kB	Preview Download

Citations

Oops! Something went wrong while fetching results.

	All versions	This version
Views	202	127
Downloads	225	114
Data volume	239.4 MB	127.2 MB

Replicating Human Sound Localization with a Multi-Layer Perceptron

Creators

Description

Files

35.pdf

Files (993.8 kB)