Journal article Open Access

Goal-driven, neurobiological-inspired convolutional neural network models of human spatial hearing

van der Heijden, Kiki; Mehrkanoon, Siamak

MARC21 XML Export

<?xml version='1.0' encoding='UTF-8'?>
<record xmlns="">
  <datafield tag="540" ind1=" " ind2=" ">
    <subfield code="u"></subfield>
    <subfield code="a">Creative Commons Attribution 4.0 International</subfield>
  <datafield tag="260" ind1=" " ind2=" ">
    <subfield code="c">2021-12-06</subfield>
  <controlfield tag="005">20211206134843.0</controlfield>
  <controlfield tag="001">5760870</controlfield>
  <datafield tag="909" ind1="C" ind2="O">
    <subfield code="p">openaire</subfield>
    <subfield code="o"></subfield>
  <datafield tag="520" ind1=" " ind2=" ">
    <subfield code="a">&lt;p&gt;The human brain effortlessly solves the complex computational task of sound localization using a mixture of spatial cues. How the brain performs this task in naturalistic listening environments (e.g. with reverberation) is not well understood. In the present paper, we build on the success of deep neural networks at solving complex and high-dimensional problems&amp;nbsp;&lt;a href=""&gt;[1]&lt;/a&gt;&amp;nbsp;to develop goal-driven, neurobiological-inspired convolutional neural network (CNN) models of human spatial hearing. After training, we visualize and quantify feature representations in intermediate layers to gain insights into the representational mechanisms underlying sound location encoding in CNNs. Our results show that neurobiological-inspired CNN models trained on real-life sounds spatialized with human binaural hearing characteristics can accurately predict sound location in the horizontal plane. CNN localization acuity across the azimuth resembles human sound localization acuity, but CNN models outperform human sound localization in the back. Training models with different objective functions - that is, minimizing either Euclidean or angular distance - modulates localization acuity in particular ways. Moreover, different implementations of binaural integration result in unique patterns of localization errors that resemble behavioral observations in humans. Finally, feature representations reveal a gradient of spatial selectivity across network layers, starting with broad spatial representations in early layers and progressing to sparse, highly selective spatial representations in deeper layers. In sum, our results show that neurobiological-inspired CNNs are a valid approach to modeling human spatial hearing. This work paves the way for future studies combining neural network models with empirical measurements of neural activity to unravel the complex computational mechanisms underlying neural sound location encoding in the human auditory pathway.&lt;/p&gt;</subfield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">Maastricht University</subfield>
    <subfield code="a">Mehrkanoon, Siamak</subfield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">2841361</subfield>
    <subfield code="z">md5:7301978befdf37cd89e435cfe467cbb8</subfield>
    <subfield code="u"></subfield>
  <datafield tag="542" ind1=" " ind2=" ">
    <subfield code="l">open</subfield>
  <datafield tag="980" ind1=" " ind2=" ">
    <subfield code="a">publication</subfield>
    <subfield code="b">article</subfield>
  <datafield tag="100" ind1=" " ind2=" ">
    <subfield code="u">Donders Institute</subfield>
    <subfield code="0">(orcid)0000-0003-4516-7907</subfield>
    <subfield code="a">van der Heijden, Kiki</subfield>
  <datafield tag="024" ind1=" " ind2=" ">
    <subfield code="a">10.1016/j.neucom.2021.05.104</subfield>
    <subfield code="2">doi</subfield>
  <datafield tag="245" ind1=" " ind2=" ">
    <subfield code="a">Goal-driven, neurobiological-inspired convolutional neural network models of human spatial hearing</subfield>
  <datafield tag="536" ind1=" " ind2=" ">
    <subfield code="c">898134</subfield>
    <subfield code="a">Representational Mechanisms of Neural Location Encoding of Real-life Sounds in Normal and Hearing Impaired Listeners.</subfield>
  <datafield tag="650" ind1="1" ind2="7">
    <subfield code="a">cc-by</subfield>
    <subfield code="2"></subfield>
Views 37
Downloads 22
Data volume 62.5 MB
Unique views 34
Unique downloads 21


Cite as