Dataset Open Access

TUT Tietotalo Ambisonic Impulse Response

Sharath Adavanne; Joonas Nikunen; Archontis Politis; Tuomas Virtanen


MARC21 XML Export

<?xml version='1.0' encoding='UTF-8'?>
<record xmlns="http://www.loc.gov/MARC21/slim">
  <leader>00000nmm##2200000uu#4500</leader>
  <datafield tag="653" ind1=" " ind2=" ">
    <subfield code="a">Computational auditory scene analysis</subfield>
  </datafield>
  <datafield tag="653" ind1=" " ind2=" ">
    <subfield code="a">Room impulse response</subfield>
  </datafield>
  <datafield tag="653" ind1=" " ind2=" ">
    <subfield code="a">Impulse response</subfield>
  </datafield>
  <datafield tag="653" ind1=" " ind2=" ">
    <subfield code="a">Ambisonic</subfield>
  </datafield>
  <datafield tag="653" ind1=" " ind2=" ">
    <subfield code="a">DOA estimation</subfield>
  </datafield>
  <controlfield tag="005">20200124192517.0</controlfield>
  <controlfield tag="001">1443539</controlfield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">Tampere University of Technology, Finland</subfield>
    <subfield code="a">Joonas Nikunen</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">Aalto University, Finland</subfield>
    <subfield code="0">(orcid)0000-0002-0595-2356</subfield>
    <subfield code="a">Archontis Politis</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">Tampere University of Technology, Finland</subfield>
    <subfield code="a">Tuomas Virtanen</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">1691</subfield>
    <subfield code="z">md5:3d6a88819c2503f24108cb9fedc8369d</subfield>
    <subfield code="u">https://zenodo.org/record/1443539/files/LICENSE</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">1065349040</subfield>
    <subfield code="z">md5:edb89dc54f5023f59cbf7bdca3ee0cf7</subfield>
    <subfield code="u">https://zenodo.org/record/1443539/files/Tietotalo_RIR.mat</subfield>
  </datafield>
  <datafield tag="542" ind1=" " ind2=" ">
    <subfield code="l">open</subfield>
  </datafield>
  <datafield tag="260" ind1=" " ind2=" ">
    <subfield code="c">2018-10-03</subfield>
  </datafield>
  <datafield tag="909" ind1="C" ind2="O">
    <subfield code="p">openaire_data</subfield>
    <subfield code="p">user-tut-arg</subfield>
    <subfield code="o">oai:zenodo.org:1443539</subfield>
  </datafield>
  <datafield tag="100" ind1=" " ind2=" ">
    <subfield code="u">Tampere University of Technology, Finland</subfield>
    <subfield code="0">(orcid)0000-0002-5001-6911</subfield>
    <subfield code="a">Sharath Adavanne</subfield>
  </datafield>
  <datafield tag="245" ind1=" " ind2=" ">
    <subfield code="a">TUT Tietotalo Ambisonic Impulse Response</subfield>
  </datafield>
  <datafield tag="980" ind1=" " ind2=" ">
    <subfield code="a">user-tut-arg</subfield>
  </datafield>
  <datafield tag="536" ind1=" " ind2=" ">
    <subfield code="c">637422</subfield>
    <subfield code="a">Computational Analysis of Everyday Soundscapes</subfield>
  </datafield>
  <datafield tag="540" ind1=" " ind2=" ">
    <subfield code="a">Other (Non-Commercial)</subfield>
  </datafield>
  <datafield tag="650" ind1="1" ind2="7">
    <subfield code="a">cc-by</subfield>
    <subfield code="2">opendefinition.org</subfield>
  </datafield>
  <datafield tag="520" ind1=" " ind2=" ">
    <subfield code="a">&lt;p&gt;&lt;strong&gt;Tampere University of Technology (TUT) Tietotalo Ambisonic Impulse Response&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;This dataset consists of impulse responses (IR) from a real environment using the Eigenmike spherical microphone array. The recordings were done in a fairly large spaced corridor inside the university (Tietotalo building) with classrooms around it. The IR acquisition was done using a maximum length sequence (MLS). The measurement was done by slowly moving a Genelec G Two loudspeaker continuously playing the MLS around the Eigenmike in a circular trajectory. The playback volume was set to be 30 dB greater than the ambient sound level. The IRs were collected at elevations &amp;minus;40 to 40 with 10-degree increments at 1 m from the Eigenmike and at elevations &amp;minus;20 to 20 with 10-degree increments at 2 m.&amp;nbsp;&lt;/p&gt;

&lt;p&gt;The moving-source IRs were obtained by a freely available tool from CHiME challenge which estimates the time-varying responses in STFT domain by forming a least-squares regression between the known measurement signal and the far-field recording independently at each frequency. The IR for any azimuth within one trajectory can be analyzed by assuming block-wise stationarity of acoustic channel. The CHiME IR estimation tool was applied independently on all 32 channels of the Eigenmike. For the dataset creation, we analyzed the DOA of each time frame using MUSIC and extracted IRs for azimuthal angles at 10&amp;deg; resolution (36 IRs for each elevation).&lt;/p&gt;

&lt;p&gt;The IR file is in .mat format and can be read both in Matlab and Python. The details of the IR file are as following,&lt;/p&gt;

&lt;p&gt;Size: (2, 9, 1025, 36, 4, 32) = (distance_wrt_mic, elevation_wrt_mic, FFT, &amp;nbsp;azimuth_wrt_mic, blocks, channels).&lt;/p&gt;

&lt;p&gt;where,&lt;/p&gt;

&lt;p&gt;distance_wrt_mic = two distances (1m and 2m)&lt;br&gt;
elevation_wrt_mic = 9 elevation angles (-40:10:40) at distance 1m, and 5 elevations angles (-20:10:20) at distance 2m.&lt;br&gt;
azimuth_wrt_mic = 36 azimuth angles (-180:10:180) for all distance-elevation combination&lt;br&gt;
The IRs were extracted assuming block-wise stationarity (four blocks) for each frequency bin (1025 bins).&lt;/p&gt;

&lt;p&gt;During synthesis, after convolving the IR with a&amp;nbsp;sound event, the 32 channel audio will have to be transformed to Ambisonic format using the transformation matrix of Eigenmike.&lt;/p&gt;

&lt;p&gt;This dataset was collected as part of the &amp;#39;&lt;a href="https://github.com/sharathadavanne/seld-net"&gt;Sound event localization and detection of overlapping sources using convolutional recurrent neural network&lt;/a&gt;&amp;#39; work, more details about this IR dataset can be found in this work.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Data collector (s):&lt;/strong&gt; Fagerlund, Eemi; Koskimies, Aino&lt;/p&gt;</subfield>
  </datafield>
  <datafield tag="773" ind1=" " ind2=" ">
    <subfield code="n">doi</subfield>
    <subfield code="i">isVersionOf</subfield>
    <subfield code="a">10.5281/zenodo.1443538</subfield>
  </datafield>
  <datafield tag="024" ind1=" " ind2=" ">
    <subfield code="a">10.5281/zenodo.1443539</subfield>
    <subfield code="2">doi</subfield>
  </datafield>
  <datafield tag="980" ind1=" " ind2=" ">
    <subfield code="a">dataset</subfield>
  </datafield>
</record>
759
198
views
downloads
All versions This version
Views 759759
Downloads 198198
Data volume 181.1 GB181.1 GB
Unique views 729729
Unique downloads 103103

Share

Cite as