Dataset Open Access

TAU Spatial Sound Events 2019 - Ambisonic and Microphone Array, Development Datasets

Sharath Adavanne; Archontis Politis; Tuomas Virtanen


MARC21 XML Export

<?xml version='1.0' encoding='UTF-8'?>
<record xmlns="http://www.loc.gov/MARC21/slim">
  <leader>00000nmm##2200000uu#4500</leader>
  <datafield tag="653" ind1=" " ind2=" ">
    <subfield code="a">Ambisonic</subfield>
  </datafield>
  <datafield tag="653" ind1=" " ind2=" ">
    <subfield code="a">Computational Auditory Scene Analysis</subfield>
  </datafield>
  <datafield tag="653" ind1=" " ind2=" ">
    <subfield code="a">Sound Event Detection</subfield>
  </datafield>
  <datafield tag="653" ind1=" " ind2=" ">
    <subfield code="a">DOA Estimation</subfield>
  </datafield>
  <datafield tag="653" ind1=" " ind2=" ">
    <subfield code="a">Sound Event Localization and Detection</subfield>
  </datafield>
  <datafield tag="653" ind1=" " ind2=" ">
    <subfield code="a">Acoustic Event Detection</subfield>
  </datafield>
  <controlfield tag="005">20200124192550.0</controlfield>
  <controlfield tag="001">2599196</controlfield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">Tampere University</subfield>
    <subfield code="0">(orcid)0000-0002-0595-2356</subfield>
    <subfield code="a">Archontis Politis</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">Tampere University</subfield>
    <subfield code="a">Tuomas Virtanen</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">2147483648</subfield>
    <subfield code="z">md5:bd5b18a47a3ed96e80069baa6b221a5a</subfield>
    <subfield code="u">https://zenodo.org/record/2599196/files/foa_dev.z01</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">2147483648</subfield>
    <subfield code="z">md5:5194ebf43ae095190ed78691ec9889b1</subfield>
    <subfield code="u">https://zenodo.org/record/2599196/files/foa_dev.z02</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">136334885</subfield>
    <subfield code="z">md5:2154ad0d9e1e45bfc933b39591b49206</subfield>
    <subfield code="u">https://zenodo.org/record/2599196/files/foa_dev.zip</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">1722</subfield>
    <subfield code="z">md5:938608750cf730fd98a8646bfe75718e</subfield>
    <subfield code="u">https://zenodo.org/record/2599196/files/LICENSE</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">386914</subfield>
    <subfield code="z">md5:c2e5c8b0ab430dfd76c497325171245d</subfield>
    <subfield code="u">https://zenodo.org/record/2599196/files/metadata_dev.zip</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">2147483648</subfield>
    <subfield code="z">md5:3234cf0bfa7b71465ae1d67c833f7c12</subfield>
    <subfield code="u">https://zenodo.org/record/2599196/files/mic_dev.z01</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">1532650663</subfield>
    <subfield code="z">md5:6426da74fecb351dd5add56716499e40</subfield>
    <subfield code="u">https://zenodo.org/record/2599196/files/mic_dev.zip</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">17044</subfield>
    <subfield code="z">md5:41f8ab442fd2a6c0ae554c77e4a2062e</subfield>
    <subfield code="u">https://zenodo.org/record/2599196/files/README.html</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">13940</subfield>
    <subfield code="z">md5:4aa8f1ed840b0865ac61375ef9dd52de</subfield>
    <subfield code="u">https://zenodo.org/record/2599196/files/README.md</subfield>
  </datafield>
  <datafield tag="542" ind1=" " ind2=" ">
    <subfield code="l">open</subfield>
  </datafield>
  <datafield tag="260" ind1=" " ind2=" ">
    <subfield code="c">2019-02-28</subfield>
  </datafield>
  <datafield tag="909" ind1="C" ind2="O">
    <subfield code="p">openaire_data</subfield>
    <subfield code="p">user-tut-arg</subfield>
    <subfield code="o">oai:zenodo.org:2599196</subfield>
  </datafield>
  <datafield tag="100" ind1=" " ind2=" ">
    <subfield code="u">Tampere University</subfield>
    <subfield code="0">(orcid)0000-0002-5001-6911</subfield>
    <subfield code="a">Sharath Adavanne</subfield>
  </datafield>
  <datafield tag="245" ind1=" " ind2=" ">
    <subfield code="a">TAU Spatial Sound Events 2019 - Ambisonic and Microphone Array, Development Datasets</subfield>
  </datafield>
  <datafield tag="980" ind1=" " ind2=" ">
    <subfield code="a">user-tut-arg</subfield>
  </datafield>
  <datafield tag="536" ind1=" " ind2=" ">
    <subfield code="c">637422</subfield>
    <subfield code="a">Computational Analysis of Everyday Soundscapes</subfield>
  </datafield>
  <datafield tag="540" ind1=" " ind2=" ">
    <subfield code="a">Other (Non-Commercial)</subfield>
  </datafield>
  <datafield tag="650" ind1="1" ind2="7">
    <subfield code="a">cc-by</subfield>
    <subfield code="2">opendefinition.org</subfield>
  </datafield>
  <datafield tag="520" ind1=" " ind2=" ">
    <subfield code="a">&lt;p&gt;This package consists of two development datasets, &lt;strong&gt;TAU Spatial Sound Events 2019 - Ambisonic&lt;/strong&gt;&amp;nbsp;and &lt;strong&gt;TAU Spatial Sound Events 2019 - Microphone Array&lt;/strong&gt;. These datasets contain recordings from an identical scene, with &lt;strong&gt;TAU Spatial Sound Events 2019 - Ambisonic&lt;/strong&gt;&amp;nbsp;providing four-channel First-Order Ambisonic (FOA) recordings while &lt;strong&gt;TAU Spatial Sound Events 2019 - Microphone Array&lt;/strong&gt;&amp;nbsp;provides four-channel directional microphone recordings from a tetrahedral array configuration. Both formats are extracted from the same microphone array. The recordings in the two datasets consist of stationary point sources from multiple sound classes each associated with a temporal onset and offset time, and DOA coordinate represented using azimuth and elevation angle. These development datasets are part of the &lt;a href="https://github.com/sharathadavanne/seld-dcase2019"&gt;DCASE 2019 Sound Event Localization and Detection Task&lt;/a&gt;.&lt;/p&gt;

&lt;p&gt;Each development set consists of 400 one-minute-long recordings sampled at 48000 Hz, divided into four cross-validation splits of 100 recordings each. These recordings were synthesized using spatial room impulse responses (IRs) collected from five indoor locations, at 504 unique combinations of azimuth-elevation-distance. To synthesize the recordings, the collected IRs were convolved with the &lt;a href="http://www.cs.tut.fi/sgn/arg/dcase2016/task-sound-event-detection-in-synthetic-audio#audio-dataset"&gt;isolated sound events dataset from DCASE 2016 task 2&lt;/a&gt;. Finally, to create a realistic sound scene recording, natural ambient noise collected in the IR recording locations was added to the synthesized recordings such that the average SNR of the sound events was 30 dB.&lt;/p&gt;

&lt;p&gt;The IRs were collected in Finland by Tampere University between 12/2017 and 06/2018. The data collection received funding from the European Research Council, grant agreement 637422 EVERYSOUND.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Download instructions&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;The three files, &lt;strong&gt;&lt;em&gt;foa_dev.z01&lt;/em&gt;&lt;/strong&gt;, &lt;strong&gt;&lt;em&gt;foa_dev.z02&lt;/em&gt;&lt;/strong&gt;&amp;nbsp;and &lt;strong&gt;&lt;em&gt;foa_dev.zip&lt;/em&gt;&lt;/strong&gt;, contain the audio data of the &lt;strong&gt;TAU Spatial Sound Events 2019 - Ambisonic&lt;/strong&gt;&amp;nbsp;development dataset.&lt;br&gt;
The two files, &lt;strong&gt;&lt;em&gt;mic_dev.z01&lt;/em&gt;&lt;/strong&gt;&amp;nbsp;and &lt;strong&gt;&lt;em&gt;mic_dev.zip&lt;/em&gt;&lt;/strong&gt;, contain the audio data of the &lt;strong&gt;TAU Spatial Sound Events 2019 - Microphone Array&lt;/strong&gt;&amp;nbsp;development dataset.&lt;br&gt;
The file &lt;strong&gt;&lt;em&gt;metadata_dev.zip&lt;/em&gt;&lt;/strong&gt;&amp;nbsp;contains the common metadata for both the &lt;strong&gt;TAU Spatial Sound Events 2019 - Ambisonic&lt;/strong&gt;&amp;nbsp;and &lt;strong&gt;TAU Spatial Sound Events 2019 - Microphone Array&lt;/strong&gt;&amp;nbsp;development datasets.&lt;/p&gt;

&lt;p&gt;Download all parts of the split archive (the .z01/.z02 files together with the .zip file) for the dataset of interest into the same directory, then extract them with a compression tool that supports split zip archives.&lt;/p&gt;</subfield>
  </datafield>
  <datafield tag="773" ind1=" " ind2=" ">
    <subfield code="n">doi</subfield>
    <subfield code="i">isVersionOf</subfield>
    <subfield code="a">10.5281/zenodo.2580090</subfield>
  </datafield>
  <datafield tag="024" ind1=" " ind2=" ">
    <subfield code="a">10.5281/zenodo.2599196</subfield>
    <subfield code="2">doi</subfield>
  </datafield>
  <datafield tag="980" ind1=" " ind2=" ">
    <subfield code="a">dataset</subfield>
  </datafield>
</record>
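
The 856 fields in the record above carry each file's URL, size in bytes, and MD5 checksum, so a download can be checked before the split archives are extracted. The following is a minimal sketch of that check, assuming the record has been saved as text and the files downloaded locally. The helper names `list_files` and `md5_of` are hypothetical and not part of any Zenodo tooling; the sample record is trimmed to the LICENSE entry from the export above.

```python
import hashlib
import xml.etree.ElementTree as ET

# Namespace used by the MARC21 "slim" XML export shown above.
MARC_NS = {"marc": "http://www.loc.gov/MARC21/slim"}

# Trimmed sample: only the LICENSE entry of the record above.
SAMPLE = """<record xmlns="http://www.loc.gov/MARC21/slim">
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">1722</subfield>
    <subfield code="z">md5:938608750cf730fd98a8646bfe75718e</subfield>
    <subfield code="u">https://zenodo.org/record/2599196/files/LICENSE</subfield>
  </datafield>
</record>"""


def list_files(marc_xml):
    """Return one dict (url, size, md5) per 856 datafield of a MARC21 record."""
    root = ET.fromstring(marc_xml)
    files = []
    for field in root.findall("marc:datafield[@tag='856']", MARC_NS):
        # Map subfield code -> text, e.g. {"s": "1722", "z": "md5:...", "u": "https://..."}
        codes = {
            sf.get("code"): (sf.text or "")
            for sf in field.findall("marc:subfield", MARC_NS)
        }
        files.append({
            "url": codes.get("u", ""),
            "size": int(codes.get("s", "0")),
            # Drop the "md5:" prefix so the value compares directly with hexdigest().
            "md5": codes.get("z", "").split(":", 1)[-1],
        })
    return files


def md5_of(path, chunk_size=1 << 20):
    """Compute the MD5 hex digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


if __name__ == "__main__":
    for entry in list_files(SAMPLE):
        name = entry["url"].rsplit("/", 1)[-1]
        print(name, entry["size"], entry["md5"])
        # After downloading, compare: md5_of(name) == entry["md5"]
```

Reading in chunks keeps memory use flat for the 2 GiB archive parts. A mismatch between `md5_of(path)` and the recorded value suggests an incomplete or corrupted download that should be fetched again before unzipping.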
                  All versions   This version
Views             3,461          2,419
Downloads         18,971         9,235
Data volume       31.4 TB        14.0 TB
Unique views      2,900          2,030
Unique downloads  2,928          2,402