Dataset Open Access

TAU Spatial Sound Events 2019 - Ambisonic and Microphone Array, Development Datasets

Sharath Adavanne; Archontis Politis; Tuomas Virtanen

This package consists of two development datasets, TAU Spatial Sound Events 2019 - Ambisonic and TAU Spatial Sound Events 2019 - Microphone Array. These datasets contain recordings from an identical scene, with TAU Spatial Sound Events 2019 - Ambisonic providing four-channel First-Order Ambisonic (FOA) recordings while TAU Spatial Sound Events 2019 - Microphone Array provides four-channel directional microphone recordings from a tetrahedral array configuration. Both formats are extracted from the same microphone array. The recordings in the two datasets consist of stationary point sources from multiple sound classes each associated with a temporal onset and offset time, and DOA coordinate represented using azimuth and elevation angle. These development datasets are part of the DCASE 2019 Sound Event Localization and Detection Task.

Both the development set consists of 400, one minute long recordings sampled at 48000 Hz, and divided into four cross-validation splits of 100 recordings each. These recordings were synthesized using spatial room impulse response (IRs) collected from five indoor locations, at 504 unique combinations of azimuth-elevation-distance. Furthermore, in order to synthesize the recordings, the collected IRs were convolved with isolated sound events dataset from DCASE 2016 task 2. Finally, to create a realistic sound scene recording, natural ambient noise collected in the IR recording locations was added to the synthesized recordings such that the average SNR of the sound events was 30 dB.

The IRs were collected in Finland by Tampere University between 12/2017 - 06/2018. The data collection received funding from the European Research Council, grant agreement 637422 EVERYSOUND.

Download instructions

The three files,  foa_dev.z01, foa_dev.z02 and foa_dev.zip, correspond to audio data of TAU Spatial Sound Events 2019 - Ambisonic development dataset.
The two files, mic_dev.z01 and, mic_dev.zip, correspond to audio data of TAU Spatial Sound Events 2019 - Microphone Array development dataset.
The metadata_dev.zip is the common metadata for both TAU Spatial Sound Events 2019 - Ambisonic and TAU Spatial Sound Events 2019 - Microphone Array development datasets.

Download the zip files corresponding to the dataset of interest and use your favorite compression tool to unzip these split zip files.
 

Files (8.1 GB)
Name Size
foa_dev.z01
md5:bd5b18a47a3ed96e80069baa6b221a5a
2.1 GB Download
foa_dev.z02
md5:5194ebf43ae095190ed78691ec9889b1
2.1 GB Download
foa_dev.zip
md5:2154ad0d9e1e45bfc933b39591b49206
136.3 MB Download
LICENSE
md5:938608750cf730fd98a8646bfe75718e
1.7 kB Download
metadata_dev.zip
md5:c2e5c8b0ab430dfd76c497325171245d
386.9 kB Download
mic_dev.z01
md5:3234cf0bfa7b71465ae1d67c833f7c12
2.1 GB Download
mic_dev.zip
md5:6426da74fecb351dd5add56716499e40
1.5 GB Download
README.html
md5:41f8ab442fd2a6c0ae554c77e4a2062e
17.0 kB Download
README.md
md5:4aa8f1ed840b0865ac61375ef9dd52de
13.9 kB Download
1,964
11,845
views
downloads
All versions This version
Views 1,9641,330
Downloads 11,8454,532
Data volume 19.3 TB6.5 TB
Unique views 1,6831,137
Unique downloads 1,5551,167

Share

Cite as