Published January 3, 2020 | Version 1
Dataset Open

VOICe Dataset

  • 1. Audio Research Group, Tampere University

Description

VOICe: A novel dataset for the development and evaluation of generalizable sound event detection domain adaptation methods!

VOICe consists of 1449 different mixtures of three different sound events ("baby crying", "glass breaking", and "gunshot"):

  • 1242 mixtures with background noise of three different categories of acoustic scenes ("vehicle"," outdoors", and "indoors"), mixed under 2 SNR values (-3, -9 dB), that is 207 mixtures x 3 acoustic scenes x 2 SNRs = 1242
  •  207 mixtures without any background noise.

 VOICe is offered for sound event detection domain adaptation from one acoustic scene to another, or between sound events with background noise and without background noise.

You can also find more information about the dataset in our paper: https://arxiv.org/pdf/1911.07098.pdf

Files

Files (44.1 GB)

Name Size Download all
md5:511838a9a2036ebfdc430dba85700a88
3.5 GB Download
md5:5ba81dae0a0eaa81751a732c87a19e99
20.3 GB Download
md5:a20b2bad0e8bb3881c54964be4ac2f7f
20.3 GB Download

Additional details

Related works

Is documented by
Conference paper: https://arxiv.org/abs/1911.07098 (URL)

Funding

EVERYSOUND – Computational Analysis of Everyday Soundscapes 637422
European Commission