DESED

Turpault, Nicolas; Serizel, Romain; Shah, Ankit; Salamon, Justin

doi:10.5281/zenodo.3550599

Published November 19, 2019 | Version v1

Video/Audio Open

1. Université de Lorraine, CNRS, Inria, Loria, F-54000 Nancy, France
2. Language Technologies Institute, Carnegie Mellon University, Pittsburgh PA, United States
3. Adobe Research, San Francisco CA, United States

Associated GitHub repository: [https://github.com/turpaultn/Desed_synthetic](https://github.com/turpaultn/Desed_synthetic)

## Description
This repository provides the information and code needed to download the data and to reproduce the synthetic dataset used in DCASE 2019 task 4, along with examples of how to create your own data using [Scaper](https://github.com/justinsalamon/scaper) [[1]](#1).

You can find information about this dataset in this paper: [link](https://hal.inria.fr/hal-02160855).
The evaluation part was submitted to ICASSP and will be updated later.

* Background files are extracted from SINS [[2]](#2), MUSAN [[3]](#3) or YouTube, and were selected because they contain very few occurrences of our sound event classes.
* Foreground files are extracted from Freesound [[4]](#4)[[5]](#5). They were manually verified for quality and segmented to remove silences.

## References
<a id="1">[1]</a> J. Salamon, D. MacConnell, M. Cartwright, P. Li, and J. P. Bello. Scaper: A library for soundscape synthesis and augmentation.
In IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), New Paltz, NY, USA, Oct. 2017.

<a id="2">[2]</a> Gert Dekkers, Steven Lauwereins, Bart Thoen, Mulu Weldegebreal Adhana, Henk Brouckxon, Toon van Waterschoot, Bart Vanrumste, Marian Verhelst, and Peter Karsmakers.
The SINS database for detection of daily activities in a home environment using an acoustic sensor network.
In Proceedings of the Detection and Classification of Acoustic Scenes and Events 2017 Workshop (DCASE2017), 32–36. November 2017.

<a id="3">[3]</a> David Snyder, Guoguo Chen, and Daniel Povey.
MUSAN: A Music, Speech, and Noise Corpus.
arXiv, 1510.08484, 2015.

<a id="4">[4]</a> F. Font, G. Roma & X. Serra. Freesound technical demo. In Proceedings of the 21st ACM international conference on Multimedia. ACM, 2013.

<a id="5">[5]</a> E. Fonseca, J. Pons, X. Favory, F. Font, D. Bogdanov, A. Ferraro, S. Oramas, A. Porter & X. Serra. Freesound Datasets: A Platform for the Creation of Open Audio Datasets.
In Proceedings of the 18th International Society for Music Information Retrieval Conference, Suzhou, China, 2017.

Files (5.6 GB)

| Name | Size | MD5 |
|---|---|---|
| eval.tar.gz | 5.0 GB | cfbbfc3da8bf785613cc2e18e1d6a6e9 |
| training.tar.gz | 512.0 MB | 692e70392f57c8f3521087267e6d84ef |
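
The MD5 values listed for the two archives can be used to check that a download completed without corruption. The following is a minimal sketch using only the Python standard library; the helper names `md5_of_file` and `verify_archive` are illustrative and are not part of the DESED repository, and the sketch assumes the archives keep their original file names.

```python
import hashlib
from pathlib import Path

# MD5 checksums as listed for the two archives on this record.
EXPECTED_MD5 = {
    "eval.tar.gz": "cfbbfc3da8bf785613cc2e18e1d6a6e9",
    "training.tar.gz": "692e70392f57c8f3521087267e6d84ef",
}


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in chunks
    so that multi-gigabyte archives do not have to fit in memory."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_archive(path, expected=EXPECTED_MD5):
    """Return True if the file's MD5 matches the checksum listed for its name.

    Hypothetical helper: looks the checksum up by file name, so a renamed
    archive raises KeyError instead of silently passing.
    """
    path = Path(path)
    if path.name not in expected:
        raise KeyError(f"no listed checksum for {path.name}")
    return md5_of_file(path) == expected[path.name]
```

For example, calling `verify_archive("eval.tar.gz")` from the download directory returns `True` only if the local file matches the listed checksum; a `False` result suggests the download should be repeated.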

Additional details

Is supplement to: Conference paper: https://hal.inria.fr/hal-02160855v2 (URL)

| | All versions | This version |
|---|---|---|
| Views | 10,016 | 362 |
| Downloads | 15,209 | 62 |
| Data volume | 222.9 TB | 237.6 GB |
