Published November 19, 2019
| Version v1
Video/Audio
Open
DESED
- 1. Université de Lorraine, CNRS, Inria, Loria, F-54000 Nancy, France
- 2. Language Technologies Institute, Carnegie Mellon University, Pittsburgh PA, United States
- 3. Adobe Research, San Francisco CA, United States
Description
Link to the associated github repository: [https://github.com/turpaultn/Desed_synthetic](https://github.com/turpaultn/Desed_synthetic)
## Description
This repository provides the information and code needed to download the data and reproduce the synthetic dataset used in DCASE 2019 task 4, along with examples of how to create your own data (using [Scaper](https://github.com/justinsalamon/scaper) [[1]](#1)).
The dataset is described in this paper: [link](https://hal.inria.fr/hal-02160855).
The evaluation part was submitted to ICASSP and will be updated later.
* Background files are extracted from SINS [[2]](#2), MUSAN [[3]](#3) or YouTube, and were selected because they contain very few occurrences of our sound event classes.
* Foreground files are extracted from Freesound [[4]](#4)[[5]](#5), manually verified for quality, and segmented to remove silences.
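As a rough illustration of how foreground and background files like these can be mixed with Scaper, here is a minimal sketch. It is not the configuration used for DCASE 2019 task 4: the 10-second clip length, the SNR range, the event timing ranges, the reference level, and the helper names `event_kwargs` and `make_soundscape` are all assumptions chosen for the example. It also assumes the foreground and background folders each contain one subfolder per label, as Scaper expects.

```python
from pathlib import Path

# Illustrative values only (assumptions, not the DCASE 2019 task 4 settings).
CLIP_DURATION = 10.0  # seconds


def event_kwargs():
    """Build keyword arguments for Scaper.add_event as distribution tuples."""
    return {
        "label": ("choose", []),        # empty list: pick any label folder in fg_path
        "source_file": ("choose", []),  # empty list: pick any file of that label
        "source_time": ("const", 0.0),  # start reading the source file at 0 s
        "event_time": ("uniform", 0.0, 8.0),      # onset inside the clip
        "event_duration": ("uniform", 0.5, 2.0),  # seconds
        "snr": ("uniform", 6.0, 30.0),  # dB relative to the background
        "pitch_shift": None,            # no pitch augmentation
        "time_stretch": None,           # no time-stretch augmentation
    }


def make_soundscape(fg_path, bg_path, out_dir, name="example", n_events=2):
    """Generate one soundscape (audio + JAMS annotation) and return both paths."""
    # Imported here so the parameter helper above can be inspected
    # even when Scaper is not installed.
    import scaper

    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)

    sc = scaper.Scaper(CLIP_DURATION, str(fg_path), str(bg_path))
    sc.ref_db = -50  # assumed background reference level

    # One randomly chosen background, then a few randomly chosen events.
    sc.add_background(
        label=("choose", []),
        source_file=("choose", []),
        source_time=("const", 0),
    )
    for _ in range(n_events):
        sc.add_event(**event_kwargs())

    audio_path = out_dir / f"{name}.wav"
    jams_path = out_dir / f"{name}.jams"
    sc.generate(str(audio_path), str(jams_path))
    return audio_path, jams_path
```

Calling `make_soundscape` with the paths to your local foreground and background folders writes one `.wav` file and its `.jams` annotation to `out_dir`. The scripts in the linked repository remain the reference for reproducing the actual dataset.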
## References
<a id="1">[1]</a> J. Salamon, D. MacConnell, M. Cartwright, P. Li, and J. P. Bello. Scaper: A library for soundscape synthesis and augmentation.
In IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), New Paltz, NY, USA, Oct. 2017.
<a id="2">[2]</a> Gert Dekkers, Steven Lauwereins, Bart Thoen, Mulu Weldegebreal Adhana, Henk Brouckxon, Toon van Waterschoot, Bart Vanrumste, Marian Verhelst, and Peter Karsmakers.
The SINS database for detection of daily activities in a home environment using an acoustic sensor network.
In Proceedings of the Detection and Classification of Acoustic Scenes and Events 2017 Workshop (DCASE2017), 32–36. November 2017.
<a id="3">[3]</a> David Snyder, Guoguo Chen, and Daniel Povey.
MUSAN: A Music, Speech, and Noise Corpus.
arXiv, 1510.08484, 2015.
<a id="4">[4]</a> F. Font, G. Roma & X. Serra. Freesound technical demo. In Proceedings of the 21st ACM international conference on Multimedia. ACM, 2013.
<a id="5">[5]</a> E. Fonseca, J. Pons, X. Favory, F. Font, D. Bogdanov, A. Ferraro, S. Oramas, A. Porter & X. Serra. Freesound Datasets: A Platform for the Creation of Open Audio Datasets.
In Proceedings of the 18th International Society for Music Information Retrieval Conference, Suzhou, China, 2017.
Files
(5.6 GB)
Name | Size | Download all |
---|---|---|
md5:cfbbfc3da8bf785613cc2e18e1d6a6e9 | 5.0 GB | Download |
md5:692e70392f57c8f3521087267e6d84ef | 512.0 MB | Download |
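The MD5 checksums listed above can be used to check that a downloaded archive is intact. The following is a minimal sketch using only the Python standard library; the helper names `md5_of_file` and `verify_md5` are hypothetical, and the path of the downloaded file is whatever you saved it as locally.

```python
import hashlib


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in 1 MiB chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # Read in chunks so multi-gigabyte archives do not need to fit in memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_md5(path, expected):
    """Compare a file's MD5 with an expected value such as 'md5:cfbb...'."""
    # Accept the checksum with or without the 'md5:' prefix used in the table.
    expected_hex = expected.strip().lower().split(":")[-1]
    return md5_of_file(path) == expected_hex
```

For example, `verify_md5(path_to_archive, "md5:cfbbfc3da8bf785613cc2e18e1d6a6e9")` should return `True` for an uncorrupted copy of the 5.0 GB file, where `path_to_archive` is your local download path.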
Additional details
Related works
- Is supplement to
- Conference paper: https://hal.inria.fr/hal-02160855v2 (URL)