There is a newer version of this record available.

Video/Audio Open Access

DESED_synthetic

Turpault, Nicolas; Serizel, Romain


JSON Export

{
  "files": [
    {
      "links": {
        "self": "https://zenodo.org/api/files/f1cc2e2c-e37d-4faf-8b33-029a9e5e5e24/dcase21_synth.tar.gz"
      }, 
      "checksum": "md5:99cbb7b21299cd473e4acedfd5ad614f", 
      "bucket": "f1cc2e2c-e37d-4faf-8b33-029a9e5e5e24", 
      "key": "dcase21_synth.tar.gz", 
      "type": "gz", 
      "size": 18874495973
    }, 
    {
      "links": {
        "self": "https://zenodo.org/api/files/f1cc2e2c-e37d-4faf-8b33-029a9e5e5e24/DESED_synth_dcase2019jams.tar.gz"
      }, 
      "checksum": "md5:e5d6348d9b9ca19d08b7afba0e987de3", 
      "bucket": "f1cc2e2c-e37d-4faf-8b33-029a9e5e5e24", 
      "key": "DESED_synth_dcase2019jams.tar.gz", 
      "type": "gz", 
      "size": 3096604
    }, 
    {
      "links": {
        "self": "https://zenodo.org/api/files/f1cc2e2c-e37d-4faf-8b33-029a9e5e5e24/DESED_synth_dcase20_eval_jams.tar.gz"
      }, 
      "checksum": "md5:105774e4528b266c829f3a6fdad4397d", 
      "bucket": "f1cc2e2c-e37d-4faf-8b33-029a9e5e5e24", 
      "key": "DESED_synth_dcase20_eval_jams.tar.gz", 
      "type": "gz", 
      "size": 325956
    }, 
    {
      "links": {
        "self": "https://zenodo.org/api/files/f1cc2e2c-e37d-4faf-8b33-029a9e5e5e24/DESED_synth_dcase20_train_val_jams.tar.gz"
      }, 
      "checksum": "md5:01f2ba4e33c82006d8e407b75f103fe7", 
      "bucket": "f1cc2e2c-e37d-4faf-8b33-029a9e5e5e24", 
      "key": "DESED_synth_dcase20_train_val_jams.tar.gz", 
      "type": "gz", 
      "size": 1154751
    }, 
    {
      "links": {
        "self": "https://zenodo.org/api/files/f1cc2e2c-e37d-4faf-8b33-029a9e5e5e24/DESED_synth_eval_dcase2019.tar.gz"
      }, 
      "checksum": "md5:e1aad0a714bb98d2b58f3d62122077b8", 
      "bucket": "f1cc2e2c-e37d-4faf-8b33-029a9e5e5e24", 
      "key": "DESED_synth_eval_dcase2019.tar.gz", 
      "type": "gz", 
      "size": 7710291574
    }, 
    {
      "links": {
        "self": "https://zenodo.org/api/files/f1cc2e2c-e37d-4faf-8b33-029a9e5e5e24/DESED_synth_soundbank.tar.gz"
      }, 
      "checksum": "md5:03b51e3506ae28157a26101748045e90", 
      "bucket": "f1cc2e2c-e37d-4faf-8b33-029a9e5e5e24", 
      "key": "DESED_synth_soundbank.tar.gz", 
      "type": "gz", 
      "size": 2422047310
    }, 
    {
      "links": {
        "self": "https://zenodo.org/api/files/f1cc2e2c-e37d-4faf-8b33-029a9e5e5e24/meta_infos_2021.tar.gz"
      }, 
      "checksum": "md5:d8f3f212c28c60425696c7ab0420cd23", 
      "bucket": "f1cc2e2c-e37d-4faf-8b33-029a9e5e5e24", 
      "key": "meta_infos_2021.tar.gz", 
      "type": "gz", 
      "size": 27454
    }, 
    {
      "links": {
        "self": "https://zenodo.org/api/files/f1cc2e2c-e37d-4faf-8b33-029a9e5e5e24/soundbank_validation.tsv"
      }, 
      "checksum": "md5:2eba5a6fe230baecc1803dab526a77a5", 
      "bucket": "f1cc2e2c-e37d-4faf-8b33-029a9e5e5e24", 
      "key": "soundbank_validation.tsv", 
      "type": "tsv", 
      "size": 25859
    }
  ], 
  "owners": [
    62138
  ], 
  "doi": "10.5281/zenodo.4568779", 
  "stats": {
    "version_unique_downloads": 4385.0, 
    "unique_views": 17.0, 
    "views": 20.0, 
    "version_views": 2600.0, 
    "unique_downloads": 0.0, 
    "version_unique_views": 1904.0, 
    "volume": 0.0, 
    "version_downloads": 9586.0, 
    "downloads": 0.0, 
    "version_volume": 32592778791510.0
  }, 
  "links": {
    "doi": "https://doi.org/10.5281/zenodo.4568779", 
    "conceptdoi": "https://doi.org/10.5281/zenodo.3550598", 
    "bucket": "https://zenodo.org/api/files/f1cc2e2c-e37d-4faf-8b33-029a9e5e5e24", 
    "conceptbadge": "https://zenodo.org/badge/doi/10.5281/zenodo.3550598.svg", 
    "html": "https://zenodo.org/record/4568779", 
    "latest_html": "https://zenodo.org/record/4569096", 
    "badge": "https://zenodo.org/badge/doi/10.5281/zenodo.4568779.svg", 
    "latest": "https://zenodo.org/api/records/4569096"
  }, 
  "conceptdoi": "10.5281/zenodo.3550598", 
  "created": "2021-02-28T14:20:35.433071+00:00", 
  "updated": "2021-02-28T21:37:36.617497+00:00", 
  "conceptrecid": "3550598", 
  "revision": 4, 
  "id": 4568779, 
  "metadata": {
    "access_right_category": "success", 
    "doi": "10.5281/zenodo.4568779", 
    "description": "<p>Link to the associated github repository: <a href=\"https://github.com/turpaultn/Desed\">https://github.com/turpaultn/Desed</a></p>\n\n<p>Link to the papers: <a href=\"https://hal.inria.fr/hal-02160855\"><em>https://hal.inria.fr/hal-02160855</em></a>,&nbsp; <a href=\"https://hal.inria.fr/hal-02355573v1\">https://hal.inria.fr/hal-02355573v1</a></p>\n\n<p>Domestic Environment Sound Event Detection (DESED).</p>\n\n<p><strong>Description</strong><br>\nThis dataset is the synthetic part of the DESED dataset. It allows creating mixtures of isolated sounds and backgrounds.</p>\n\n<p>There is the material to:</p>\n\n<ul>\n\t<li>Reproduce the DCASE 2019 task 4 synthetic dataset</li>\n\t<li>Reproduce the DCASE 2020 task 4 synthetic train dataset</li>\n\t<li>Creating new mixtures from isolated foreground sounds and background sounds.</li>\n</ul>\n\n<p><strong>Files:</strong></p>\n\n<p><strong>If you want to generate new audio mixtures yourself from the original files.</strong></p>\n\n<ol>\n\t<li><strong>DESED_synth_soundbank.tar.gz</strong> : Raw data used to generate mixtures.</li>\n\t<li><strong>DESED_synth_dcase2019jams.tar.gz</strong>: JAMS files, metadata describing how to recreate the&nbsp; dcase2019 synthetic dataset<strong> </strong></li>\n\t<li><strong>DESED_synth_dcase20_train_val_jams.tar: </strong>JAMS files, metadata describing how to recreate the dcase2020 synthetic train and valid dataset.</li>\n\t<li><strong>DESED_synth_dcase20_eval_jams.tar: </strong>JAMS files, metadata describing how to recreate the dcase2020 synthetic eval dataset (only the basic one, variants of it have been made but not presented here).</li>\n</ol>\n\n<p><strong>If you simply want the evaluation synthetic dataset used in DCASE 2019 task 4.</strong></p>\n\n<ol>\n\t<li><strong>DESED_synth_eval_dcase2019.tar.gz</strong><strong> </strong>:<strong> </strong>Evaluation audio and metadata files used in dcase 2019 task 4.</li>\n</ol>\n\n<p>&nbsp;</p>\n\n<p>The mixtures are generated using Scaper (https://github.com/justinsalamon/scaper) [1].</p>\n\n<p>* Background files are extracted from SINS [2], MUSAN [3] or Youtube and have been selected because they contain a very low amount of our sound event classes.<br>\n* Foreground files are extracted from Freesound [4][5] and manually verified to check the quality and segmented to remove silences.</p>\n\n<p><strong>References</strong><br>\n[1] J. Salamon, D. MacConnell, M. Cartwright, P. Li, and J. P. Bello. Scaper: A library for soundscape synthesis and augmentation<br>\nIn IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), New Paltz, NY, USA, Oct. 2017.</p>\n\n<p>[2] Gert Dekkers, Steven Lauwereins, Bart Thoen, Mulu Weldegebreal Adhana, Henk Brouckxon, Toon van Waterschoot, Bart Vanrumste, Marian Verhelst, and Peter Karsmakers.<br>\nThe SINS database for detection of daily activities in a home environment using an acoustic sensor network.<br>\nIn Proceedings of the Detection and Classification of Acoustic Scenes and Events 2017 Workshop (DCASE2017), 32&ndash;36. November 2017.</p>\n\n<p>[3] David Snyder and Guoguo Chen and Daniel Povey.<br>\nMUSAN: A Music, Speech, and Noise Corpus.<br>\narXiv, 1510.08484, 2015.</p>\n\n<p>[4] F. Font, G. Roma &amp; X. Serra. Freesound technical demo. In Proceedings of the 21st ACM international conference on Multimedia. ACM, 2013.<br>\n&nbsp;<br>\n[5] E. Fonseca, J. Pons, X. Favory, F. Font, D. Bogdanov, A. Ferraro, S. Oramas, A. Porter &amp; X. Serra. Freesound Datasets: A Platform for the Creation of Open Audio Datasets.<br>\nIn Proceedings of the 18th International Society for Music Information Retrieval Conference, Suzhou, China, 2017.</p>\n\n<p>&nbsp;</p>", 
    "contributors": [
      {
        "affiliation": "Adobe Research, San Francisco CA, United States", 
        "type": "Researcher", 
        "name": "Salamon, Justin"
      }, 
      {
        "affiliation": "Language Technologies Institute, Carnegie Mellon University, Pittsburgh PA, United States", 
        "type": "Researcher", 
        "name": "Shah, Ankit"
      }, 
      {
        "affiliation": "Google, Inc", 
        "type": "Researcher", 
        "name": "Wisdom, Scott"
      }, 
      {
        "affiliation": "Google, Inc", 
        "type": "Researcher", 
        "name": "Hershey, John"
      }, 
      {
        "affiliation": "Google, Inc", 
        "type": "Researcher", 
        "name": "Erdogan, Hakan"
      }
    ], 
    "title": "DESED_synthetic", 
    "license": {
      "id": "CC-BY-4.0"
    }, 
    "relations": {
      "version": [
        {
          "count": 20, 
          "index": 16, 
          "parent": {
            "pid_type": "recid", 
            "pid_value": "3550598"
          }, 
          "is_last": false, 
          "last_child": {
            "pid_type": "recid", 
            "pid_value": "4569096"
          }
        }
      ]
    }, 
    "communities": [
      {
        "id": "dcase"
      }
    ], 
    "version": "v2.3", 
    "keywords": [
      "DCASE", 
      "Sound event detection"
    ], 
    "publication_date": "2020-03-07", 
    "creators": [
      {
        "affiliation": "Universit\u00e9 de Lorraine, CNRS, Inria, Loria, F-54000 Nancy, France", 
        "name": "Turpault, Nicolas"
      }, 
      {
        "affiliation": "Universit\u00e9 de Lorraine, CNRS, Inria, Loria, F-54000 Nancy, France", 
        "name": "Serizel, Romain"
      }
    ], 
    "access_right": "open", 
    "resource_type": {
      "type": "video", 
      "title": "Video/Audio"
    }, 
    "related_identifiers": [
      {
        "scheme": "url", 
        "identifier": "https://hal.inria.fr/hal-02160855v2", 
        "relation": "isSupplementTo", 
        "resource_type": "publication-conferencepaper"
      }, 
      {
        "scheme": "url", 
        "identifier": "https://hal.inria.fr/hal-02355573", 
        "relation": "isSupplementTo", 
        "resource_type": "publication-conferencepaper"
      }, 
      {
        "scheme": "doi", 
        "identifier": "10.5281/zenodo.3550598", 
        "relation": "isVersionOf"
      }
    ]
  }
}
2,600
9,586
views
downloads
All versions This version
Views 2,60020
Downloads 9,5860
Data volume 32.6 TB0 Bytes
Unique views 1,90417
Unique downloads 4,3850

Share

Cite as