Dataset Open Access

Vocal Imitation Set v1.1.3 : Thousands of vocal imitations of hundreds of sounds from the AudioSet ontology

Bongjun Kim; Bryan Pardo


JSON Export

{
  "files": [
    {
      "links": {
        "self": "https://zenodo.org/api/files/de07c32f-133c-4654-890f-74d019867dca/VocalImitationSet_v1.1.3.zip"
      }, 
      "checksum": "md5:386e7b1487fe0800ade4916c344086bc", 
      "bucket": "de07c32f-133c-4654-890f-74d019867dca", 
      "key": "VocalImitationSet_v1.1.3.zip", 
      "type": "zip", 
      "size": 7587649790
    }
  ], 
  "owners": [
    8712
  ], 
  "doi": "10.5281/zenodo.1340763", 
  "stats": {
    "version_unique_downloads": 301.0, 
    "unique_views": 467.0, 
    "views": 498.0, 
    "version_views": 676.0, 
    "unique_downloads": 278.0, 
    "version_unique_views": 609.0, 
    "volume": 20949501070190.0, 
    "version_downloads": 2791.0, 
    "downloads": 2761.0, 
    "version_volume": 21087798579980.0
  }, 
  "links": {
    "doi": "https://doi.org/10.5281/zenodo.1340763", 
    "conceptdoi": "https://doi.org/10.5281/zenodo.1249723", 
    "bucket": "https://zenodo.org/api/files/de07c32f-133c-4654-890f-74d019867dca", 
    "conceptbadge": "https://zenodo.org/badge/doi/10.5281/zenodo.1249723.svg", 
    "html": "https://zenodo.org/record/1340763", 
    "latest_html": "https://zenodo.org/record/1340763", 
    "badge": "https://zenodo.org/badge/doi/10.5281/zenodo.1340763.svg", 
    "latest": "https://zenodo.org/api/records/1340763"
  }, 
  "conceptdoi": "10.5281/zenodo.1249723", 
  "created": "2018-08-07T13:25:30.939378+00:00", 
  "updated": "2020-01-24T19:26:07.856400+00:00", 
  "conceptrecid": "1249723", 
  "revision": 6, 
  "id": 1340763, 
  "metadata": {
    "access_right_category": "success", 
    "doi": "10.5281/zenodo.1340763", 
    "description": "<p>The VocalImitationSet is a collection of crowd-sourced vocal imitations of a large set of diverse sounds collected from Freesound (<a href=\"https://freesound.org/\">https://freesound.org/</a>), which were curated based on Google&#39;s AudioSet ontology (<a href=\"https://research.google.com/audioset/\">https://research.google.com/audioset/</a>). We expect that this dataset will help research communities obtain a better understanding of human vocal imitation and build machines that understand the imitations as humans do.</p>\n\n<p>See&nbsp;<a href=\"https://github.com/interactiveaudiolab/VocalImitationSet\">https://github.com/interactiveaudiolab/VocalImitationSet</a> for more information about this dataset and its latest updates.</p>\n\n<p>For citations, please use this reference:</p>\n\n<p>Bongjun Kim, Madhav Ghei, Bryan Pardo, and Zhiyao Duan, &quot;Vocal Imitation Set: a dataset of vocally imitated sound events using the AudioSet ontology,&quot;&nbsp;<em>Proceedings of the Detection and Classification of Acoustic Scenes and Events 2018 Workshop (DCASE2018)</em>, Nov. 2018.</p>\n\n<p>Contact Info:</p>\n\n<p>- Interactive Audio Lab: <a href=\"http://music.eecs.northwestern.edu/\">http://music.eecs.northwestern.edu</a></p>\n\n<p>- Bongjun Kim&nbsp;<a href=\"mailto:bongjun@u.northwestern.edu\">bongjun@u.northwestern.edu</a>&nbsp;|&nbsp;<a href=\"http://www.bongjunkim.com/\">http://www.bongjunkim.com</a></p>\n\n<p>- Bryan Pardo&nbsp;<a href=\"mailto:pardo@northwestern.edu\">pardo@northwestern.edu</a>&nbsp;|&nbsp;<a href=\"http://www.bryanpardo.com/\">http://www.bryanpardo.com</a></p>", 
    "license": {
      "id": "CC-BY-4.0"
    }, 
    "title": "Vocal Imitation Set v1.1.3 : Thousands of vocal imitations of hundreds of sounds from the AudioSet ontology", 
    "relations": {
      "version": [
        {
          "count": 2, 
          "index": 1, 
          "parent": {
            "pid_type": "recid", 
            "pid_value": "1249723"
          }, 
          "is_last": true, 
          "last_child": {
            "pid_type": "recid", 
            "pid_value": "1340763"
          }
        }
      ]
    }, 
    "keywords": [
      "vocal imitation", 
      "sound", 
      "audio information retrieval", 
      "audio"
    ], 
    "publication_date": "2018-08-06", 
    "creators": [
      {
        "affiliation": "Northwestern University", 
        "name": "Bongjun Kim"
      }, 
      {
        "affiliation": "Northwestern University", 
        "name": "Bryan Pardo"
      }
    ], 
    "access_right": "open", 
    "resource_type": {
      "type": "dataset", 
      "title": "Dataset"
    }, 
    "related_identifiers": [
      {
        "scheme": "url", 
        "identifier": "https://github.com/interactiveaudiolab/VocalImitationSet/releases/tag/v1.0", 
        "relation": "isIdenticalTo"
      }, 
      {
        "scheme": "doi", 
        "identifier": "10.5281/zenodo.1249723", 
        "relation": "isVersionOf"
      }
    ]
  }
}
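
The record's "files" entry carries a "checksum" field of the form "md5:<hexdigest>" and a "size" in bytes, which can be used to check a downloaded copy of the archive. The following is a minimal sketch of that check, assuming the archive has already been downloaded to a local path. The helper names `parse_checksum` and `file_matches` are hypothetical and not part of any Zenodo tooling.

```python
import hashlib
from pathlib import Path
from typing import Optional, Tuple


def parse_checksum(field: str) -> Tuple[str, str]:
    """Split an 'algo:hexdigest' string into (algo, hexdigest)."""
    algo, _, digest = field.partition(":")
    if not algo or not digest:
        raise ValueError(f"unexpected checksum format: {field!r}")
    return algo.lower(), digest.lower()


def file_matches(path, checksum_field: str,
                 expected_size: Optional[int] = None,
                 chunk_size: int = 1 << 20) -> bool:
    """Return True if the file at `path` matches the record's checksum.

    If `expected_size` is given, the size is compared first so that a
    truncated download is rejected without hashing the whole file.
    """
    algo, expected = parse_checksum(checksum_field)
    path = Path(path)
    if expected_size is not None and path.stat().st_size != expected_size:
        return False
    hasher = hashlib.new(algo)
    # Read in chunks: the archive is several gigabytes.
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == expected
```

With the values from the record above, the call would be `file_matches("VocalImitationSet_v1.1.3.zip", "md5:386e7b1487fe0800ade4916c344086bc", expected_size=7587649790)`, where the local filename is an assumption about where the download was saved.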
                  All versions   This version
Views             676            498
Downloads         2,791          2,761
Data volume       21.1 TB        20.9 TB
Unique views      609            467
Unique downloads  301            278
