Dataset Open Access

Vocal Imitation Set v1.1.3 : Thousands of vocal imitations of hundreds of sounds from the AudioSet ontology

Bongjun Kim; Bryan Pardo


JSON-LD (schema.org) Export

{
  "description": "<p>The VocalImitationSet is a collection of crowd-sourced vocal imitations of a large set of diverse sounds collected from Freesound (<a href=\"https://freesound.org/\">https://freesound.org/</a>), which were curated based on Google&#39;s AudioSet ontology (<a href=\"https://research.google.com/audioset/\">https://research.google.com/audioset/</a>). We expect that this dataset will help research communities obtain a better understanding of human&#39;s vocal imitation and build a machine understand the imitations as humans do.</p>\n\n<p>See&nbsp;<a href=\"https://github.com/interactiveaudiolab/VocalImitationSet\">https://github.com/interactiveaudiolab/VocalImitationSet</a> for more information about this dataset and its latest updates.</p>\n\n<p>For citations, please use this reference:</p>\n\n<p>Bongjun Kim, Madhav Ghei, Bryan Pardo, and Zhiyao Duan, &quot;Vocal Imitation Set: a dataset of vocally imitated sound events using the AudioSet ontology,&quot;&nbsp;<em>Proceedings of the Detection and Classification of Acoustic Scenes and Events 2018 Workshop (DCASE2018)</em>, Nov. 2018.</p>\n\n<p>Contact Info:</p>\n\n<p>- Interactive Audio Lab: <a href=\"http://music.eecs.northwestern.edu/\">http://music.eecs.northwestern.edu</a></p>\n\n<p>- Bongjun Kim&nbsp;<a href=\"mailto:bongjun@u.northwestern.edu\">bongjun@u.northwestern.edu</a>&nbsp;|&nbsp;<a href=\"http://www.bongjunkim.com/\">http://www.bongjunkim.com</a></p>\n\n<p>- Bryan Pardo&nbsp;<a href=\"mailto:pardo@northwestern.edu\">pardo@northwestern.edu</a>&nbsp;|&nbsp;<a href=\"http://www.bryanpardo.com/\">http://www.bryanpardo.com</a></p>", 
  "license": "http://creativecommons.org/licenses/by/4.0/legalcode", 
  "creator": [
    {
      "affiliation": "Northwestern University", 
      "@type": "Person", 
      "name": "Bongjun Kim"
    }, 
    {
      "affiliation": "Northwestern University", 
      "@type": "Person", 
      "name": "Bryan Pardo"
    }
  ], 
  "sameAs": [
    "https://github.com/interactiveaudiolab/VocalImitationSet/releases/tag/v1.0"
  ], 
  "datePublished": "2018-08-06", 
  "url": "https://zenodo.org/record/1340763", 
  "keywords": [
    "vocal imitation", 
    "sound", 
    "audio information retrieval", 
    "audio"
  ], 
  "@context": "https://schema.org/", 
  "distribution": [
    {
      "contentUrl": "https://zenodo.org/api/files/de07c32f-133c-4654-890f-74d019867dca/VocalImitationSet_v1.1.3.zip", 
      "encodingFormat": "zip", 
      "@type": "DataDownload"
    }
  ], 
  "identifier": "https://doi.org/10.5281/zenodo.1340763", 
  "@id": "https://doi.org/10.5281/zenodo.1340763", 
  "@type": "Dataset", 
  "name": "Vocal Imitation Set v1.1.3 : Thousands of vocal imitations of hundreds of sounds from the AudioSet ontology"
}
625
2,784
views
downloads
All versions This version
Views 625448
Downloads 2,7842,754
Data volume 21.0 TB20.9 TB
Unique views 562421
Unique downloads 296273

Share

Cite as