Dataset Open Access

VimSketch Dataset

Bongjun Kim; Mark Cartwright; Fatemeh Pishdadian; Bryan Pardo

VimSketch Dataset combines two publicly available datasets, created by the Interactive Audio Lab:

  1. Vocal Imitation Set: a collection of crowd-sourced vocal imitations of a large set of diverse sounds collected from Freesound (https://freesound.org/), which were curated based on Google's AudioSet ontology (https://research.google.com/audioset/).
  2. VocalSketch Dataset: a dataset containing thousands of vocal imitations of a large set of diverse sounds.

 

Publications by the Interactive Audio Lab using VimSketch:

[pdf] Fatemeh Pishdadian, Bongjun Kim, Prem Seetharaman, Bryan Pardo. "Classifying Non-speech Vocals: Deep vs Signal Processing Representations," Detection and Classification of Acoustic Scenes and Events Workshop (DCASE), 2019

 

Contact information:

- Interactive Audio Lab: http://music.eecs.northwestern.edu

- Bryan Pardo pardo@northwestern.edu | http://www.bryanpardo.com

- Bongjun Kim bongjun@u.northwestern.edu | http://www.bongjunkim.com

- Fatemeh Pishdadian fpishdadian@u.northwestern.edu | http://www.fatemehpishdadian.com

 

 

Files (4.5 GB)
Name Size
Vim_Sketch_Dataset.zip
md5:5f743eb8cf98040b55b4f27839666334
4.5 GB Download
160
16
views
downloads
All versions This version
Views 160160
Downloads 1616
Data volume 71.8 GB71.8 GB
Unique views 131131
Unique downloads 1616

Share

Cite as