Dataset Open Access

VimSketch Dataset

Bongjun Kim; Mark Cartwright; Fatemeh Pishdadian; Bryan Pardo

VimSketch Dataset combines two publicly available datasets, created by the Interactive Audio Lab:

  1. Vocal Imitation Set: a collection of crowd-sourced vocal imitations of a large set of diverse sounds collected from Freesound (, which were curated based on Google's AudioSet ontology (
  2. VocalSketch Dataset: a dataset containing thousands of vocal imitations of a large set of diverse sounds.


Publications by the Interactive Audio Lab using VimSketch:

[pdf] Fatemeh Pishdadian, Bongjun Kim, Prem Seetharaman, Bryan Pardo. "Classifying Non-speech Vocals: Deep vs Signal Processing Representations," Detection and Classification of Acoustic Scenes and Events Workshop (DCASE), 2019


Contact information:

- Interactive Audio Lab:

- Bryan Pardo |

- Bongjun Kim |

- Fatemeh Pishdadian |



Files (4.5 GB)
Name Size
4.5 GB Download
All versions This version
Views 315315
Downloads 4343
Data volume 193.0 GB193.0 GB
Unique views 263263
Unique downloads 3939


Cite as