Dataset Open Access

Vocal Imitation Set v1.1.3: Thousands of vocal imitations of hundreds of sounds from the AudioSet ontology

Bongjun Kim; Bryan Pardo

The VocalImitationSet is a collection of crowd-sourced vocal imitations of a large set of diverse sounds collected from Freesound, which were curated based on Google's AudioSet ontology. We expect that this dataset will help research communities better understand human vocal imitation and build machines that understand imitations as humans do.

See for more information about this dataset and its latest updates.

For citations, please use this reference:

Bongjun Kim, Madhav Ghei, Bryan Pardo, and Zhiyao Duan, "Vocal Imitation Set: a dataset of vocally imitated sound events using the AudioSet ontology," Proceedings of the Detection and Classification of Acoustic Scenes and Events 2018 Workshop (DCASE2018), Nov. 2018.

Contact Info:

- Interactive Audio Lab

- Bongjun Kim

- Bryan Pardo

Files (7.6 GB)