There is a newer version of this record available.

Dataset Open Access

Vocal Imitation Set v1.0 : Thousands of vocal imitations of hundreds of sounds from the AudioSet ontology

Bongjun Kim; Bryan Pardo

The VocalImitationSet is a collection of crowd-sourced vocal imitations of a large set of diverse sounds collected from Freesound, which were curated based on Google's AudioSet ontology. We expect that this dataset will help research communities obtain a better understanding of human vocal imitation and build machines that understand the imitations as humans do.

See for the latest updates to this dataset.

Contact Info:

- Interactive Audio Lab

- Bongjun Kim

- Bryan Pardo

Files (4.6 GB)
Statistic | All versions | This version
Views | 1,196 | 210
Downloads | 2,829 | 31
Data volume | 21.4 TB | 142.9 GB
Unique views | 1,085 | 190
Unique downloads | 334 | 29

