Dataset Open Access
Bongjun Kim; Bryan Pardo
The VocalImitationSet is a collection of crowd-sourced vocal imitations of a large set of diverse sounds collected from Freesound (https://freesound.org/), which were curated based on Google's AudioSet ontology (https://research.google.com/audioset/). We expect that this dataset will help research communities obtain a better understanding of human's vocal imitation and build a machine understand the imitations as humans do.
See https://github.com/interactiveaudiolab/VocalImitationSet for more information about this dataset and its latest updates.
For citations, please use this reference:
Bongjun Kim, Madhav Ghei, Bryan Pardo, and Zhiyao Duan, "Vocal Imitation Set: a dataset of vocally imitated sound events using the AudioSet ontology," Proceedings of the Detection and Classification of Acoustic Scenes and Events 2018 Workshop (DCASE2018), Nov. 2018.
Contact Info:
- Interactive Audio Lab: http://music.eecs.northwestern.edu
- Bongjun Kim bongjun@u.northwestern.edu | http://www.bongjunkim.com
- Bryan Pardo pardo@northwestern.edu | http://www.bryanpardo.com
Name | Size | |
---|---|---|
VocalImitationSet_v1.1.3.zip
md5:386e7b1487fe0800ade4916c344086bc |
7.6 GB | Download |
All versions | This version | |
---|---|---|
Views | 2,209 | 1,929 |
Downloads | 2,995 | 2,959 |
Data volume | 22.6 TB | 22.5 TB |
Unique views | 1,933 | 1,728 |
Unique downloads | 471 | 442 |