Planned intervention: On Wednesday April 3rd 05:30 UTC Zenodo will be unavailable for up to 2-10 minutes to perform a storage cluster upgrade.
Published November 12, 2019 | Version v1
Video/Audio Open

Fine-grained Vocal Imitation Set

  • 1. Northwestern University

Description

This dataset includes 763 vocal imitations of 108 sound events. The sound event recordings were taken from a subset of Vocal Imitation Set (zenodo.org/record/1340763). While the original VocalImitationSet only contains vocal imitations of a single reference recording per class, this new dataset contains vocal imitations of multiple reference recordings per class. Class names and filenames in this dataset are matched with the VocalImitationSet. Read the following paper to get more detailed information about VocalImitationSet.

[pdf] Bongjun Kim, Madhav Ghei, Bryan Pardo, and Zhiyao Duan, "Vocal Imitation Set: a dataset of vocally imitated sound events using the AudioSet ontology," *Proceedings of the Detection and Classification of Acoustic Scenes and Events 2018 Workshop (DCASE2018)*, Nov. 2018.

Contact Info:

- Interactive Audio Lab: http://music.eecs.northwestern.edu

- Bongjun Kim bongjun@u.northwestern.edu | http://www.bongjunkim.com

- Bryan Pardo pardo@northwestern.edu | http://www.bryanpardo.com

Files

Fine-grained_VocalImitationSet.zip

Files (419.4 MB)

Name Size Download all
md5:d0de20ad2a5a2c29ef6f897a90d00a62
419.4 MB Preview Download