Software Open Access
Mark Cartwright; Bongjun Kim; Bryan Pardo
This dataset contains thousands of vocal imitations of a large set of diverse sounds. These imitations were collected from hundreds of contributors via Amazon's Mechanical Turk website. The dataset also contains data on hundreds of people's ability to correctly label these vocal imitations, also collected via Amazon's Mechanical Turk. This data set will help the research community understand which audio concepts can be effectively communicated with this approach. We have released this data so the community can study the related issues and build systems that leverage vocal imitation as an interaction modality, such as search engines that can be queried by vocally imitating the desired sound.
See https://github.com/interactiveaudiolab/VocalSketchDataSet for the latest updates to this dataset.
Interactive Audio Lab: http://music.eecs.northwestern.edu