Published May 23, 2018 | Version 1.1.2
Software Open

VocalSketch Data Set 1.1.2


This dataset contains thousands of vocal imitations of a large set of diverse sounds, collected from hundreds of contributors via Amazon's Mechanical Turk website. It also contains data on hundreds of people's ability to correctly label these vocal imitations, likewise collected via Mechanical Turk. The dataset will help the research community understand which audio concepts can be effectively communicated through vocal imitation. We have released this data so the community can study the related issues and build systems that use vocal imitation as an interaction modality, such as search engines that can be queried by vocally imitating the desired sound.

See for the latest updates to this dataset.

Interactive Audio Lab:



Files (4.6 GB)
