Published December 6, 2024 | Version v1
Dataset Open

An audio dataset for sound effect variations

Authors/Creators

  • 1. UNIVERSITY OF SYDNEY

Description

This sound effect dataset was constructed to test a neural network's ability to generate diverse categories of sounds. It comprises two sets, Footstep-set and Impact-set, with all sounds sourced from Freesound. The Footstep-set contains 9 categories with 2127 sounds in total, while the Impact-set has 7 categories with 1666 sounds in total. Each category is a different variation within its set. For example, the Footstep-set contains footsteps recorded on metal boards and on concrete floors, each with a slightly different timbre. The Impact-set contains impulsive sounds commonly used in games, such as gunshots, explosions, and punches.

Each set is split into training (90%) and testing (10%) portions. All sounds are 4-second mono files sampled at 44.1 kHz with 16-bit depth. The metadata for each set records the index of the sound, its category, the Freesound ID used to retrieve its link, the sound creator, the name of the sound file, and the train/test split.
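The stated audio format can be checked after extraction with a short script. The following is a minimal sketch, assuming the files are stored as uncompressed WAV (the container format is not stated in this description); the function name `matches_stated_format` and the duration tolerance are illustrative choices, not part of the dataset.

```python
import wave

# Format stated in the description above.
EXPECTED_RATE = 44_100        # 44.1 kHz sample rate
EXPECTED_SAMPLE_WIDTH = 2     # 16-bit samples = 2 bytes per sample
EXPECTED_CHANNELS = 1         # mono
EXPECTED_SECONDS = 4.0        # 4-second clips


def matches_stated_format(path, tolerance=0.01):
    """Return True if the WAV file at `path` matches the stated format.

    `path` is a string path to a WAV file (an assumption; the description
    does not name the container format). `tolerance` is the allowed
    deviation from 4 seconds, in seconds.
    """
    with wave.open(path, "rb") as wav:
        duration = wav.getnframes() / wav.getframerate()
        return (
            wav.getframerate() == EXPECTED_RATE
            and wav.getsampwidth() == EXPECTED_SAMPLE_WIDTH
            and wav.getnchannels() == EXPECTED_CHANNELS
            and abs(duration - EXPECTED_SECONDS) <= tolerance
        )
```

Running this over every extracted file and listing the ones that return False is a quick way to confirm a download before training.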

This dataset was used to train and test our model in a paper that is currently under review. We will upload the accompanying paper once it has been accepted.

Files

Footstep-set.zip

Files (683.4 MB)

Checksum Size
md5:5e71ece8ae674a762e938b5ba017022d 476.4 MB
md5:f23e2d4b67bf7b31e07082ddee7e892c 207.0 MB
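
The MD5 checksums listed above can be used to confirm that a download completed intact. The following is a minimal sketch using Python's standard `hashlib`; the helper names `md5_of_file` and `matches_listed_checksum` are illustrative, and the local path in the usage note is hypothetical, since this listing does not show which checksum belongs to which archive.

```python
import hashlib


def md5_of_file(path, chunk_size=1 << 20):
    """Compute the MD5 hex digest of a file, reading it in 1 MiB chunks
    so that large archives do not need to fit in memory."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listed_checksum(path, listed):
    """Compare a file against a checksum written as 'md5:<hex digest>'
    (the form used in the file listing) or as a bare hex digest."""
    expected = listed.split(":", 1)[-1].strip().lower()
    return md5_of_file(path) == expected
```

For example, `matches_listed_checksum("downloads/archive.zip", "md5:5e71ece8ae674a762e938b5ba017022d")` would return True only if the file at that (hypothetical) path is the 476.4 MB archive and is uncorrupted.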

Additional details

Dates

Submitted
2024-12-06

References

  • Font, F., Roma, G., & Serra, X. (2013). Freesound technical demo. Proceedings of the 21st ACM International Conference on Multimedia.