An audio dataset for sound effect variations
Description
This is a sound effect dataset constructed to test neural network's capability of generating diverse categories of sounds. It is composed of two categories: Footstep-set and Impact-set. All sounds are sourced from Freesound. The Footstep-set contains 9 categories with 2127 sounds in total while Impact-set has 7 categories with 1666 sounds in total. Each category is a different variation of the main dataset. For example, the Footstep-set contains footsteps recorded on metal boards and on concrete floors, each showing slightly different timbres of sounds. The Impact-set contains impulsive sounds commonly used in games such as gunshots, explosion, punch, etc.
Each dataset is separated into training (90%) and testing (10%). All sounds are sampled in 44.1khz, 16bit, 4-second mono files. The metadata for each dataset records the index of the sound, category, ID for retrieving the link of Freesound, Sound Creator, Name for the sound file, and the train/test set split.
This dataset was used to train and test our model in a recent paper that was submitted but under review. We will upload the accompanying paper once it's been accepted.
Files
Footstep-set.zip
Files
(683.4 MB)
| Name | Size | Download all |
|---|---|---|
|
md5:5e71ece8ae674a762e938b5ba017022d
|
476.4 MB | Preview Download |
|
md5:f23e2d4b67bf7b31e07082ddee7e892c
|
207.0 MB | Preview Download |
Additional details
Dates
- Submitted
-
2024-12-06
References
- Font, F., Roma, G., & Serra, X. (2013). Freesound technical demo. Proceedings of the 21st ACM international conference on Multimedia.