UPDATE: Zenodo migration postponed to Oct 13 from 06:00-08:00 UTC. Read the announcement.
There is a newer version of this record available.

Dataset Open Access

Freesound One-Shot Percussive Sounds

António Ramires; Pritish Chandna; Xavier Favory; Emilia Gómez; Xavier Serra

Freesound One-Shot Percussive Sounds Dataset

This dataset contains 10254 one-shot (single event) percussive sounds from Freesound.org and the corresponding timbral analysis. These were used to train the generative model for "Neural Percussive Synthesis Parameterised by High-Level Timbral Features".

Dataset Construction

To collect this dataset, the following steps were performed:

  • Freesound was queried with words associated with percussive instruments, such as "percussion", "kick", "wood" or "clave". Only sounds with less than one second of effective duration were selected.

  • This stage retrieved some audio clips that contained multiple sound events or that were of low quality. Therefore, we listened to all the retrieved sounds and manually discarded the sounds presenting one of these characteristics. For this, the percussive-annotator was used.

  • The sounds were then cut or padded to have 1-second length, normalized and downsampled to 16kHz.

  • Finally, the sounds were analyzed with the AudioCommons Extractor, to obtain the AudioCommons timbral descriptors. This information is contained in the 'analysis' folder.

Dataset Organisation

The dataset contains two folders and two files in the root directory:

  • 'one_shot_percussive_sounds' encloses the pre-processed audio files. These are named '<freesound_sound_id>.wav'

  • 'analysis' holds the AudioCommons analysis files for each of the sounds in the dataset. This analysis is stored as a .json file, named '<freesound_sound_id>_analysis.json', with a key for each of the features extracted.

  • Two more files are present in the root directory of the dataset: this 'README' and the 'licenses.json'. The latter one is a '.json' file containing the name, the username of the uploader and the license for each of the sounds in the dataset.

Authors and Contact

This dataset was developed by António Ramires, Pritish Chadna, Xavier Favory, Emilia Gómez and Xavier Serra.

Any questions related to this dataset please contact:

António Ramires

antonio.ramires@upf.edu

aframires@gmail.com

References

Please cite this paper if you use this dataset:

@inproceedings{ramires2020, author = "Antonio Ramires and Pritish Chandna and Xavier Favory and Emilia Gómez and Xavier Serra", title = "Neural Percussive Synthesis Parametrerised by High-Level Timbral Features", booktitle = "Proc. of the IEEE Int. Conf. on Acoustics, Speech and Signal Processing (ICASSP)", year = "2020" }

Acknowledgements

This work has received funding from the European Union's Horizon 2020 research and innovation programme under the Marie Skłodowska-Curie grant agreement No. 765068 (MIP-Frontiers).

This work has received funding from the European Union's Horizon 2020 research and innovation programme under grant agreement No. 770376 (TROMPA).

Files (119.6 MB)
Name Size
analysis.zip
md5:c67ce39d5aa6c6a7f88eedf7eb7d933e
5.6 MB Download
licenses.txt
md5:25f95a0e38d3ac4ae868f56c378fbccb
1.4 MB Download
one_shot_percussive_sounds.zip
md5:278994c2a7b92a24a4daad99f40c13db
112.6 MB Download
README.md
md5:afec91c033db607e2fc83c09940abd15
3.2 kB Download
2,025
1,739
views
downloads
All versions This version
Views 2,0251,809
Downloads 1,739800
Data volume 99.8 GB37.1 GB
Unique views 1,8071,659
Unique downloads 823379

Share

Cite as