There is a newer version of the record available.

Published February 12, 2020 | Version 1.0
Dataset Open

Freesound One-Shot Percussive Sounds

  • 1. Music Technology Group, Universitat Pompeu Fabra
  • 2. Joint Research Centre, European Commission

Description

Freesound One-Shot Percussive Sounds Dataset

This dataset contains 10254 one-shot (single event) percussive sounds from Freesound.org and the corresponding timbral analysis. These were used to train the generative model for "Neural Percussive Synthesis Parameterised by High-Level Timbral Features".

Dataset Construction

To collect this dataset, the following steps were performed:

  • Freesound was queried with words associated with percussive instruments, such as "percussion", "kick", "wood" or "clave". Only sounds with less than one second of effective duration were selected.

  • This stage retrieved some audio clips that contained multiple sound events or that were of low quality. Therefore, we listened to all the retrieved sounds and manually discarded the sounds presenting one of these characteristics. For this, the percussive-annotator was used.

  • The sounds were then cut or padded to have 1-second length, normalized and downsampled to 16kHz.

  • Finally, the sounds were analyzed with the AudioCommons Extractor, to obtain the AudioCommons timbral descriptors. This information is contained in the 'analysis' folder.

Dataset Organisation

The dataset contains two folders and two files in the root directory:

  • 'one_shot_percussive_sounds' encloses the pre-processed audio files. These are named '<freesound_sound_id>.wav'

  • 'analysis' holds the AudioCommons analysis files for each of the sounds in the dataset. This analysis is stored as a .json file, named '<freesound_sound_id>_analysis.json', with a key for each of the features extracted.

  • Two more files are present in the root directory of the dataset: this 'README' and the 'licenses.json'. The latter one is a '.json' file containing the name, the username of the uploader and the license for each of the sounds in the dataset.

Authors and Contact

This dataset was developed by António Ramires, Pritish Chadna, Xavier Favory, Emilia Gómez and Xavier Serra.

Any questions related to this dataset please contact:

António Ramires

antonio.ramires@upf.edu

aframires@gmail.com

References

Please cite this paper if you use this dataset:

@inproceedings{ramires2020, author = "Antonio Ramires and Pritish Chandna and Xavier Favory and Emilia Gómez and Xavier Serra", title = "Neural Percussive Synthesis Parametrerised by High-Level Timbral Features", booktitle = "Proc. of the IEEE Int. Conf. on Acoustics, Speech and Signal Processing (ICASSP)", year = "2020" }

Acknowledgements

This work has received funding from the European Union's Horizon 2020 research and innovation programme under the Marie Skłodowska-Curie grant agreement No. 765068 (MIP-Frontiers).

This work has received funding from the European Union's Horizon 2020 research and innovation programme under grant agreement No. 770376 (TROMPA).

Files

analysis.zip

Files (119.6 MB)

Name Size Download all
md5:c67ce39d5aa6c6a7f88eedf7eb7d933e
5.6 MB Preview Download
md5:25f95a0e38d3ac4ae868f56c378fbccb
1.4 MB Preview Download
md5:278994c2a7b92a24a4daad99f40c13db
112.6 MB Preview Download
md5:afec91c033db607e2fc83c09940abd15
3.2 kB Preview Download

Additional details

Related works

Is supplement to
Preprint: arXiv:1911.11853 (arXiv)

Funding

TROMPA – Towards Richer Online Music Public-domain Archives 770376
European Commission
MIP-Frontiers – New Frontiers in Music Information Processing 765068
European Commission