StemGMD: A Large-Scale Audio Dataset of Isolated Drum Stems for Deep Drums Demixing - part 1
- 1. Politecnico di Milano
Description
We introduce StemGMD, a new large-scale dataset of isolated drum stems that builds upon the extensive MIDI collection found in Magenta's Groove MIDI Dataset (GMD).
GMD is a 13.6-hour corpus of expressive drum performances executed by ten drummers on a Roland TD-11 electronic drum kit. It contains 1150 MIDI files along with the corresponding full-kit audio mixtures.
As a first step in creating StemGMD, we mapped the 22 different MIDI pitches found in the original files onto nine canonical instruments through the reduction scheme proposed in J. Gillick, A. Roberts, J. Engel, D. Eck, and D. Bamman, "Learning to groove with inverse sequence transformations," in International Conference on Machine Learning (ICML), vol. 97, 2019, pp. 2269–2279.
Each of the nine resulting MIDI channels was manually synthesized as a 16-bit/44.1 kHz stereo WAV file using ten realistic-sounding acoustic drum kits sourced from the Logic Pro X sample libraries, i.e., Bluebird, Brooklyn, Detroit Garage, East Bay, Heavy, Motown Revisited, Portland, Retro Rock, Roots, and SoCal.
As a result, StemGMD contains 1224 hours of audio, which correspond to more than 136 hours of full-kit mixtures. Moreover, StemGMD also contains single hits for each of the drum pieces at ten different velocities ranging from 30 to 127.
To the best of our knowledge, StemGMD is the largest publicly available dataset of drums to date. Moreover, it is the first collection of single-instrument clips from all nine pieces in a canonical drum kit, making it well-suited for training deep drums demixing models.
*** THIS IS PART 1 OF 2 ***
Download part 2 here: https://zenodo.org/records/7882857 (now available!)
After downloading both parts, run unzip_StemGMD.sh to build the dataset from the split archive files.
Once unzipped, StemGMD will take just over 1.13 TB of memory.
The dataset is made available under a Creative Commons Attribution 4.0 International (CC BY 4.0) License.
____________________________
We employed the dataset in our paper titled "Toward Deep Drum Source Separation," published in Pattern Recognition Letters.
Please, cite this work as: A. I. Mezza, R. Giampiccolo, A. Bernardini, and A. Sarti, "Toward Deep Drum Source Separation," Pattern Recognition Letters, vol. 183, pp. 86-91, 2024, doi: 10.1016/j.patrec.2024.04.026.
@article{mezza2024, title = {Toward deep drum source separation}, author = {Alessandro Ilic Mezza and Riccardo Giampiccolo and Alberto Bernardini and Augusto Sarti}, journal = {Pattern Recognition Letters}, volume = {183}, pages = {86-91}, year = {2024}, issn = {0167-8655}, doi = {https://doi.org/10.1016/j.patrec.2024.04.026} }
Files
Files
(186.8 GB)
Name | Size | Download all |
---|---|---|
md5:5c83ff4ffb7d1f8957a50362e07c8972
|
31.1 GB | Download |
md5:2e9b549bf614e32bc802cba812490029
|
31.1 GB | Download |
md5:3d55dff28d94d6bb481c216a30adc082
|
31.1 GB | Download |
md5:3b00c10eb3604815b1ac5a549aa2e524
|
31.1 GB | Download |
md5:1243b28b6fcfbc2196e01c0a03868088
|
31.1 GB | Download |
md5:2f658a4d6fac11f8a5d420716ca5ecb2
|
31.1 GB | Download |