Expanded Groove MIDI Dataset
Description
The Expanded Groove MIDI Dataset (E-GMD), a large dataset of human drum performances, with audio recordings annotated in MIDI. E-GMD contains 444 hours of audio from 43 drum kits and is an order of magnitude larger than similar datasets. It is also the first human-performed drum dataset with annotations of velocity.
Additional information is available on the Magenta website: The Expanded Groove MIDI Dataset
If you use the E-GMD dataset in your work, please cite the paper where it was introduced:
Lee Callender, Curtis Hawthorne, and Jesse Engel. "Improving Perceptual Quality of Drum Transcription with the Expanded Groove MIDI Dataset." 2020. arXiv:2004.00188.
You can also use the following BibTeX entry:
@misc{callender2020improving, title={Improving Perceptual Quality of Drum Transcription with the Expanded Groove MIDI Dataset}, author={Lee Callender and Curtis Hawthorne and Jesse Engel}, year={2020}, eprint={2004.00188}, archivePrefix={arXiv}, primaryClass={cs.SD} }
Please also make sure to specify which version of the dataset you are using.
Files
e-gmd-v0.0.1.zip
Files
(96.4 GB)
Name | Size | Download all |
---|---|---|
md5:dff1bfc4977a1b415e4e4d97169d3e90
|
96.4 GB | Preview Download |
md5:a212cb4b1aec205ef1882d9f9bb6150a
|
9.6 MB | Preview Download |