Published May 12, 2022 | Version 1.1.0
Dataset Open

ATEPP: A Dataset of Automatically Transcribed Expressive Piano Performance

  • 1. Zhang
  • 2. Tang
  • 3. Rafee
  • 4. Dixon
  • 5. Fazekas
  • 6. Wiggins

Description

ATEPP is a dataset of expressive piano performances by virtuoso pianists. The dataset contains 11677 performances (~1000 hours) by 49 pianists and covers 1580 movements by 25 composers. All MIDI files in the dataset were obtained by piano transcription of existing audio recordings of piano performances. Scores in MusicXML format are also available for around half of the tracks. The dataset is organized and aligned by composition and movement to support comparative studies. For more details, please check here

Notes

When creating ATEPP Version 1.0.x, we applied only movement-wise matching to remove erroneously downloaded audio. For this version, we have also detected duplicate audio recordings using audio-wise fingerprint matching. Only 65 recordings were detected as duplicates, and the corresponding transcribed MIDI files were removed.

Files

Appendix.pdf

Files (227.9 MB)

Checksum Size
md5:eb365c6049f4803049f9abdeb49c7233 2.8 MB
md5:4a670fea0d2a456cc58226493a0778f4 221.6 MB
md5:021eb9f9d5922660854a8167a2a052a1 3.5 MB
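
The MD5 checksums listed above can be used to confirm that a download is intact. The following is a minimal sketch in Python using only the standard library. The helper names `md5_of_file` and `is_listed` are illustrative and not part of the dataset, and because the listing does not show which file name belongs to which checksum, the sketch only checks whether a file's hash matches any of the three listed values.

```python
import hashlib

# MD5 checksums copied from the file listing on this page.
# The mapping from checksum to file name is not shown there,
# so the values are kept as an unordered set.
LISTED_MD5 = {
    "eb365c6049f4803049f9abdeb49c7233",
    "4a670fea0d2a456cc58226493a0778f4",
    "021eb9f9d5922660854a8167a2a052a1",
}


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # Read fixed-size chunks so large archives do not fill memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def is_listed(path):
    """Return True if the file's MD5 matches one of the listed checksums."""
    return md5_of_file(path) in LISTED_MD5
```

For example, `is_listed("downloaded_file")` (the path is a placeholder for wherever the file was saved) returning `False` would suggest an incomplete or corrupted download, or a file from a different version of the record.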

Additional details

Funding

UK Research and Innovation
UKRI Centre for Doctoral Training in Artificial Intelligence and Music EP/S022694/1