ThumbSet - A dataset of partial annotations for Automatic Piano Fingering
Description
Here we introduce ThumbSet, an open dataset created from the collection of MusicXML piano scores published as public domain on the MuseScore website, which include finger label annotations. We relied on music publishers who added partial or full annotations to support piano learning when creating this dataset. It is important to note that these annotations reflect a single editor's expertise and do not represent a global ground truth. The source of the annotations on the MuseScore website is not always clear. Some scores may have incorrectly engraved labels, while others may have finger annotations provided by non-expert users. Consequently, the dataset may contain a significant amount of noise, resulting in lower data quality compared to the PIG dataset.
ThumbSet is composed of 2523 music scores, as shown in Table 1. The genres, transcription quality, and fingering quality are highly heterogeneous, and the difficulty level of the pieces tends to lean towards the early years of music education compared to the PIG dataset. However, it is not possible to quantify all these claims with the existing metadata. We can assert that there are more finger labels annotated in the right hand (61%) than in the left hand (39%), although there are more pieces with only left-hand annotations (742) than with only right-hand annotations (153). The proportion of annotated fingers and the window lengths are similar in both hands.
To make ThumbSet available for research purposes, we sliced the data into several windows. Each window takes into account the context, other notes, and symbols, ranging from 32 to 64 notes and symbols around each symbol of interest, including other finger label annotations. Additionally, the excerpts are encoded in the PIG encoding, a text format proposed in a previous study, which does not allow reverting to the original score to protect the copyright of the pieces. We distribute ThumbSet as variable-length music windows in the PIG encoding format. This format contains information about pitch, time onset, time offset, and finger label annotations, if they exist for all notes. Access to the data is limited and available upon request through the Zenodo platform. We also provide links to all the MuseScore source pieces used in creating ThumbSet.
For citation and more information, please refer to the following Article:
@inproceedings{ramoneda2022automatic,
title={Automatic Piano Fingering from Partially Annotated Scores using Autoregressive Neural Networks},
author={Pedro Ramoneda and Dasaem Jeong and Eita Nakamura and Xavier Serra and Marius Miron},
booktitle={Proceedings of the 30th ACM International Conference on Multimedia (MM '22)},
year={2022},
month={October 10--14},
location={Lisboa, Portugal}
}
Disclaimer: Although the name may suggest that there are only thumb fingers annotated on the new dataset, it only indicates the fact that the most typical partial annotations are those suggesting finger crossing, and also that the authors like Tom Thumb story.