Dataset Open Access

MSMD - Multimodal Sheet Music Dataset

Dorfer, Matthias; Hajič, Jan, jr.; Arzt, Andreas; Frostel, Harald; Widmer, Gerhard

Project member(s)
Balke, Stefan; Henkel, Florian

MSMD is a synthetic dataset of 497 pieces of (classical) music that contains both audio and score representations of the pieces aligned at a fine-grained level (344,742 pairs of noteheads aligned to their audio/MIDI counterpart). It can be used for training and evaluating multimodal models that enable crossing from one modality to the other, such as retrieving sheet music using recordings or following a performance in the score image.

Please find further information and a corresponding Python package on this Github page: https://github.com/CPJKU/msmd

If you use this dataset, please cite:
[1] Matthias Dorfer, Jan Hajič jr., Andreas Arzt, Harald Frostel, Gerhard Widmer.
Learning Audio-Sheet Music Correspondences for Cross-Modal Retrieval and Piece Identification (PDF).
Transactions of the International Society for Music Information Retrieval, issue 1, 2018.

Files (9.6 GB)
Name Size
msmd_aug_v1-1_no-audio.zip
md5:cf843c481c2ff811d9b26c6ca7df60ab
9.6 GB Download
1,815
4,498
views
downloads
All versions This version
Views 1,8151,582
Downloads 4,4984,467
Data volume 42.8 TB42.7 TB
Unique views 1,5941,411
Unique downloads 1,1471,120

Share

Cite as