Published September 2, 2020 | Version 2.0
Dataset Open


  • 1. ZHAW Datalab & USi
  • 2. ZHAW Datalab
  • 3. TU Wien
  • 4. The Swiss AI Lab IDSIA (USI & SUPSI)


The DeepScoresV2 Dataset for Music Object Detection contains digitally rendered images of written sheet music, together with the corresponding ground truth to fit various types of machine learning models. A total of 151 Million different instances of music symbols, belonging to 135 different classes are annotated. The total Dataset contains 255,385 Images. For most researches, the dense version, containing 1714 of the most diverse and interesting images, should suffice.

The dataset contains ground in the form of:

  • Non-oriented bounding boxes
  • Oriented bounding boxes
  • Semantic segmentation
  • Instance segmentation

The accompaning paper The DeepScoresV2 Dataset and Benchmark for Music Object Detection published at ICPR2020 can be found here:


A toolkit for convenient loading and inspection of the data can be found here:

Code to train baseline models can be found here:




The authors are grateful for the support through Innosuisse grant No. 34301.1 IP-ICT "RealScore", EuropeanResearch Council Advanced Grant 742870, and the continued fruitful collaboration with ScorePad AG.


Files (81.7 GB)

Name Size Download all
80.9 GB Download
741.8 MB Download