Dataset Open Access

ORCHSET: a dataset for melody extraction in symphonic music recordings

Bosch, J.; Gomez, E.

Orchset is intended to be used as a dataset for the development and evaluation of melody extraction algorithms. This collection contains 64 audio excerpts focused on symphonic music. with their corresponding annotation of the melody.

Melody is here defined as “the single (monophonic) pitch sequence that a listener might reproduce if asked to whistle or hum a piece of polyphonic music”.

The dataset creation comprised several tasks: excerpts selection, recording sessions of people singing along with the excerpts, analysis of the recordings and melody annotation. A complete description of the dataset and the creation methodology is presented in this paper:

Bosch, J., Marxer, R., Gomez, E., “Evaluation and Combination of Pitch Estimation Methods for Melody Extraction in Symphonic Classical Music”, Journal of New Music Research (2016)

Please Acknowledge Orchset in Academic Research

Using this dataset

When Orchset is used for academic research, we would highly appreciate if scientific publications of works partly based on the Orchset dataset quote the above publication.

We are interested in knowing if you find our datasets useful! If you use our dataset please email us at and tell us about your research.

Files (326.4 MB)
Name Size
326.4 MB Download
All versions This version
Views 1,7981,807
Downloads 802801
Data volume 261.8 GB261.5 GB
Unique views 1,5331,541
Unique downloads 606605


Cite as