Dataset Open Access

Jingju a cappella singing pitch contour segmentation ground truth dataset

Gong, Rong; Yang, Yile; Serra, Xavier

The dataset used in the paper:

Gong, Rong; Yang, Yile; Serra, Xavier;  Pitch Contour Segmentation for Computer-aided Jingju Singing Training Sound and Music Computing (SMC 2016), 2016, Hamburg, Germany

is in "dataset" folder. The a cappella singing audio recordings are not contained in this folder due to their large size, please contact the paper authors to request them ( In the "dataset" folder you can find:

  1. ground truth
  2. Jinging singing scores in .xml format used for estimating the bigram note transition probabilities.

The ground truth annotation is used for:

  • melodic transcription (male_12_pos_1 missing)
  • parameter optimization,
  • evaluating the StdCdLe thresholding and the overall segmentation performance.

The subfolder "groundtruth" contains the following annotation for each jingju a cappella audio:

  • file name: description (format)
  • *_melodicTrans.csv: melodic transcription ground truth used for the evaluation (start_time pitch duration -).
  • *_coarseSeg.csv: StdCdLe ground truth used for the parameter optimization and the evaluation (segmentation points).
  • *_refinedSeg.csv: ground truth used for optimizing other parameters and the evaluation (start_time - duration).
  • *_pitchtrack.csv: pitch track (contour) extracted by pYIN pitch-tracking algorithm (filename time pitch).
  • *_monoNoteOut.csv: notes estimated by pYIN note-tracking algorithm (filename start_time duration pitch).



Files (13.1 MB)
Name Size
13.1 MB Download
  • Gong, Rong et al. (2016). Pitch Contour Segmentation for Computer-aided Jingju Singing Training. Hamburg, Germany.

All versions This version
Views 404404
Downloads 5050
Data volume 656.9 MB656.9 MB
Unique views 385385
Unique downloads 4242


Cite as