Dataset Open Access

Annotated jingju arias dataset

Shuo Zhang; Rafael Caro Repetto; Xavier Serra; Yile Yang

Thesis supervisor(s)

Serra, Xavier

The Annotated Jingju Arias Dataset is a collection of 34 jingju arias manually segmented in various levels using the software Praat v5.3.53. The selected arias contain samples of the two main shengqiang in jingju, name xipi and erhuang, and the five main role types in terms of singing, namely, dan, jing, laodan, laosheng and xiaosheng.

The dataset includes a Praat TextGrid file for each aria with the following tiers (all the annotations are in Chinese):

  1. aria: name of the work (one segment for the whole aria)
  2. MBID: MusicBrainz ID of the audioi recording (one segment for the whole aria)
  3. artist: name of the singing performer (one segment for the whole aria)
  4. school: related performing school (one segment for the whole aria)
  5. role-type: role type of the singing character (one segment for the whole aria)
  6. shengqiang: boundaries and label of the shengqiang performed in the aria (including accompaniment)
  7. banshi: boundaries and label of the banshi performed in the aria (including accompaniment)
  8. lyrics-lines: boundaries and annotation of each line of lyrics
  9. lyrics-syllables: boundaries and annotation of each syllable
  10. luogu: boundaries and label of each of the performed percussion patterns in the aria

The ariasInfo.txt file contains a summary of the contents per aira of the whole dataset.

A subset of this dataset comprising 20 arias has been used for the study of the relationship between linguistic tones and melody in the following papers:

Shuo Zhang, Rafael Caro Repetto, and Xavier Serra (2014) “Study of the Similarity between Linguistic Tones and Melodic Pitch Contours in Beijing Opera Singing.” In Proceedings of the 15th International Society for Music Information Retrieval Conference (ISMIR 2014), Taipei, Taiwan, October 27–31, pp. 343–348.

______ (2015) “Predicting Pairwise Pitch Contour Relations Based on Linguistic Tone Information in Beijing Opera Singing.” In Proceedings of the 16th International Society for Music Information Retrieval Conference (ISMIR 2015), Málaga, Spain, October 26–30, pp. 107–113.

Here is the list of the arias from the dataset used in these papers.

The whole dataset has been used for the automatic analysis of the structure of jingju arias and their automatic segmentation in the following master's thesis:

Yile Yang (2016) Structure Analysis of Beijing Opera Arias. Master’s thesis, Universitat Pompeu Fabra, Barcelona.

Using this dataset

If you use this dataset in a publication, please cite the above publications.

We are interested in knowing if you find our datasets useful! If you use our dataset please email us at mtg-info@upf.edu and tell us about your research.

Contact

The audio recordings used for these annotations are available for research purposes. Please contact Rafael Caro Repetto

rafael.caro@upf.edu

 

http://compmusic.upf.edu/node/349

Files (261.8 kB)
Name Size
annotated_jingju_arias_1.0.zip
md5:d6d33ea1c117aa28369fce54e934d7a0
261.8 kB Download
  • ______ (2015) "Predicting Pairwise Pitch Contour Relations Based on Linguistic Tone Information in Beijing Opera Singing." In Proceedings of the 16th International Society for Music Information Retrieval Conference (ISMIR 2015), Málaga, Spain, October 26–30, pp. 107–113.

  • Shuo Zhang, Rafael Caro Repetto, and Xavier Serra (2014) "Study of the Similarity between Linguistic Tones and Melodic Pitch Contours in Beijing Opera Singing." In Proceedings of the 15th International Society for Music Information Retrieval Conference (ISMIR 2014), Taipei, Taiwan, October 27–31, pp. 343–348.

  • Yile Yang (2016) Structure Analysis of Beijing Opera Arias. Master's thesis, Universitat Pompeu Fabra, Barcelona.

121
9
views
downloads
All versions This version
Views 121121
Downloads 99
Data volume 2.4 MB2.4 MB
Unique views 116116
Unique downloads 99

Share

Cite as