
Beijing Opera Percussion Pattern Dataset

Ajay Srinivasamurthy; Rafael Caro Repetto; Harshavardhan Sundar; Xavier Serra

The Beijing Opera Percussion Pattern (BOPP) dataset is a collection of audio examples of percussion patterns played by the percussion ensemble in Beijing Opera (Jingju, 京剧). The percussion ensemble in Jingju plays a set of pre-defined and labeled percussion patterns, which serve many functions. The percussion patterns can be defined as sequences of strokes played by different combinations of the percussion instruments, and the resulting variety of timbres is transmitted using oral syllables as mnemonics. More information on the percussion instruments used in Beijing Opera can be found at http://compmusic.upf.edu/examples-percussion-bo.

The dataset presented here was used as the training dataset in the referenced paper. A detailed description of percussion patterns in Jingju can also be found in it.

DATASET

The dataset is a collection of 133 audio percussion patterns spanning five different pattern classes as described below. The scores for the patterns and additional details about the patterns are at: http://compmusic.upf.edu/bo-perc-patterns

Audio Content

The audio files are short segments, each containing one of the above-mentioned patterns. The audio is stereo, sampled at 44.1 kHz, and stored as wav files. The segments were chosen from the introductory parts of arias. The recordings of arias are from commercially available releases spanning various artists. The audio and segments were chosen carefully by a musicologist to be representative of the percussion patterns that occur in Jingju. The audio segments vary in the timbres of the percussion instruments (though the same set of instruments is played, there can be slight variations in the individual instruments across different ensembles), in recording quality, and in the period of the recording. Though these recordings were chosen from introductions of arias where only the percussion ensemble is playing, there are some examples in the dataset where the melodic accompaniment starts before the percussion pattern ends.
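As a quick sanity check of the audio format described above, the following minimal sketch reads the header of a wav file with Python's standard `wave` module and compares it against the stated properties (stereo, 44.1 kHz). The function name `describe_wav` and the returned dictionary keys are illustrative choices, not part of the dataset. The sketch assumes the files are PCM-encoded, which the description does not state explicitly.

```python
import wave

def describe_wav(path, expected_channels=2, expected_rate=44100):
    """Read basic header information from a wav file.

    Assumption: the file is PCM-encoded; the standard `wave` module
    may reject other wav encodings.
    """
    with wave.open(path, "rb") as wf:
        channels = wf.getnchannels()
        sample_rate = wf.getframerate()
        n_frames = wf.getnframes()
    return {
        "channels": channels,
        "sample_rate": sample_rate,
        # Duration in seconds, derived from frame count and sampling rate.
        "duration_s": n_frames / sample_rate,
        # True when the header agrees with the description (stereo, 44.1 kHz).
        "matches_description": (
            channels == expected_channels and sample_rate == expected_rate
        ),
    }
```

Calling `describe_wav("12005.wav")` on a local copy of a segment would return its channel count, sampling rate, and duration; a file whose header differs from the description is flagged through `matches_description`.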

Annotations

Each audio pattern has an associated syllable-level transcription. The transcription is obtained from the score for the pattern and is not time-aligned to the audio. The transcription uses a reduced set of five syllables, which is sufficient to computationally model the timbres of all the syllables. The annotations are stored as Hidden Markov Model Toolkit (HTK) label files. A single master label file is also provided for batch processing using HTK (http://htk.eng.cam.ac.uk/).
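The label files can be read without HTK itself. The sketch below is one possible reader, assuming each non-empty line of a .lab file holds one syllable label, optionally preceded by up to two integer time stamps as the HTK label format allows. Since the transcriptions here are not time-aligned, the time stamps are probably absent, but the exact layout of the files is an assumption. The function name `read_htk_labels` and the labels `S1` and `S2` used in the usage note are placeholders, not the dataset's actual syllables.

```python
from typing import List

def read_htk_labels(text: str) -> List[str]:
    """Extract the label sequence from the contents of an HTK-style label file.

    Assumption: one label per non-empty line, optionally preceded by
    up to two integer time stamps (start and end).
    """
    labels = []
    for line in text.splitlines():
        tokens = line.split()
        if not tokens:
            continue  # skip blank lines
        # Skip at most two leading integer fields, always leaving one token.
        i = 0
        while i < 2 and i < len(tokens) - 1 and tokens[i].isdigit():
            i += 1
        labels.append(tokens[i])
    return labels
```

For example, `read_htk_labels("S1\nS2\nS1\n")` gives the sequence of the three placeholder labels in order. The master label file is not handled by this sketch.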

Dataset organization

The dataset has wav files and label files. The files are named as

<pID><InstID>.<extension>

The pID is as in Table 1, InstID is a three-digit, zero-padded identifier for the specific instance of the pattern, and extension is .wav for the audio file or .lab for the label file. pID ∈ {10, 11, 12, 13, 14}, InstID ∈ {1, 2, ..., N_pID}, where N_pID is the number of instances of pattern pID. For example, the audio file and the label file for the fifth instance of the pattern duotuo are named 12005.wav and 12005.lab, respectively. The master label file is called masterLabels.lab.
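The naming scheme above can be decoded programmatically. The following is a minimal sketch, assuming a two-digit pID followed by a three-digit InstID; the names `BoppFile` and `parse_bopp_filename` are hypothetical helpers introduced for illustration. The mapping from pID to pattern name is given in Table 1 and is deliberately not reproduced here.

```python
import re
from typing import NamedTuple

# Pattern IDs listed in the dataset description.
VALID_PATTERN_IDS = {10, 11, 12, 13, 14}

# Two-digit pID, three-digit InstID, then the extension.
_NAME_RE = re.compile(r"^(\d{2})(\d{3})\.(wav|lab)$")

class BoppFile(NamedTuple):
    pattern_id: int
    instance_id: int
    extension: str

def parse_bopp_filename(name: str) -> BoppFile:
    """Split a file name such as '12005.wav' into its components."""
    match = _NAME_RE.match(name)
    if match is None:
        raise ValueError(f"not a pattern file name: {name!r}")
    pattern_id = int(match.group(1))
    instance_id = int(match.group(2))
    if pattern_id not in VALID_PATTERN_IDS:
        raise ValueError(f"unknown pattern ID in {name!r}")
    if instance_id < 1:
        raise ValueError(f"instance IDs start at 1: {name!r}")
    return BoppFile(pattern_id, instance_id, match.group(3))
```

With this sketch, `parse_bopp_filename("12005.wav")` yields pattern ID 12, instance 5, and extension `wav`. The master label file, masterLabels.lab, does not follow the scheme and raises `ValueError`, so it should be handled separately.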

Using this dataset

If you use the dataset in your work, please cite the following publication:

Ajay Srinivasamurthy, Rafael Caro Repetto, Harshavardhan Sundar, Xavier Serra, "Transcription and Recognition of Syllable based Percussion Patterns: The Case of Beijing Opera," in Proceedings of the 15th International Society for Music Information Retrieval (ISMIR) Conference, Taipei, Taiwan, Oct 2014.

http://hdl.handle.net/10230/25677

We are interested in knowing if you find our datasets useful! If you use our dataset please email us at mtg-info@upf.edu and tell us about your research.

CONTACT

If you have any questions or comments about the dataset, please feel free to write to us.

Ajay Srinivasamurthy (ajays.murthy@upf.edu)

Rafael Caro Repetto (rafael.caro@upf.edu)

http://compmusic.upf.edu/bopp-dataset

Restricted Access

You may request access to the files in this upload, provided that you fulfil the conditions below. The decision to grant or deny access rests solely with the record owner.


The annotations are publicly shared and available under the CC BY-ND 4.0 license. The audio is from commercially available releases. It cannot be publicly shared but can be made available on request for non-commercial research purposes.

Please include in the justification field your academic affiliation (if you have one) and a brief description of your research topics and why you would like to use this dataset. If you do not include this information we may not approve your request.

