There is a newer version of this record available.

Dataset Open Access

Jingju a cappella singing dataset part1

Rong Gong; Rafael Caro Repetto; Yile Yang; Xavier Serra

This is the third version of the dataset. The folder structure has changed: the Laosheng folder is now placed directly under the wav and textgrid paths.

Description:

This dataset is a collection of boundary annotations of a cappella singing performed by Beijing Opera (Jingju, 京剧) professional and amateur singers. 

The boundaries have been annotated hierarchically, in both Praat TextGrid (textgrid.zip) and txt (annotation_txt.zip) formats, at three levels:

  1. line (phrase),
  2. syllable,
  3. phoneme.

These singing units are annotated on a jingju a cappella singing audio dataset.
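The three-level hierarchy above can be illustrated with a minimal sketch that groups lower-level intervals under the higher-level interval containing them. The (start, end, label) tuple layout, the `nest` helper, and the example boundaries are all assumptions made for illustration; the actual annotation format is documented in the repository linked under Details.

```python
from typing import List, Tuple

# Assumed layout for one annotated unit: (start_sec, end_sec, label).
Interval = Tuple[float, float, str]


def nest(parents: List[Interval], children: List[Interval],
         tol: float = 1e-3) -> List[Tuple[Interval, List[Interval]]]:
    """Group each child interval under the parent interval that contains it.

    `tol` absorbs small rounding differences at shared boundaries.
    """
    grouped = []
    for parent in parents:
        p_start, p_end, _ = parent
        inside = [c for c in children
                  if c[0] >= p_start - tol and c[1] <= p_end + tol]
        grouped.append((parent, inside))
    return grouped


# Hypothetical boundaries, invented for illustration only.
lines = [(0.0, 2.0, "line1")]
syllables = [(0.0, 1.2, "syl_a"), (1.2, 2.0, "syl_b")]
phonemes = [(0.0, 0.5, "p1"), (0.5, 1.2, "p2"), (1.2, 2.0, "p3")]

line_to_syllables = nest(lines, syllables)
syllable_to_phonemes = nest(syllables, phonemes)
```

Running `nest` twice, once per adjacent pair of levels, is enough to walk from a line down to its phonemes.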

The corresponding audio files are recordings of a cappella arias, in stereo or mono, sampled at 44.1 kHz and stored as .wav files. The .wav files were recorded by two institutes: files whose names end with ‘qm’ were recorded by C4DM, Queen Mary University of London; files whose names end with ‘upf’ or ‘lon’ were recorded by MTG-UPF. An additional collection of 15 clean singing recordings is also part of this dataset. They were extracted from commercial recordings that originally contain karaoke accompaniment and mixed versions. Please contact the authors to obtain these 15 recordings.
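The file-name convention and audio properties described above can be checked with a short sketch using only the Python standard library. The helper names `recording_source` and `wav_info` and the example file names are hypothetical. The sketch also assumes that the suffix sits at the end of the file name just before the .wav extension, and that the files are plain PCM that the `wave` module can open.

```python
import wave
from pathlib import Path


def recording_source(filename: str) -> str:
    """Map a file name to the recording institute, based on its suffix."""
    stem = Path(filename).stem.lower()  # name without the .wav extension
    if stem.endswith("qm"):
        return "C4DM"
    if stem.endswith(("upf", "lon")):
        return "MTG-UPF"
    return "unknown"


def wav_info(path: str):
    """Return (channels, sample_rate_hz, duration_sec) for a PCM .wav file."""
    with wave.open(path, "rb") as wf:
        channels = wf.getnchannels()
        rate = wf.getframerate()
        duration = wf.getnframes() / rate
    return channels, rate, duration


# Hypothetical file names, for illustration only.
for name in ["example_aria_qm.wav", "example_aria_upf.wav"]:
    print(name, "->", recording_source(name))
```

Calling `wav_info` on a downloaded file should report 1 or 2 channels and a sample rate of 44100 if the file matches the description.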

If you use this audio dataset in your work, please also cite the following publication:

D. A. A. Black, M. Li, and M. Tian, “Automatic Identification of Emotional Cues in Chinese Opera Singing,” in 13th Int. Conf. on Music Perception and Cognition (ICMPC-2014), 2014, pp. 250–255.

 

Details:
For the annotation format, units, parsing code, and other information, please refer to https://github.com/MTG/jingjuPhonemeAnnotation


License:
Textgrid annotations are licensed under Creative Commons Attribution-NonCommercial 4.0 International License.

Wav audio ending with ‘upf’ or ‘lon’ is licensed under Creative Commons Attribution-NonCommercial 4.0 International.

For the license of .wav audio ending with ‘qm’ from C4DM, Queen Mary University of London, please refer to this page: http://isophonics.org/SingingVoiceDataset

Contact information:

Rong Gong: rong<dot>gong<at>upf<dot>edu

Rafael Caro Repetto: rafael<dot>caro<at>upf<dot>edu

Files (870.4 MB)

Name                      Size       MD5
annotation_txt.zip        180.0 kB   10cc65930d3799e450e0a0eb67f2202c
catalogue - dan.csv       4.8 kB     ffaa9c074e556e1be45f3e6231cdcdd9
catalogue - laosheng.csv  3.4 kB     768fa00ce1f8880ae5480fae103ecc06
pycode.zip                17.5 kB    1e4c9b2a9a584d13736196fff6e41951
readme.txt                2.0 kB     f1113d4c03b379a6a23d85e2c215d54b
textgrid.zip              1.2 MB     8088161679f519d13f96dc1be9f53bdd
wav.zip                   869.0 MB   4722abda831c20b169a62b2754b15bea
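
The MD5 checksums listed for each file can be used to confirm that a download is complete. The following is a minimal sketch using Python's hashlib; the helper names `md5_of` and `verify` and the download folder are assumptions, while the file names and checksums are copied from the list above.

```python
import hashlib
from pathlib import Path

# File names and MD5 values as listed in this record.
EXPECTED_MD5 = {
    "annotation_txt.zip": "10cc65930d3799e450e0a0eb67f2202c",
    "catalogue - dan.csv": "ffaa9c074e556e1be45f3e6231cdcdd9",
    "catalogue - laosheng.csv": "768fa00ce1f8880ae5480fae103ecc06",
    "pycode.zip": "1e4c9b2a9a584d13736196fff6e41951",
    "readme.txt": "f1113d4c03b379a6a23d85e2c215d54b",
    "textgrid.zip": "8088161679f519d13f96dc1be9f53bdd",
    "wav.zip": "4722abda831c20b169a62b2754b15bea",
}


def md5_of(path, chunk_size=1 << 20):
    """Compute the MD5 hex digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify(folder):
    """Return {file name: True/False}; missing files count as False."""
    results = {}
    for name, expected in EXPECTED_MD5.items():
        path = Path(folder) / name
        results[name] = path.exists() and md5_of(path) == expected
    return results
```

For example, `verify("downloads")` (a hypothetical folder) returns a dictionary showing which files are present and match their listed checksum. Reading in chunks keeps memory use low for the 869.0 MB wav.zip.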