Jingju a cappella singing dataset part1

Rong Gong; Rafael Caro Repetto; Yile Yang; Xavier Serra

doi:10.5281/zenodo.1244720

Published May 10, 2018 | Version 5

Dataset Open

Jingju a cappella singing dataset part1

1. Music Technology Group - Universitat Pompeu Fabra

This is the 4th version of the dataset. The folder structure has been changed since the 2nd version, where the Laosheng folder has been moved directly into .wav or textgrid folder.

Description:

This dataset is a collection of boundary annotations of a cappella singing performed by Beijing Opera (Jingju, 京剧) professional and amateur singers.

wav.zip: audio files in .wav format, mono or stereo.
wav_mono.zip: audio files in .wav format, mono
annotation_txt.zip: line, syllable and phoneme time boundaries (second) and labels in .txt format
textgrid.zip: line, syllable and phoneme annotation in Praat .textgrid format
pycode.zip: util code for parsing the .textgrid annotation
catalogue*.csv: recording metadata, source separation recordings are not included.

The boundaries (onset and offset) have been annotated in both Praat TextGrid (textgrid.zip) and .txt (annotation_txt.zip) format hierarchically:

Line (phrase),
syllable,
phoneme

Singing units in pinyin and X-SAMPA have been annotated to a jingju a cappella singing audio dataset.

The corresponding audio files are the a cappella singing arias recordings, which are stereo or mono, sampled at 44.1 kHz, and stored as .wav files. The .wav files are recorded by two institutes: those file names ending with ‘qm’ are recorded by C4DM, Queen Mary University of London; others file names ending with ‘upf’ or ‘lon’ are recorded by MTG-UPF. Additionally, another collection of 15 clean singing recordings is included in this dataset. They are extracted from the commercial recordings which originally contains karaoke accompaniment and mixed versions.

If you use this audio dataset in your work, please cite (1) this dataset as well (2) the following publication:

D. A. A. Black, M. Li, and M. Tian, “Automatic Identification of Emotional Cues in Chinese Opera Singing,” in 13th Int. Conf. on Music Perception and Cognition (ICMPC-2014), 2014, pp. 250–255.

Details:
Annotation format, units, parsing code and other information please refer to https://github.com/MTG/jingjuPhonemeAnnotation

License:
Textgrid annotations are licensed under Creative Commons Attribution-NonCommercial 4.0 International License.

Wav audio ending with ‘upf’ or ‘lon’ is licensed under Creative Commons Attribution-NonCommercial 4.0 International.

For the license of .wav audio ending with ‘qm’ from C4DM Queen Mary University of London, please refer to this page http://isophonics.org/SingingVoiceDataset

Contact information:

Rong Gong: rong<dot>gong<at>upf<dot>edu

Rafael Caro Repetto: rafael<dot>caro<at>upf<dot>edu

Files

annotation_txt.zip

Files (1.6 GB)

Name	Size	Download all
annotation_txt.zip md5:dfb3bfc0322ff3144f713bcaef39d534	217.9 kB	Preview Download
catalogue - dan.csv md5:ffaa9c074e556e1be45f3e6231cdcdd9	4.8 kB	Preview Download
catalogue - laosheng.csv md5:768fa00ce1f8880ae5480fae103ecc06	3.4 kB	Preview Download
pycode.zip md5:1e4c9b2a9a584d13736196fff6e41951	17.5 kB	Preview Download
readme.txt md5:f1113d4c03b379a6a23d85e2c215d54b	2.0 kB	Preview Download
textgrid.zip md5:8088161679f519d13f96dc1be9f53bdd	1.2 MB	Preview Download
wav.zip md5:4722abda831c20b169a62b2754b15bea	869.0 MB	Preview Download
wav_mono.zip md5:4506a948480ff4d46d487148e7528f82	686.5 MB	Preview Download

Additional details

European Commission
COMPMUSIC – Computational models for the discovery of the world's music 267583

Citations

Oops! Something went wrong while fetching results.

	All versions	This version
Views	5,123	334
Downloads	2,053	123
Data volume	1.3 TB	25.3 GB

Jingju a cappella singing dataset part1

Creators

Description

Files

annotation_txt.zip

Files (1.6 GB)

Additional details

Funding