Rong Gong
Rafael Caro Repetto
Yile Yang
Xavier Serra
2018-07-30
<p>This is the 4th version of the dataset. The folder structure has been changed since the 2nd version, where the Laosheng folder has been moved directly into wav or textgrid folder.</p>
<p><strong>Description:</strong></p>
<p>This dataset is a collection of boundary annotations of a cappella singing performed by Beijing Opera (Jingju, 京剧) professional and amateur singers. </p>
<ol>
<li>wav.zip: audio files in .wav format, mono or stereo.</li>
<li>wav_mono.zip: audio files in .wav format, mono</li>
<li>pycode.zip: util code for parsing the .textgrid annotation</li>
<li>catalogue*.csv: recording metadata, source separation recordings are not included.</li>
<li>textgrid.zip: phrase, syllable and phoneme annotation in Praat .textgrid format</li>
<li>annotation_txt.zip: phrase, syllable and phoneme time boundaries (second) and labels in .txt format
<ol>
<li>*phrase_char: phrase-level time boundaries, labeled in Mandarin characters</li>
<li>*phrase: phrase-level time boundaries, labeled in Mandarin pinyin</li>
<li>*syllable: syllable-level time boundaries, labeled in Mandarin pinyin</li>
<li>*phoneme: phoneme-level time boundaries, labeled in X-SAMPA</li>
</ol>
</li>
</ol>
<p>The boundaries (onset and offset) have been annotated in both <strong>Praat TextGrid (textgrid.zip)</strong> and .<strong>txt (annotation_txt.zip)</strong> format hierarchically:</p>
<ol>
<li>phrase (line),</li>
<li>syllable,</li>
<li>phoneme</li>
</ol>
<p>Singing units in pinyin and X-SAMPA have been annotated to a jingju a cappella singing audio dataset.</p>
<p>The corresponding audio files are the a cappella singing arias recordings, which are stereo or mono, sampled at 44.1 kHz, and stored as .wav files. The .wav files are recorded by two institutes: those file names ending with ‘qm’ are recorded by C4DM, Queen Mary University of London; others file names ending with ‘upf’ or ‘lon’ are recorded by MTG-UPF. Additionally, another collection of 15 clean singing recordings is included in this dataset. They are extracted from the commercial recordings which originally contains karaoke accompaniment and mixed versions.</p>
<p><strong>If you use this audio dataset in your work, please cite (1) this dataset as well (2) the following publication:</strong></p>
<blockquote>
<p>D. A. A. Black, M. Li, and M. Tian, “Automatic Identification of Emotional Cues in Chinese Opera Singing,” in 13th Int. Conf. on Music Perception and Cognition (ICMPC-2014), 2014, pp. 250–255.</p>
</blockquote>
<p> </p>
<p><strong>Details:</strong><br>
Annotation format, units, parsing code and other information please refer to <a href="https://github.com/MTG/jingjuPhonemeAnnotation">https://github.com/MTG/jingjuPhonemeAnnotation</a></p>
<p><br>
<strong>License:</strong><br>
Textgrid annotations are licensed under Creative Commons Attribution-NonCommercial 4.0 International License.</p>
<p>Wav audio ending with ‘upf’ or ‘lon’ is licensed under Creative Commons Attribution-NonCommercial 4.0 International.</p>
<p>For the license of .wav audio ending with ‘qm’ from C4DM Queen Mary University of London, please refer to this page <a href="http://isophonics.org/SingingVoiceDataset">http://isophonics.org/SingingVoiceDataset</a></p>
<p><strong>Contact information:</strong></p>
<p>Rong Gong: rong<dot>gong<at>upf<dot>edu</p>
<p>Rafael Caro Repetto: rafael<dot>caro<at>upf<dot>edu</p>
https://doi.org/10.5281/zenodo.1323561
oai:zenodo.org:1323561
Zenodo
https://zenodo.org/communities/mtgupf
https://zenodo.org/communities/mir
https://zenodo.org/communities/eu
https://zenodo.org/communities/mdm-dtic-upf
https://doi.org/10.5281/zenodo.780559
info:eu-repo/semantics/openAccess
Creative Commons Attribution Non Commercial 4.0 International
https://creativecommons.org/licenses/by-nc/4.0/legalcode
Beijing opera
annotation
phoneme
syllable
phrase
singing voice
praat
textgrid
wave audio
jingju
MTG
C4DM
a cappella
Jingju a cappella singing dataset part1
info:eu-repo/semantics/other