There is a newer version of this record available.

Dataset Open Access

Jingju a cappella singing dataset part1

Rong Gong; Rafael Caro Repetto; Yile Yang; Xavier Serra


Citation Style Language JSON Export

{
  "publisher": "Zenodo", 
  "DOI": "10.5281/zenodo.1244720", 
  "title": "Jingju a cappella singing dataset part1", 
  "issued": {
    "date-parts": [
      [
        2018, 
        5, 
        10
      ]
    ]
  }, 
  "abstract": "<p>This is the 4th version of the dataset. The folder structure has been changed since the 2nd version, where the Laosheng folder has been moved directly into .wav or textgrid&nbsp;folder.</p>\n\n<p><strong>Description:</strong></p>\n\n<p>This dataset is a collection of boundary annotations of a cappella singing performed by Beijing Opera (Jingju, \u4eac\u5267) professional and amateur singers.&nbsp;</p>\n\n<ol>\n\t<li>wav.zip: audio files in .wav format, mono or stereo.</li>\n\t<li>wav_mono.zip: audio files in .wav&nbsp;format, mono</li>\n\t<li>annotation_txt.zip: line, syllable and phoneme time boundaries (second) and labels in .txt format</li>\n\t<li>textgrid.zip: line, syllable and phoneme annotation in Praat .textgrid format</li>\n\t<li>pycode.zip: util code for parsing the .textgrid annotation</li>\n\t<li>catalogue*.csv: recording metadata, source separation recordings are not included.</li>\n</ol>\n\n<p>The boundaries (onset and offset) have been annotated in both <strong>Praat TextGrid (textgrid.zip)</strong> and .<strong>txt (annotation_txt.zip)</strong> format hierarchically:</p>\n\n<ol>\n\t<li>Line (phrase),</li>\n\t<li>syllable,</li>\n\t<li>phoneme</li>\n</ol>\n\n<p>Singing units in pinyin and X-SAMPA have been annotated to a jingju&nbsp;a cappella singing audio dataset.</p>\n\n<p>The corresponding audio files are the a cappella singing arias recordings, which are stereo or mono, sampled at 44.1 kHz, and stored as .wav files. The .wav files are recorded by two institutes: those file names ending with &lsquo;qm&rsquo; are recorded by C4DM, Queen Mary University of London; others file names ending with &lsquo;upf&rsquo; or &lsquo;lon&rsquo; are recorded by MTG-UPF. Additionally, another collection of 15 clean singing recordings is included in this dataset. 
They are extracted from the commercial recordings which originally contains karaoke accompaniment and mixed versions.</p>\n\n<p><strong>If you use this audio dataset in your work, please cite (1) this dataset as well (2) the following publication:</strong></p>\n\n<blockquote>\n<p>D. A. A. Black, M. Li, and M. Tian, &ldquo;Automatic Identification of Emotional Cues in Chinese Opera Singing,&rdquo; in 13th Int. Conf. on Music Perception and Cognition (ICMPC-2014), 2014, pp. 250&ndash;255.</p>\n</blockquote>\n\n<p>&nbsp;</p>\n\n<p><strong>Details:</strong><br>\nAnnotation format, units, parsing code and other information please refer to <a href=\"https://github.com/MTG/jingjuPhonemeAnnotation\">https://github.com/MTG/jingjuPhonemeAnnotation</a></p>\n\n<p><br>\n<strong>License:</strong><br>\nTextgrid annotations are licensed under Creative Commons Attribution-NonCommercial&nbsp;4.0 International License.</p>\n\n<p>Wav audio ending with &lsquo;upf&rsquo; or &lsquo;lon&rsquo; is licensed under&nbsp;Creative Commons Attribution-NonCommercial&nbsp;4.0 International.</p>\n\n<p>For the license of .wav audio ending with &lsquo;qm&rsquo; from C4DM Queen Mary University of London, please refer to this page <a href=\"http://isophonics.org/SingingVoiceDataset\">http://isophonics.org/SingingVoiceDataset</a></p>\n\n<p><strong>Contact information:</strong></p>\n\n<p>Rong Gong: rong&lt;dot&gt;gong&lt;at&gt;upf&lt;dot&gt;edu</p>\n\n<p>Rafael Caro Repetto: rafael&lt;dot&gt;caro&lt;at&gt;upf&lt;dot&gt;edu</p>", 
  "author": [
    {
      "family": "Rong Gong"
    }, 
    {
      "family": "Rafael Caro Repetto"
    }, 
    {
      "family": "Yile Yang"
    }, 
    {
      "family": "Xavier Serra"
    }
  ], 
  "version": "5", 
  "type": "dataset", 
  "id": "1244720"
}
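
The description in the record ties each .wav file's recording institute and license to the ending of its file name. A minimal Python sketch of that lookup follows. It assumes the suffix sits immediately before the file extension, which the record does not state explicitly; the function name `recording_source` and the example file names are hypothetical and not part of the dataset's pycode.zip.

```python
from pathlib import Path

# Suffix -> (recording institute, license note), as stated in the record's
# description. Assumption: the suffix is the last part of the file stem.
SOURCES = {
    "qm": ("C4DM, Queen Mary University of London",
           "see http://isophonics.org/SingingVoiceDataset"),
    "upf": ("MTG-UPF", "CC BY-NC 4.0"),
    "lon": ("MTG-UPF", "CC BY-NC 4.0"),
}


def recording_source(filename):
    """Return (institute, license note) for a .wav file name, or None.

    None is returned when the stem ends with none of the known suffixes,
    for example for recordings extracted from commercial releases.
    """
    stem = Path(filename).stem.lower()
    for suffix, info in SOURCES.items():
        if stem.endswith(suffix):
            return info
    return None


if __name__ == "__main__":
    # Hypothetical file names, for illustration only.
    for name in ("example_aria_qm.wav", "example_aria_upf.wav", "clean_01.wav"):
        print(name, "->", recording_source(name))
```

A plain suffix match like this can misfire on a stem that merely happens to end in one of the three strings, so it is worth checking the result against the catalogue*.csv metadata.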
                  All versions   This version
Views             1,417          24
Downloads         1,186          25
Data volume       619.2 GB       3.1 GB
Unique views      1,160          24
Unique downloads  304            4
