Dataset Open Access
Rong Gong;
Rafael Caro Repetto;
Yile Yang;
Xavier Serra
<?xml version='1.0' encoding='UTF-8'?> <record xmlns="http://www.loc.gov/MARC21/slim"> <leader>00000nmm##2200000uu#4500</leader> <datafield tag="653" ind1=" " ind2=" "> <subfield code="a">Beijing opera</subfield> </datafield> <datafield tag="653" ind1=" " ind2=" "> <subfield code="a">annotation</subfield> </datafield> <datafield tag="653" ind1=" " ind2=" "> <subfield code="a">phoneme</subfield> </datafield> <datafield tag="653" ind1=" " ind2=" "> <subfield code="a">syllable</subfield> </datafield> <datafield tag="653" ind1=" " ind2=" "> <subfield code="a">phrase</subfield> </datafield> <datafield tag="653" ind1=" " ind2=" "> <subfield code="a">singing voice</subfield> </datafield> <datafield tag="653" ind1=" " ind2=" "> <subfield code="a">praat</subfield> </datafield> <datafield tag="653" ind1=" " ind2=" "> <subfield code="a">textgrid</subfield> </datafield> <datafield tag="653" ind1=" " ind2=" "> <subfield code="a">wave audio</subfield> </datafield> <datafield tag="653" ind1=" " ind2=" "> <subfield code="a">jingju</subfield> </datafield> <datafield tag="653" ind1=" " ind2=" "> <subfield code="a">MTG</subfield> </datafield> <datafield tag="653" ind1=" " ind2=" "> <subfield code="a">C4DM</subfield> </datafield> <datafield tag="653" ind1=" " ind2=" "> <subfield code="a">a cappella</subfield> </datafield> <controlfield tag="005">20200124192615.0</controlfield> <controlfield tag="001">1323561</controlfield> <datafield tag="700" ind1=" " ind2=" "> <subfield code="u">Music Technology Group - Universitat Pompeu Fabra</subfield> <subfield code="a">Rafael Caro Repetto</subfield> </datafield> <datafield tag="700" ind1=" " ind2=" "> <subfield code="u">Music Technology Group - Universitat Pompeu Fabra</subfield> <subfield code="a">Yile Yang</subfield> </datafield> <datafield tag="700" ind1=" " ind2=" "> <subfield code="u">Music Technology Group - Universitat Pompeu Fabra</subfield> <subfield code="0">(orcid)0000-0003-1395-2345</subfield> <subfield code="a">Xavier Serra</subfield> </datafield> <datafield tag="856" ind1="4" ind2=" "> <subfield code="s">287014</subfield> <subfield code="z">md5:851c9c3fe195fd20bec42d32ddd9deb7</subfield> <subfield code="u">https://zenodo.org/record/1323561/files/annotation_txt.zip</subfield> </datafield> <datafield tag="856" ind1="4" ind2=" "> <subfield code="s">6148</subfield> <subfield code="z">md5:82ce90bd8508b1ae12c6a1fe489618a4</subfield> <subfield code="u">https://zenodo.org/record/1323561/files/catalogue - dan.csv</subfield> </datafield> <datafield tag="856" ind1="4" ind2=" "> <subfield code="s">3397</subfield> <subfield code="z">md5:768fa00ce1f8880ae5480fae103ecc06</subfield> <subfield code="u">https://zenodo.org/record/1323561/files/catalogue - laosheng.csv</subfield> </datafield> <datafield tag="856" ind1="4" ind2=" "> <subfield code="s">17485</subfield> <subfield code="z">md5:1e4c9b2a9a584d13736196fff6e41951</subfield> <subfield code="u">https://zenodo.org/record/1323561/files/pycode.zip</subfield> </datafield> <datafield tag="856" ind1="4" ind2=" "> <subfield code="s">2017</subfield> <subfield code="z">md5:f1113d4c03b379a6a23d85e2c215d54b</subfield> <subfield code="u">https://zenodo.org/record/1323561/files/readme.txt</subfield> </datafield> <datafield tag="856" ind1="4" ind2=" "> <subfield code="s">1241608</subfield> <subfield code="z">md5:8088161679f519d13f96dc1be9f53bdd</subfield> <subfield code="u">https://zenodo.org/record/1323561/files/textgrid.zip</subfield> </datafield> <datafield tag="856" ind1="4" ind2=" "> <subfield code="s">686516146</subfield> <subfield code="z">md5:4506a948480ff4d46d487148e7528f82</subfield> <subfield code="u">https://zenodo.org/record/1323561/files/wav_mono.zip</subfield> </datafield> <datafield tag="856" ind1="4" ind2=" "> <subfield code="s">868953391</subfield> <subfield code="z">md5:4722abda831c20b169a62b2754b15bea</subfield> <subfield code="u">https://zenodo.org/record/1323561/files/wav.zip</subfield> </datafield> <datafield tag="542" ind1=" " ind2=" "> <subfield code="l">open</subfield> </datafield> <datafield tag="260" ind1=" " ind2=" "> <subfield code="c">2018-07-30</subfield> </datafield> <datafield tag="909" ind1="C" ind2="O"> <subfield code="p">openaire_data</subfield> <subfield code="p">user-mdm-dtic-upf</subfield> <subfield code="p">user-mir</subfield> <subfield code="p">user-mtgupf</subfield> <subfield code="o">oai:zenodo.org:1323561</subfield> </datafield> <datafield tag="100" ind1=" " ind2=" "> <subfield code="u">Music Technology Group - Universitat Pompeu Fabra</subfield> <subfield code="0">(orcid)0000-0002-4659-9034</subfield> <subfield code="a">Rong Gong</subfield> </datafield> <datafield tag="245" ind1=" " ind2=" "> <subfield code="a">Jingju a cappella singing dataset part1</subfield> </datafield> <datafield tag="980" ind1=" " ind2=" "> <subfield code="a">user-mdm-dtic-upf</subfield> </datafield> <datafield tag="980" ind1=" " ind2=" "> <subfield code="a">user-mir</subfield> </datafield> <datafield tag="980" ind1=" " ind2=" "> <subfield code="a">user-mtgupf</subfield> </datafield> <datafield tag="536" ind1=" " ind2=" "> <subfield code="c">267583</subfield> <subfield code="a">Computational models for the discovery of the world's music</subfield> </datafield> <datafield tag="540" ind1=" " ind2=" "> <subfield code="u">https://creativecommons.org/licenses/by-nc/4.0/legalcode</subfield> <subfield code="a">Creative Commons Attribution Non Commercial 4.0 International</subfield> </datafield> <datafield tag="650" ind1="1" ind2="7"> <subfield code="a">cc-by</subfield> <subfield code="2">opendefinition.org</subfield> </datafield> <datafield tag="520" ind1=" " ind2=" "> <subfield code="a"><p>This is the 4th version of the dataset. The folder structure has been changed since the 2nd version, where the Laosheng folder has been moved directly into wav or textgrid&nbsp;folder.</p> <p><strong>Description:</strong></p> <p>This dataset is a collection of boundary annotations of a cappella singing performed by Beijing Opera (Jingju, 京剧) professional and amateur singers.&nbsp;</p> <ol> <li>wav.zip: audio files in .wav format, mono or stereo.</li> <li>wav_mono.zip: audio files in .wav&nbsp;format, mono</li> <li>pycode.zip: util code for parsing the .textgrid annotation</li> <li>catalogue*.csv: recording metadata, source separation recordings are not included.</li> <li>textgrid.zip: phrase, syllable and phoneme annotation in Praat .textgrid format</li> <li>annotation_txt.zip: phrase, syllable and phoneme time boundaries (second) and labels in .txt format <ol> <li>*phrase_char: phrase-level time boundaries, labeled in Mandarin characters</li> <li>*phrase:&nbsp;phrase-level time boundaries, labeled in Mandarin pinyin</li> <li>*syllable: syllable-level time boundaries,&nbsp;labeled in Mandarin pinyin</li> <li>*phoneme: phoneme-level time boundaries, labeled in X-SAMPA</li> </ol> </li> </ol> <p>The boundaries (onset and offset) have been annotated in both <strong>Praat TextGrid (textgrid.zip)</strong> and .<strong>txt (annotation_txt.zip)</strong> format hierarchically:</p> <ol> <li>phrase (line),</li> <li>syllable,</li> <li>phoneme</li> </ol> <p>Singing units in pinyin and X-SAMPA have been annotated to a jingju&nbsp;a cappella singing audio dataset.</p> <p>The corresponding audio files are the a cappella singing arias recordings, which are stereo or mono, sampled at 44.1 kHz, and stored as .wav files. The .wav files are recorded by two institutes: those file names ending with &lsquo;qm&rsquo; are recorded by C4DM, Queen Mary University of London; others file names ending with &lsquo;upf&rsquo; or &lsquo;lon&rsquo; are recorded by MTG-UPF. Additionally, another collection of 15 clean singing recordings is included in this dataset. They are extracted from the commercial recordings which originally contains karaoke accompaniment and mixed versions.</p> <p><strong>If you use this audio dataset in your work, please cite (1) this dataset as well (2) the following publication:</strong></p> <blockquote> <p>D. A. A. Black, M. Li, and M. Tian, &ldquo;Automatic Identification of Emotional Cues in Chinese Opera Singing,&rdquo; in 13th Int. Conf. on Music Perception and Cognition (ICMPC-2014), 2014, pp. 250&ndash;255.</p> </blockquote> <p>&nbsp;</p> <p><strong>Details:</strong><br> Annotation format, units, parsing code and other information please refer to <a href="https://github.com/MTG/jingjuPhonemeAnnotation">https://github.com/MTG/jingjuPhonemeAnnotation</a></p> <p><br> <strong>License:</strong><br> Textgrid annotations are licensed under Creative Commons Attribution-NonCommercial&nbsp;4.0 International License.</p> <p>Wav audio ending with &lsquo;upf&rsquo; or &lsquo;lon&rsquo; is licensed under&nbsp;Creative Commons Attribution-NonCommercial&nbsp;4.0 International.</p> <p>For the license of .wav audio ending with &lsquo;qm&rsquo; from C4DM Queen Mary University of London, please refer to this page <a href="http://isophonics.org/SingingVoiceDataset">http://isophonics.org/SingingVoiceDataset</a></p> <p><strong>Contact information:</strong></p> <p>Rong Gong: rong&lt;dot&gt;gong&lt;at&gt;upf&lt;dot&gt;edu</p> <p>Rafael Caro Repetto: rafael&lt;dot&gt;caro&lt;at&gt;upf&lt;dot&gt;edu</p></subfield> </datafield> <datafield tag="773" ind1=" " ind2=" "> <subfield code="n">doi</subfield> <subfield code="i">isVersionOf</subfield> <subfield code="a">10.5281/zenodo.780559</subfield> </datafield> <datafield tag="024" ind1=" " ind2=" "> <subfield code="a">10.5281/zenodo.1323561</subfield> <subfield code="2">doi</subfield> </datafield> <datafield tag="980" ind1=" " ind2=" "> <subfield code="a">dataset</subfield> </datafield> </record>
All versions | This version | |
---|---|---|
Views | 1,417 | 531 |
Downloads | 1,186 | 494 |
Data volume | 619.2 GB | 128.5 GB |
Unique views | 1,160 | 473 |
Unique downloads | 304 | 119 |