Dataset Open Access
Rong Gong;
Rafael Caro Repetto;
Yile Yang;
Xavier Serra
<?xml version='1.0' encoding='utf-8'?> <resource xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns="http://datacite.org/schema/kernel-4" xsi:schemaLocation="http://datacite.org/schema/kernel-4 http://schema.datacite.org/meta/kernel-4.1/metadata.xsd"> <identifier identifierType="DOI">10.5281/zenodo.1244720</identifier> <creators> <creator> <creatorName>Rong Gong</creatorName> <nameIdentifier nameIdentifierScheme="ORCID" schemeURI="http://orcid.org/">0000-0002-4659-9034</nameIdentifier> <affiliation>Music Technology Group - Universitat Pompeu Fabra</affiliation> </creator> <creator> <creatorName>Rafael Caro Repetto</creatorName> <affiliation>Music Technology Group - Universitat Pompeu Fabra</affiliation> </creator> <creator> <creatorName>Yile Yang</creatorName> <affiliation>Music Technology Group - Universitat Pompeu Fabra</affiliation> </creator> <creator> <creatorName>Xavier Serra</creatorName> <nameIdentifier nameIdentifierScheme="ORCID" schemeURI="http://orcid.org/">0000-0003-1395-2345</nameIdentifier> <affiliation>Music Technology Group - Universitat Pompeu Fabra</affiliation> </creator> </creators> <titles> <title>Jingju a cappella singing dataset part1</title> </titles> <publisher>Zenodo</publisher> <publicationYear>2018</publicationYear> <subjects> <subject>Beijing opera</subject> <subject>annotation</subject> <subject>phoneme</subject> <subject>syllable</subject> <subject>phrase</subject> <subject>singing voice</subject> <subject>praat</subject> <subject>textgrid</subject> <subject>wave audio</subject> <subject>jingju</subject> <subject>MTG</subject> <subject>C4DM</subject> <subject>a cappella</subject> </subjects> <dates> <date dateType="Issued">2018-05-10</date> </dates> <resourceType resourceTypeGeneral="Dataset"/> <alternateIdentifiers> <alternateIdentifier alternateIdentifierType="url">https://zenodo.org/record/1244720</alternateIdentifier> </alternateIdentifiers> <relatedIdentifiers> <relatedIdentifier relatedIdentifierType="DOI" 
relationType="IsVersionOf">10.5281/zenodo.780559</relatedIdentifier> <relatedIdentifier relatedIdentifierType="URL" relationType="IsPartOf">https://zenodo.org/communities/mdm-dtic-upf</relatedIdentifier> <relatedIdentifier relatedIdentifierType="URL" relationType="IsPartOf">https://zenodo.org/communities/mir</relatedIdentifier> <relatedIdentifier relatedIdentifierType="URL" relationType="IsPartOf">https://zenodo.org/communities/mtgupf</relatedIdentifier> </relatedIdentifiers> <version>5</version> <rightsList> <rights rightsURI="https://creativecommons.org/licenses/by-nc/4.0/legalcode">Creative Commons Attribution Non Commercial 4.0 International</rights> <rights rightsURI="info:eu-repo/semantics/openAccess">Open Access</rights> </rightsList> <descriptions> <description descriptionType="Abstract"><p>This is the 4th version of the dataset. The folder structure has changed since the 2nd version: the Laosheng folder has been moved directly into the .wav or textgrid&nbsp;folder.</p> <p><strong>Description:</strong></p> <p>This dataset is a collection of boundary annotations of a cappella singing performed by Beijing Opera (Jingju, 京剧) professional and amateur singers.&nbsp;</p> <ol> <li>wav.zip: audio files in .wav format, mono or stereo.</li> <li>wav_mono.zip: audio files in .wav&nbsp;format, mono.</li> <li>annotation_txt.zip: line, syllable and phoneme time boundaries (in seconds) and labels in .txt format.</li> <li>textgrid.zip: line, syllable and phoneme annotations in Praat .textgrid format.</li> <li>pycode.zip: utility code for parsing the .textgrid annotations.</li> <li>catalogue*.csv: recording metadata; source separation recordings are not included.</li> </ol> <p>The boundaries (onset and offset) have been annotated hierarchically in both <strong>Praat TextGrid (textgrid.zip)</strong> and <strong>.txt (annotation_txt.zip)</strong> formats:</p> <ol> <li>Line (phrase),</li> <li>syllable,</li> <li>phoneme</li> </ol> <p>Singing units in pinyin and X-SAMPA have been 
annotated to a jingju&nbsp;a cappella singing audio dataset.</p> <p>The corresponding audio files are recordings of a cappella aria singing, stereo or mono, sampled at 44.1 kHz and stored as .wav files. The .wav files were recorded by two institutes: files whose names end with &lsquo;qm&rsquo; were recorded by C4DM, Queen Mary University of London; files whose names end with &lsquo;upf&rsquo; or &lsquo;lon&rsquo; were recorded by MTG-UPF. Additionally, a collection of 15 clean singing recordings is included in this dataset. They were extracted from commercial recordings which originally contain karaoke accompaniment and mixed versions.</p> <p><strong>If you use this audio dataset in your work, please cite (1) this dataset as well as (2) the following publication:</strong></p> <blockquote> <p>D. A. A. Black, M. Li, and M. Tian, &ldquo;Automatic Identification of Emotional Cues in Chinese Opera Singing,&rdquo; in 13th Int. Conf. on Music Perception and Cognition (ICMPC-2014), 2014, pp. 
250&ndash;255.</p> </blockquote> <p>&nbsp;</p> <p><strong>Details:</strong><br> For the annotation format, units, parsing code and other information, please refer to <a href="https://github.com/MTG/jingjuPhonemeAnnotation">https://github.com/MTG/jingjuPhonemeAnnotation</a></p> <p><br> <strong>License:</strong><br> Textgrid annotations are licensed under Creative Commons Attribution-NonCommercial&nbsp;4.0 International License.</p> <p>Wav audio ending with &lsquo;upf&rsquo; or &lsquo;lon&rsquo; is licensed under&nbsp;Creative Commons Attribution-NonCommercial&nbsp;4.0 International.</p> <p>For the license of .wav audio ending with &lsquo;qm&rsquo; from C4DM Queen Mary University of London, please refer to this page <a href="http://isophonics.org/SingingVoiceDataset">http://isophonics.org/SingingVoiceDataset</a></p> <p><strong>Contact information:</strong></p> <p>Rong Gong: rong&lt;dot&gt;gong&lt;at&gt;upf&lt;dot&gt;edu</p> <p>Rafael Caro Repetto: rafael&lt;dot&gt;caro&lt;at&gt;upf&lt;dot&gt;edu</p></description> </descriptions> <fundingReferences> <fundingReference> <funderName>European Commission</funderName> <funderIdentifier funderIdentifierType="Crossref Funder ID">10.13039/501100000780</funderIdentifier> <awardNumber awardURI="info:eu-repo/grantAgreement/EC/FP7/267583/">267583</awardNumber> <awardTitle>Computational models for the discovery of the world's music</awardTitle> </fundingReference> </fundingReferences> </resource>
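The description ties each .wav file's recording institute and audio license to the ending of its file name. A minimal Python sketch of that lookup follows. It assumes the ending is matched against the file name without its extension and compared case-insensitively; the function name, the mapping and the file names in the usage note are hypothetical, not part of pycode.zip.

```python
from pathlib import Path

# File-name ending -> (recording institute, audio license note), as stated in
# the dataset description. The dictionary layout itself is an assumption.
SOURCES = {
    "qm": ("C4DM, Queen Mary University of London",
           "see http://isophonics.org/SingingVoiceDataset"),
    "upf": ("MTG-UPF", "CC BY-NC 4.0"),
    "lon": ("MTG-UPF", "CC BY-NC 4.0"),
}


def recording_source(filename):
    """Return (institute, license note) for a .wav file name.

    Returns None when the name ends with none of the documented endings,
    which may be the case for the clean recordings extracted from
    commercial releases.
    """
    # Assumption: the ending applies to the name without the .wav extension.
    stem = Path(filename).stem.lower()
    for ending, info in SOURCES.items():
        if stem.endswith(ending):
            return info
    return None
```

For example, `recording_source("example_aria_qm.wav")` returns the C4DM entry, while a name with no documented ending returns `None` so the caller can handle it explicitly.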
Metric | All versions | This version |
---|---|---|
Views | 1,417 | 24 |
Downloads | 1,186 | 25 |
Data volume | 619.2 GB | 3.1 GB |
Unique views | 1,160 | 24 |
Unique downloads | 304 | 4 |