There is a newer version of this record available.

Dataset Open Access

Jingju a cappella singing dataset part1

Rong Gong; Rafael Caro Repetto; Yile Yang; Xavier Serra


DataCite XML Export

<?xml version='1.0' encoding='utf-8'?>
<resource xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns="http://datacite.org/schema/kernel-4" xsi:schemaLocation="http://datacite.org/schema/kernel-4 http://schema.datacite.org/meta/kernel-4.1/metadata.xsd">
  <identifier identifierType="DOI">10.5281/zenodo.1244720</identifier>
  <creators>
    <creator>
      <creatorName>Rong Gong</creatorName>
      <nameIdentifier nameIdentifierScheme="ORCID" schemeURI="http://orcid.org/">0000-0002-4659-9034</nameIdentifier>
      <affiliation>Music Technology Group - Universitat Pompeu Fabra</affiliation>
    </creator>
    <creator>
      <creatorName>Rafael Caro Repetto</creatorName>
      <affiliation>Music Technology Group - Universitat Pompeu Fabra</affiliation>
    </creator>
    <creator>
      <creatorName>Yile Yang</creatorName>
      <affiliation>Music Technology Group - Universitat Pompeu Fabra</affiliation>
    </creator>
    <creator>
      <creatorName>Xavier Serra</creatorName>
      <nameIdentifier nameIdentifierScheme="ORCID" schemeURI="http://orcid.org/">0000-0003-1395-2345</nameIdentifier>
      <affiliation>Music Technology Group - Universitat Pompeu Fabra</affiliation>
    </creator>
  </creators>
  <titles>
    <title>Jingju a cappella singing dataset part1</title>
  </titles>
  <publisher>Zenodo</publisher>
  <publicationYear>2018</publicationYear>
  <subjects>
    <subject>Beijing opera</subject>
    <subject>annotation</subject>
    <subject>phoneme</subject>
    <subject>syllable</subject>
    <subject>phrase</subject>
    <subject>singing voice</subject>
    <subject>praat</subject>
    <subject>textgrid</subject>
    <subject>wave audio</subject>
    <subject>jingju</subject>
    <subject>MTG</subject>
    <subject>C4DM</subject>
    <subject>a cappella</subject>
  </subjects>
  <dates>
    <date dateType="Issued">2018-05-10</date>
  </dates>
  <resourceType resourceTypeGeneral="Dataset"/>
  <alternateIdentifiers>
    <alternateIdentifier alternateIdentifierType="url">https://zenodo.org/record/1244720</alternateIdentifier>
  </alternateIdentifiers>
  <relatedIdentifiers>
    <relatedIdentifier relatedIdentifierType="DOI" relationType="IsVersionOf">10.5281/zenodo.780559</relatedIdentifier>
    <relatedIdentifier relatedIdentifierType="URL" relationType="IsPartOf">https://zenodo.org/communities/mdm-dtic-upf</relatedIdentifier>
    <relatedIdentifier relatedIdentifierType="URL" relationType="IsPartOf">https://zenodo.org/communities/mir</relatedIdentifier>
    <relatedIdentifier relatedIdentifierType="URL" relationType="IsPartOf">https://zenodo.org/communities/mtgupf</relatedIdentifier>
  </relatedIdentifiers>
  <version>5</version>
  <rightsList>
    <rights rightsURI="https://creativecommons.org/licenses/by-nc/4.0/legalcode">Creative Commons Attribution Non Commercial 4.0 International</rights>
    <rights rightsURI="info:eu-repo/semantics/openAccess">Open Access</rights>
  </rightsList>
  <descriptions>
    <description descriptionType="Abstract">&lt;p&gt;This is the 4th version of the dataset. The folder structure has been changed since the 2nd version, where the Laosheng folder has been moved directly into .wav or textgrid&amp;nbsp;folder.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Description:&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;This dataset is a collection of boundary annotations of a cappella singing performed by Beijing Opera (Jingju, 京剧) professional and amateur singers.&amp;nbsp;&lt;/p&gt;

&lt;ol&gt;
	&lt;li&gt;wav.zip: audio files in .wav format, mono or stereo.&lt;/li&gt;
	&lt;li&gt;wav_mono.zip: audio files in .wav&amp;nbsp;format, mono&lt;/li&gt;
	&lt;li&gt;annotation_txt.zip: line, syllable and phoneme time boundaries (second) and labels in .txt format&lt;/li&gt;
	&lt;li&gt;textgrid.zip: line, syllable and phoneme annotation in Praat .textgrid format&lt;/li&gt;
	&lt;li&gt;pycode.zip: util code for parsing the .textgrid annotation&lt;/li&gt;
	&lt;li&gt;catalogue*.csv: recording metadata, source separation recordings are not included.&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;The boundaries (onset and offset) have been annotated in both &lt;strong&gt;Praat TextGrid (textgrid.zip)&lt;/strong&gt; and .&lt;strong&gt;txt (annotation_txt.zip)&lt;/strong&gt; format hierarchically:&lt;/p&gt;

&lt;ol&gt;
	&lt;li&gt;Line (phrase),&lt;/li&gt;
	&lt;li&gt;syllable,&lt;/li&gt;
	&lt;li&gt;phoneme&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;Singing units in pinyin and X-SAMPA have been annotated to a jingju&amp;nbsp;a cappella singing audio dataset.&lt;/p&gt;

&lt;p&gt;The corresponding audio files are the a cappella singing arias recordings, which are stereo or mono, sampled at 44.1 kHz, and stored as .wav files. The .wav files are recorded by two institutes: those file names ending with &amp;lsquo;qm&amp;rsquo; are recorded by C4DM, Queen Mary University of London; others file names ending with &amp;lsquo;upf&amp;rsquo; or &amp;lsquo;lon&amp;rsquo; are recorded by MTG-UPF. Additionally, another collection of 15 clean singing recordings is included in this dataset. They are extracted from the commercial recordings which originally contains karaoke accompaniment and mixed versions.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;If you use this audio dataset in your work, please cite (1) this dataset as well (2) the following publication:&lt;/strong&gt;&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;D. A. A. Black, M. Li, and M. Tian, &amp;ldquo;Automatic Identification of Emotional Cues in Chinese Opera Singing,&amp;rdquo; in 13th Int. Conf. on Music Perception and Cognition (ICMPC-2014), 2014, pp. 250&amp;ndash;255.&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;&amp;nbsp;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Details:&lt;/strong&gt;&lt;br&gt;
Annotation format, units, parsing code and other information please refer to &lt;a href="https://github.com/MTG/jingjuPhonemeAnnotation"&gt;https://github.com/MTG/jingjuPhonemeAnnotation&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;br&gt;
&lt;strong&gt;License:&lt;/strong&gt;&lt;br&gt;
Textgrid annotations are licensed under Creative Commons Attribution-NonCommercial&amp;nbsp;4.0 International License.&lt;/p&gt;

&lt;p&gt;Wav audio ending with &amp;lsquo;upf&amp;rsquo; or &amp;lsquo;lon&amp;rsquo; is licensed under&amp;nbsp;Creative Commons Attribution-NonCommercial&amp;nbsp;4.0 International.&lt;/p&gt;

&lt;p&gt;For the license of .wav audio ending with &amp;lsquo;qm&amp;rsquo; from C4DM Queen Mary University of London, please refer to this page &lt;a href="http://isophonics.org/SingingVoiceDataset"&gt;http://isophonics.org/SingingVoiceDataset&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Contact information:&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Rong Gong: rong&amp;lt;dot&amp;gt;gong&amp;lt;at&amp;gt;upf&amp;lt;dot&amp;gt;edu&lt;/p&gt;

&lt;p&gt;Rafael Caro Repetto: rafael&amp;lt;dot&amp;gt;caro&amp;lt;at&amp;gt;upf&amp;lt;dot&amp;gt;edu&lt;/p&gt;</description>
  </descriptions>
  <fundingReferences>
    <fundingReference>
      <funderName>European Commission</funderName>
      <funderIdentifier funderIdentifierType="Crossref Funder ID">10.13039/501100000780</funderIdentifier>
      <awardNumber awardURI="info:eu-repo/grantAgreement/EC/FP7/267583/">267583</awardNumber>
      <awardTitle>Computational models for the discovery of the world's music</awardTitle>
    </fundingReference>
  </fundingReferences>
</resource>
1,417
1,186
views
downloads
All versions This version
Views 1,41724
Downloads 1,18625
Data volume 619.2 GB3.1 GB
Unique views 1,16024
Unique downloads 3044

Share

Cite as