There is a newer version of this record available.

Dataset Open Access

Jingju a cappella singing dataset part1

Rong Gong; Rafael Caro Repetto; Yile Yang; Xavier Serra


JSON Export

{
  "files": [
    {
      "links": {
        "self": "https://zenodo.org/api/files/d986ed63-86c7-4534-826c-b394706994a2/annotation_txt.zip"
      }, 
      "checksum": "md5:dfb3bfc0322ff3144f713bcaef39d534", 
      "bucket": "d986ed63-86c7-4534-826c-b394706994a2", 
      "key": "annotation_txt.zip", 
      "type": "zip", 
      "size": 217942
    }, 
    {
      "links": {
        "self": "https://zenodo.org/api/files/d986ed63-86c7-4534-826c-b394706994a2/catalogue%20-%20dan.csv"
      }, 
      "checksum": "md5:ffaa9c074e556e1be45f3e6231cdcdd9", 
      "bucket": "d986ed63-86c7-4534-826c-b394706994a2", 
      "key": "catalogue - dan.csv", 
      "type": "csv", 
      "size": 4817
    }, 
    {
      "links": {
        "self": "https://zenodo.org/api/files/d986ed63-86c7-4534-826c-b394706994a2/catalogue%20-%20laosheng.csv"
      }, 
      "checksum": "md5:768fa00ce1f8880ae5480fae103ecc06", 
      "bucket": "d986ed63-86c7-4534-826c-b394706994a2", 
      "key": "catalogue - laosheng.csv", 
      "type": "csv", 
      "size": 3397
    }, 
    {
      "links": {
        "self": "https://zenodo.org/api/files/d986ed63-86c7-4534-826c-b394706994a2/pycode.zip"
      }, 
      "checksum": "md5:1e4c9b2a9a584d13736196fff6e41951", 
      "bucket": "d986ed63-86c7-4534-826c-b394706994a2", 
      "key": "pycode.zip", 
      "type": "zip", 
      "size": 17485
    }, 
    {
      "links": {
        "self": "https://zenodo.org/api/files/d986ed63-86c7-4534-826c-b394706994a2/readme.txt"
      }, 
      "checksum": "md5:f1113d4c03b379a6a23d85e2c215d54b", 
      "bucket": "d986ed63-86c7-4534-826c-b394706994a2", 
      "key": "readme.txt", 
      "type": "txt", 
      "size": 2017
    }, 
    {
      "links": {
        "self": "https://zenodo.org/api/files/d986ed63-86c7-4534-826c-b394706994a2/textgrid.zip"
      }, 
      "checksum": "md5:8088161679f519d13f96dc1be9f53bdd", 
      "bucket": "d986ed63-86c7-4534-826c-b394706994a2", 
      "key": "textgrid.zip", 
      "type": "zip", 
      "size": 1241608
    }, 
    {
      "links": {
        "self": "https://zenodo.org/api/files/d986ed63-86c7-4534-826c-b394706994a2/wav_mono.zip"
      }, 
      "checksum": "md5:4506a948480ff4d46d487148e7528f82", 
      "bucket": "d986ed63-86c7-4534-826c-b394706994a2", 
      "key": "wav_mono.zip", 
      "type": "zip", 
      "size": 686516146
    }, 
    {
      "links": {
        "self": "https://zenodo.org/api/files/d986ed63-86c7-4534-826c-b394706994a2/wav.zip"
      }, 
      "checksum": "md5:4722abda831c20b169a62b2754b15bea", 
      "bucket": "d986ed63-86c7-4534-826c-b394706994a2", 
      "key": "wav.zip", 
      "type": "zip", 
      "size": 868953391
    }
  ], 
  "owners": [
    27301
  ], 
  "doi": "10.5281/zenodo.1244720", 
  "stats": {
    "version_unique_downloads": 304.0, 
    "unique_views": 24.0, 
    "views": 24.0, 
    "version_views": 1417.0, 
    "unique_downloads": 4.0, 
    "version_unique_views": 1160.0, 
    "volume": 3115627028.0, 
    "version_downloads": 1186.0, 
    "downloads": 25.0, 
    "version_volume": 619161382257.0
  }, 
  "links": {
    "doi": "https://doi.org/10.5281/zenodo.1244720", 
    "conceptdoi": "https://doi.org/10.5281/zenodo.780559", 
    "bucket": "https://zenodo.org/api/files/d986ed63-86c7-4534-826c-b394706994a2", 
    "conceptbadge": "https://zenodo.org/badge/doi/10.5281/zenodo.780559.svg", 
    "html": "https://zenodo.org/record/1244720", 
    "latest_html": "https://zenodo.org/record/1323561", 
    "badge": "https://zenodo.org/badge/doi/10.5281/zenodo.1244720.svg", 
    "latest": "https://zenodo.org/api/records/1323561"
  }, 
  "conceptdoi": "10.5281/zenodo.780559", 
  "created": "2018-05-10T13:31:54.768680+00:00", 
  "updated": "2020-01-24T19:26:04.656683+00:00", 
  "conceptrecid": "780559", 
  "revision": 9, 
  "id": 1244720, 
  "metadata": {
    "access_right_category": "success", 
    "doi": "10.5281/zenodo.1244720", 
    "version": "5", 
    "license": {
      "id": "CC-BY-NC-4.0"
    }, 
    "title": "Jingju a cappella singing dataset part1", 
    "related_identifiers": [
      {
        "scheme": "doi", 
        "identifier": "10.5281/zenodo.780559", 
        "relation": "isVersionOf"
      }
    ], 
    "relations": {
      "version": [
        {
          "count": 7, 
          "index": 4, 
          "parent": {
            "pid_type": "recid", 
            "pid_value": "780559"
          }, 
          "is_last": false, 
          "last_child": {
            "pid_type": "recid", 
            "pid_value": "1323561"
          }
        }
      ]
    }, 
    "communities": [
      {
        "id": "mdm-dtic-upf"
      }, 
      {
        "id": "mir"
      }, 
      {
        "id": "mtgupf"
      }
    ], 
    "grants": [
      {
        "code": "267583", 
        "links": {
          "self": "https://zenodo.org/api/grants/10.13039/501100000780::267583"
        }, 
        "title": "Computational models for the discovery of the world's music", 
        "acronym": "COMPMUSIC", 
        "program": "FP7", 
        "funder": {
          "doi": "10.13039/501100000780", 
          "acronyms": [], 
          "name": "European Commission", 
          "links": {
            "self": "https://zenodo.org/api/funders/10.13039/501100000780"
          }
        }
      }
    ], 
    "keywords": [
      "Beijing opera", 
      "annotation", 
      "phoneme", 
      "syllable", 
      "phrase", 
      "singing voice", 
      "praat", 
      "textgrid", 
      "wave audio", 
      "jingju", 
      "MTG", 
      "C4DM", 
      "a cappella"
    ], 
    "publication_date": "2018-05-10", 
    "creators": [
      {
        "orcid": "0000-0002-4659-9034", 
        "affiliation": "Music Technology Group - Universitat Pompeu Fabra", 
        "name": "Rong Gong"
      }, 
      {
        "affiliation": "Music Technology Group - Universitat Pompeu Fabra", 
        "name": "Rafael Caro Repetto"
      }, 
      {
        "affiliation": "Music Technology Group - Universitat Pompeu Fabra", 
        "name": "Yile Yang"
      }, 
      {
        "orcid": "0000-0003-1395-2345", 
        "affiliation": "Music Technology Group - Universitat Pompeu Fabra", 
        "name": "Xavier Serra"
      }
    ], 
    "access_right": "open", 
    "resource_type": {
      "type": "dataset", 
      "title": "Dataset"
    }, 
    "description": "<p>This is the 4th version of the dataset. The folder structure has been changed since the 2nd version, where the Laosheng folder has been moved directly into .wav or textgrid&nbsp;folder.</p>\n\n<p><strong>Description:</strong></p>\n\n<p>This dataset is a collection of boundary annotations of a cappella singing performed by Beijing Opera (Jingju, \u4eac\u5267) professional and amateur singers.&nbsp;</p>\n\n<ol>\n\t<li>wav.zip: audio files in .wav format, mono or stereo.</li>\n\t<li>wav_mono.zip: audio files in .wav&nbsp;format, mono</li>\n\t<li>annotation_txt.zip: line, syllable and phoneme time boundaries (second) and labels in .txt format</li>\n\t<li>textgrid.zip: line, syllable and phoneme annotation in Praat .textgrid format</li>\n\t<li>pycode.zip: util code for parsing the .textgrid annotation</li>\n\t<li>catalogue*.csv: recording metadata, source separation recordings are not included.</li>\n</ol>\n\n<p>The boundaries (onset and offset) have been annotated in both <strong>Praat TextGrid (textgrid.zip)</strong> and .<strong>txt (annotation_txt.zip)</strong> format hierarchically:</p>\n\n<ol>\n\t<li>Line (phrase),</li>\n\t<li>syllable,</li>\n\t<li>phoneme</li>\n</ol>\n\n<p>Singing units in pinyin and X-SAMPA have been annotated to a jingju&nbsp;a cappella singing audio dataset.</p>\n\n<p>The corresponding audio files are the a cappella singing arias recordings, which are stereo or mono, sampled at 44.1 kHz, and stored as .wav files. The .wav files are recorded by two institutes: those file names ending with &lsquo;qm&rsquo; are recorded by C4DM, Queen Mary University of London; others file names ending with &lsquo;upf&rsquo; or &lsquo;lon&rsquo; are recorded by MTG-UPF. Additionally, another collection of 15 clean singing recordings is included in this dataset. They are extracted from the commercial recordings which originally contains karaoke accompaniment and mixed versions.</p>\n\n<p><strong>If you use this audio dataset in your work, please cite (1) this dataset as well (2) the following publication:</strong></p>\n\n<blockquote>\n<p>D. A. A. Black, M. Li, and M. Tian, &ldquo;Automatic Identification of Emotional Cues in Chinese Opera Singing,&rdquo; in 13th Int. Conf. on Music Perception and Cognition (ICMPC-2014), 2014, pp. 250&ndash;255.</p>\n</blockquote>\n\n<p>&nbsp;</p>\n\n<p><strong>Details:</strong><br>\nAnnotation format, units, parsing code and other information please refer to <a href=\"https://github.com/MTG/jingjuPhonemeAnnotation\">https://github.com/MTG/jingjuPhonemeAnnotation</a></p>\n\n<p><br>\n<strong>License:</strong><br>\nTextgrid annotations are licensed under Creative Commons Attribution-NonCommercial&nbsp;4.0 International License.</p>\n\n<p>Wav audio ending with &lsquo;upf&rsquo; or &lsquo;lon&rsquo; is licensed under&nbsp;Creative Commons Attribution-NonCommercial&nbsp;4.0 International.</p>\n\n<p>For the license of .wav audio ending with &lsquo;qm&rsquo; from C4DM Queen Mary University of London, please refer to this page <a href=\"http://isophonics.org/SingingVoiceDataset\">http://isophonics.org/SingingVoiceDataset</a></p>\n\n<p><strong>Contact information:</strong></p>\n\n<p>Rong Gong: rong&lt;dot&gt;gong&lt;at&gt;upf&lt;dot&gt;edu</p>\n\n<p>Rafael Caro Repetto: rafael&lt;dot&gt;caro&lt;at&gt;upf&lt;dot&gt;edu</p>"
  }
}
1,417
1,186
views
downloads
All versions This version
Views 1,41724
Downloads 1,18625
Data volume 619.2 GB3.1 GB
Unique views 1,16024
Unique downloads 3044

Share

Cite as