Dataset Open Access

VocalSet: A Singing Voice Dataset

Wilkins, Julia; Prem Seetharaman; Alison Wahl; Bryan Pardo


JSON-LD (schema.org) Export

{
  "description": "<p><strong>NEW IN VocalSet 1.2:</strong> We now have 3 file organization versions:</p>\n\n<ol>\n\t<li>Files organized by singer</li>\n\t<li>Files organized by technique</li>\n\t<li>Files organized by vowel</li>\n</ol>\n\n<p>We hope that this will ease the process of training and testing models using these different attributes of the dataset.</p>\n\n<p>&nbsp;</p>\n\n<p><strong>Overview:</strong></p>\n\n<p>We present VocalSet, a singing voice dataset consisting of 10.1 hours of monophonic recorded audio of professional singers demonstrating both standard and extended vocal techniques on all 5 vowels. Existing singing voice datasets aim to capture a focused subset of singing voice characteristics, and generally consist of just a few singers. VocalSet contains recordings from 20 different singers (9 male, 11 female) and a range of voice types. &nbsp;VocalSet aims to improve the state of existing singing voice datasets and singing voice research by capturing not only a range of vowels, but also a diverse set of voices on many different vocal techniques, sung in contexts of scales, arpeggios, long tones, and excerpts.</p>\n\n<p>We have included two .txt files &#39;train_singers_technique.txt &#39;and &#39;test_singers_technique.txt&#39;&nbsp;in which you will find a list of the singers we used to train and test our technique classifier on. &#39;DataSetVocalises.pdf&#39; contains the sheet singers sang from in their recording sessions. &#39;readme-anon.txt&#39; contains more information about the dataset, including the mapping from filename to singer voice type as well as more information on the vocalises that will help you map files to sheet music. Enjoy and please cite accordingly!</p>", 
  "license": "http://creativecommons.org/licenses/by/4.0/legalcode", 
  "creator": [
    {
      "affiliation": "Northwestern University", 
      "@type": "Person", 
      "name": "Wilkins, Julia"
    }, 
    {
      "affiliation": "Northwestern University", 
      "@type": "Person", 
      "name": "Prem Seetharaman"
    }, 
    {
      "affiliation": "Northwestern University", 
      "@type": "Person", 
      "name": "Alison Wahl"
    }, 
    {
      "affiliation": "Northwestern University", 
      "@type": "Person", 
      "name": "Bryan Pardo"
    }
  ], 
  "url": "https://zenodo.org/record/1442513", 
  "datePublished": "2018-03-08", 
  "version": "1.2", 
  "keywords": [
    "singing", 
    "voice", 
    "singing dataset", 
    "music information retrieval", 
    "vocal technique", 
    "vowel classification", 
    "sung voice"
  ], 
  "@context": "https://schema.org/", 
  "distribution": [
    {
      "contentUrl": "https://zenodo.org/api/files/bb929e07-4372-4d57-9273-a9f1458854a1/VocalSet11.zip", 
      "@type": "DataDownload", 
      "fileFormat": "zip"
    }, 
    {
      "contentUrl": "https://zenodo.org/api/files/bb929e07-4372-4d57-9273-a9f1458854a1/VocalSet1-2.zip", 
      "@type": "DataDownload", 
      "fileFormat": "zip"
    }
  ], 
  "identifier": "https://doi.org/10.5281/zenodo.1442513", 
  "@id": "https://doi.org/10.5281/zenodo.1442513", 
  "@type": "Dataset", 
  "name": "VocalSet: A Singing Voice Dataset"
}
2,051
5,783
views
downloads
All versions This version
Views 2,051780
Downloads 5,783547
Data volume 13.4 TB2.6 TB
Unique views 1,734701
Unique downloads 1,403276

Share

Cite as