Published September 15, 2022 | Version 1.0.0
Dataset Open

Annotated-VocalSet: A Singing Voice Dataset

  • 1. Department of Computer Science, Maynooth University, Maynooth, Co. Kildare, Ireland

Description

This dataset provides annotations for the VocalSet dataset, which is available online at

https://doi.org/10.5281/zenodo.1442513

.

The annotations generated for the VocalSet audio files include fundamental frequency contour, note onset, note offset, the transition between notes, note F0, note duration, Midi pitch, and lyrics.

VocalSet consists of more than 10 hours of monophonic recorded audio of professional singers in a variety of vocal techniques (n = 17) and several singers (m = 20) with several WAV files (p = 3560). However, although several categories, including techniques, singers, tempo, and loudness, are considered in the dataset, the sung notes were not annotated. Therefore, this dataset aims to annotate VocalSet to make it a more powerful dataset for researchers.

Details of the dataset are provided in the following academic journal paper.

Faghih, Behnam, and Joseph Timoney. 2022. "Annotated-VocalSet: A Singing Voice Dataset" Applied Sciences 12, no. 18: 9257. https://doi.org/10.3390/app12189257

Please use the above paper to cite this dataset.

Files

Annotated VocalSet.zip

Files (411.5 MB)

Name Size Download all
md5:f4324592f5edebcb9e572c01531ac63b
411.5 MB Preview Download

Additional details

Related works

Continues
Dataset: 10.5281/zenodo.1193957 (DOI)
Is described by
Journal article: 10.3390/app12189257 (DOI)