Dataset Open Access
Cantoría dataset is a multi-track dataset of 11 songs performed by the professional vocal quartet Cantoría, specialized in the performance of vocal polyphony from the Iberian Golden Age repertoire.
Cantoría dataset is released as part of the following Ph.D. dissertation, which should be cited when using the dataset:
Helena Cuesta (2022). Data-driven Pitch Content Description of Choral Singing Recordings. PhD thesis, Universitat Pompeu Fabra, Barcelona.
(Available online: https://zenodo.org/record/6389642).
The dataset includes the multi-track recordings and automatically extracted F0 trajectories. Particularly, we provide F0 trajectories extracted with pYIN  and CREPE .
Cantoría dataset contains the following songs:
The recorded pieces are accompanied by an organ, recorded at the beginning as a reference track with electronic organ. After the accompaniment recording, each singer was recorded separately, singing all songs.
Cantoría dataset comprises one full run of each song, performed by the SATB quartet and the organ. It includes each individual audio track and the SATB mixture of the four singers, with and without the organ.
The dataset is compressed into a zip file, which also includes a README file with specific information about the folder structure and filenames.
 Mauch, Matthias, and Simon Dixon. "pYIN: A fundamental frequency estimator using probabilistic threshold distributions.". In Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP). 2014.
 Kim, J. W., Salamon, J., Li, P., & Bello, J. P. "CREPE: A Convolutional Representation for Pitch Estimation. In Proceedings of the International Conference on Acoustics, Speech and Signal Processing (ICASSP). 2018.