Published January 14, 2022 | Version 1.0.0
Dataset Open

Cantoría Dataset

  • 1. Music Technology Group, Universitat Pompeu Fabra
  • 2. Music Technology Group, Universitat Pompeu Fabra / European Commission, Joint Research Centre


Cantoría dataset is a multi-track dataset of 11 songs performed by the professional vocal quartet Cantoría, specialized in the performance of vocal polyphony from the Iberian Golden Age repertoire.

Cantoría dataset is released as part of the following Ph.D. dissertation, which should be cited when using the dataset:

Helena Cuesta (2022). Data-driven Pitch Content Description of Choral Singing Recordings. PhD thesis, Universitat Pompeu Fabra, Barcelona.

(Available online:

The dataset includes the multi-track recordings and automatically extracted F0 trajectories. Particularly, we provide F0 trajectories extracted with pYIN [1] and CREPE [2].

Cantoría dataset contains the following songs:

  1. Sus sus sus, written by Bartomeu Cáceres.
  2. Riu riu chiu, an anonymous villancico.
  3. El Jubilate, written by Mateo Flecha “el viejo”.
  4. Virgen Bendita sin par, written by Pedro de Escobar.
  5. Hoy comamos y bebamos, written by Juan del Encina.
  6. La Negrina, written by Mateo Flecha “el viejo”.
  7. Teresica hermana, written by Mateo Flecha “el viejo”.
  8. Corten espadas afiladas, an anonymous secular villancico.
  9. La Justa, written by Mateo Flecha “el viejo”.
  10. La Bomba, written by Mateo Flecha “el viejo”.
  11. Yo me soy la morenica, an anonymous secular villancico.

The recorded pieces are accompanied by an organ, recorded at the beginning as a reference track with electronic organ. After the accompaniment recording, each singer was recorded separately, singing all songs.

Cantoría dataset comprises one full run of each song, performed by the SATB quartet and the organ. It includes each individual audio track and the SATB mixture of the four singers, with and without the organ.

The dataset is compressed into a zip file, which also includes a README file with specific information about the folder structure and filenames.

[1] Mauch, Matthias, and Simon Dixon. "pYIN: A fundamental frequency estimator using probabilistic threshold distributions.". In Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP). 2014.

[2] Kim, J. W., Salamon, J., Li, P., & Bello, J. P. "CREPE: A Convolutional Representation for Pitch Estimation. In Proceedings of the International Conference on Acoustics, Speech and Signal Processing (ICASSP). 2018.



The recording and curation of this dataset is supported by the European Commission under the TROMPA project (H2020 770376) and the Spanish Ministry of Science and Innovation under the Musical AI project (PID2019-111403GB-I00).


Files (863.2 MB)

Name Size Download all
863.2 MB Preview Download

Additional details


TROMPA – Towards Richer Online Music Public-domain Archives 770376
European Commission