Dataset Open Access
ESMUC Choir Dataset is a multi-track dataset of Western choral music that contains individual audio recordings of 12 singers, all of them undergraduate students in vocal performance at Escola Superior de Música de Catalunya (ESMUC), the professional music school in Barcelona (Spain), at the time of recording.
ECD is released as part of the following Ph.D. dissertation:
Helena Cuesta. Data-driven Pitch Content Description of Choral Singing Recordings. PhD thesis, Universitat Pompeu Fabra. 2022 (to appear).
Singers are unevenly distributed into Soprano, Alto, Tenor, and Bass (SATB) sections and were recorded simultaneously. A close-up microphone captured each voice, and the whole choir sound was captured by two stereo room microphones placed at two different distances from the singers. ECD comprises all audio files from the multi-track recording and manually corrected annotations of F0 contours and notes.
ECD includes the following pieces:
All singers' tracks are mono audio files, and the two room mics are stereo, all using a sampling rate of 44 100 Hz. The total duration of accumulated audio for the entire dataset is roughly 31 minutes.
ECD contains three songs, two of them recorded in shorter parts, as well as some brief voice warm-up exercises. For each of the songs, the dataset presents three modalities:
Songs and warm-up exercises are organized in Takes, which are numbered, i.e., take1 or take3. For the IS setting, filenames refer to the choir section. Similarly, the short passages are indicated by SE and the passage number. Finally, we denote each singer using S/A/T/B and a number, e.g., T3 refers to the third tenor.
All audio tracks from the dataset, except the room microphones, have two associated annotation files: one for the F0 contour, and a second one with the note annotations. Tracks from the warm-up exercises only have F0 contours, since there is no associated score to them.
A README file accompanies the dataset with specific information about the filenames.
All dataset files and the README are compressed in the provided zip file.
Special cases:
Known issues:
Name | Size | |
---|---|---|
EsmucChoirDataset_v1.0.0.zip
md5:ba2b4b5c4326dbe0a6d391167fa30574 |
2.3 GB | Download |
All versions | This version | |
---|---|---|
Views | 329 | 329 |
Downloads | 85 | 85 |
Data volume | 198.7 GB | 198.7 GB |
Unique views | 279 | 279 |
Unique downloads | 69 | 69 |