Planned intervention: On Wednesday April 3rd 05:30 UTC Zenodo will be unavailable for up to 2-10 minutes to perform a storage cluster upgrade.
Published July 27, 2018 | Version 1.0
Dataset Open

cante100 Metadata

  • 1. Universidad de Sevilla
  • 2. Music Technology Group, Universitat Pompeu Fabra, Barcelona, Spain

Description

The cante100 dataset contains 100 tracks taken from the corpus. We defined 10 style families of which 10 tracks each are included. Apart from the style family, we manually annotated the sections of the track in which the vocals are present. In addition, we provide a number of low-level descriptors and the fundamental frequency corresponding to the predominant melody for each track. The meta-information includes editoral meta-data and the musicBrainz ID.

Content:

  • README (5KB): Text file containing detailed descriptions of manual and automatic annotations.
  • meta-data (59KB): XML file containing meta-information: Source (anthology name, CD no. and track no.), editorial meta-data (artist name, title, style, musicBrainzID) and the manually annotated style family.
  • vocal sections (8.9MB): Text file (.csv) containing frame-wise vocal section annotations.
  • automatic transcriptions (375KB): Text files (.notes) and MIDI files (.mid) containing automatic note-level transcriptions of the singing voice.
  • Bark band energies (216.6MB): Text files (.csv) containing the frame-wise extracted bark band energies.
  • predominant melody (33.5MB): Text files (.csv) containing the frame-wise extracted predominant melody.
  • low-level descriptors (42.9MB): Text files (.csv) containing a set of frame-wise extracted low-level features.
  • MFCCs (97.1MB): Text files (.csv) containing the frame-wise extracted mel-frequency cepstral coefficients (MFCCs).
  • Magnitude spectrum (3.85GB): Text files (.csv) containing the frame-wise extracted magnitudes of the discrete fourier transform (DFT).

Publications

This work has been accepted for publication in the ACM Journal of Computation and Cultural heritage and is currently available in arXiv.

N. Kroher, J. M. Díaz-Báñez, J. Mora and E. Gómez (2015): Corpus COFLA: A research corpus for the Computational study of Flamenco Music. arXiv:1510.04029 [cs.SD cs.IR].

https://doi.org/10.1145/2875428

Conditions of use

The provided datasets are offered free of charge for internal non-commercial use. We do not grant any rights for redistribution or modification. All data collections were gathered by the COFLA team.
© COFLA 2015. All rights reserved.

 

cante100 Audio

Files

cante100_automaticTranscription.zip

Files (4.2 GB)

Name Size Download all
md5:47fea64c744f9fe678ae5642a8f0ee8e
375.1 kB Preview Download
md5:397b5fb8dcbab45fd6074fd5e32ee54a
216.6 MB Preview Download
md5:0983e68c7facc0fe85863ce1a78d2fa7
42.9 MB Preview Download
md5:3a934d39c20b37cc77acc398b46b8f5e
97.1 MB Preview Download
md5:184209b7e7d816fa603f0c7f481c0aae
4.9 kB Preview Download
md5:0b81fe0fd7ab2c1adc1ad789edb12981
3.8 GB Preview Download
md5:50aa325f81b73fceae817c8c40bef4c2
8.9 MB Download
md5:6cce186ce77a06541cdb9f0a671afb46
59.0 kB Preview Download
md5:cce543b5125eda5a984347b55fdcd5e8
33.5 MB Preview Download