Dataset Open Access

cante100 Metadata

Nadine Kroher; José Miguel Díaz-Báñez; Joaquin Mora; Emilia Gómez

The cante100 dataset contains 100 tracks taken from the COFLA corpus. We defined 10 style families, and the dataset includes 10 tracks from each. In addition to the style family, we manually annotated the sections of each track in which vocals are present. We also provide a number of low-level descriptors and the fundamental frequency of the predominant melody for each track. The meta-information includes editorial metadata and the MusicBrainz ID.

Content:

  • README (5KB): Text file containing detailed descriptions of manual and automatic annotations.
  • meta-data (59KB): XML file containing meta-information: source (anthology name, CD no. and track no.), editorial metadata (artist name, title, style, MusicBrainz ID) and the manually annotated style family.
  • vocal sections (8.9MB): Text file (.csv) containing frame-wise vocal section annotations.
  • automatic transcriptions (375KB): Text files (.notes) and MIDI files (.mid) containing automatic note-level transcriptions of the singing voice.
  • Bark band energies (216.6MB): Text files (.csv) containing the frame-wise extracted Bark band energies.
  • predominant melody (33.5MB): Text files (.csv) containing the frame-wise extracted predominant melody.
  • low-level descriptors (42.9MB): Text files (.csv) containing a set of frame-wise extracted low-level features.
  • MFCCs (97.1MB): Text files (.csv) containing the frame-wise extracted mel-frequency cepstral coefficients (MFCCs).
  • Magnitude spectrum (3.85GB): Text files (.csv) containing the frame-wise extracted magnitudes of the discrete Fourier transform (DFT).
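
Most of the items above are frame-wise .csv files. As a minimal sketch of how such a file might be read, the helper below parses each row into a list of floats. The comma delimiter, the possible header row, and the function name `load_framewise_csv` are assumptions made for illustration; the README describes the actual layout of each file and should be treated as authoritative.

```python
import csv


def load_framewise_csv(path, delimiter=","):
    """Read a frame-wise CSV file into a list of rows of floats.

    Assumption: one frame per row, numeric values only. Rows that
    cannot be parsed as numbers (for example a header) are skipped.
    """
    frames = []
    with open(path, newline="") as handle:
        for row in csv.reader(handle, delimiter=delimiter):
            cells = [cell for cell in row if cell.strip() != ""]
            if not cells:
                continue  # skip blank lines
            try:
                frames.append([float(cell) for cell in cells])
            except ValueError:
                continue  # skip non-numeric rows such as headers
    return frames
```

For the largest archive (the magnitude spectrum), reading row by row in this way avoids relying on any third-party library, although holding all frames in memory at once may still be impractical.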

Publications

This work has been accepted for publication in the ACM Journal on Computing and Cultural Heritage and is currently available on arXiv.

N. Kroher, J. M. Díaz-Báñez, J. Mora and E. Gómez (2015): Corpus COFLA: A research corpus for the Computational study of Flamenco Music. arXiv:1510.04029 [cs.SD cs.IR].

https://doi.org/10.1145/2875428

Conditions of use

The provided datasets are offered free of charge for internal non-commercial use. We do not grant any rights for redistribution or modification. All data collections were gathered by the COFLA team.
© COFLA 2015. All rights reserved.

 

cante100 Audio

Files (4.2 GB)
Name                                  Size      MD5
cante100_automaticTranscription.zip   375.1 kB  47fea64c744f9fe678ae5642a8f0ee8e
cante100_bakbands.zip                 216.6 MB  397b5fb8dcbab45fd6074fd5e32ee54a
cante100_lowlevel.zip                 42.9 MB   0983e68c7facc0fe85863ce1a78d2fa7
cante100_mfccs.zip                    97.1 MB   3a934d39c20b37cc77acc398b46b8f5e
cante100_README.txt                   4.9 kB    184209b7e7d816fa603f0c7f481c0aae
cante100_spectrum.zip                 3.8 GB    0b81fe0fd7ab2c1adc1ad789edb12981
cante100_vocal_sections               8.9 MB    50aa325f81b73fceae817c8c40bef4c2
cante100Meta.xml                      59.0 kB   6cce186ce77a06541cdb9f0a671afb46
cante100midi_f0.zip                   33.5 MB   cce543b5125eda5a984347b55fdcd5e8
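
Each file in the listing above comes with an MD5 checksum, which can be used to confirm that a download completed without corruption. The following is a minimal sketch using Python's standard `hashlib` module; the helper names `md5_of_file` and `verify_download` are hypothetical, and the local path is assumed to be wherever the file was saved.

```python
import hashlib


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hexadecimal MD5 digest of a file, read in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # Read fixed-size chunks so multi-gigabyte archives fit in memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_download(path, expected_md5):
    """Return True if the file's MD5 digest matches the listed checksum."""
    return md5_of_file(path) == expected_md5.strip().lower()
```

For example, `verify_download("cante100_README.txt", "184209b7e7d816fa603f0c7f481c0aae")` should return `True` for an intact copy of the README, assuming it was saved under that name in the working directory.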