
Published July 2, 2018 | Version 1.0
Dataset Open

ISMIR04 Genre Identification task dataset

  • 1. Music Technology Group, Universitat Pompeu Fabra, Barcelona, Spain

Description

This is a collection of audio used for the Genre Identification task of the ISMIR 2004 audio description contest organized by the Music Technology Group (Universitat Pompeu Fabra). The audio for the task was collected from Magnatune, which contains a large amount of music licensed under Creative Commons licenses. The task of the contest was to classify a set of songs into genres, using the genre labels that Magnatune provided in their database.

Further information about the original contest and the contents of the dataset can be obtained from the following technical report:

Cano P, Gómez E, Gouyon F, Herrera P, Koppenberger M, Ong B, Serra X, Streich S, Wack N. ISMIR 2004 audio description contest. Barcelona: Universitat Pompeu Fabra, Music Technology Group; 2006. 20 p. Report No.: MTG-TR-2006-02

http://hdl.handle.net/10230/34013

The original contest website can be found at http://ismir2004.ismir.net/genre_contest/

The dataset contains audio tracks from the following 8 genres: classical, electronic, jazz & blues, metal, punk, rock, pop, world.

For the genre recognition contest, the data was grouped into 6 classes: classical, electronic, jazz-blues, metal-punk, rock-pop, world. In some cases two genres were merged into a single class. Note that the ground-truth files use these 6 classes, but in some cases the data is organised by original genre.
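Where the data is organised by original genre, the labels need to be folded into the 6 contest classes before they can be compared with the ground-truth files. The following is a minimal sketch of that mapping. The genre spellings used as dictionary keys are assumptions, as is the underscore form of the class names (taken from the file counts listed in the Audio section); check both against the folder names and tracklist files in the archive.

```python
# Hypothetical genre spellings; the labels actually used in the archive may differ.
GENRE_TO_CLASS = {
    "classical": "classical",
    "electronic": "electronic",
    "jazz & blues": "jazz_blues",
    "metal": "metal_punk",
    "punk": "metal_punk",
    "rock": "rock_pop",
    "pop": "rock_pop",
    "world": "world",
}


def to_contest_class(genre: str) -> str:
    """Map one of the 8 original genres to one of the 6 contest classes."""
    # Normalise case and surrounding whitespace before the lookup.
    return GENRE_TO_CLASS[genre.strip().lower()]
```

An unknown genre raises a KeyError, which is usually preferable to silently assigning a default class.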

Audio

The audio is in MP3 format. It is divided into three folders, representing different subsets of the collection. Each folder has 729 files, split into classes. The number of files in each category reflects the proportion of files in each category in Magnatune when the dataset was created. No track appears in more than one folder.

  • Training: files for generating a classification model, arranged by class.

  • Development: a separate set of files for participants to test their model against.

  • Evaluation: originally a private subset, the files used to evaluate the accuracy of all submitted models.

The training and development sets each consist of:

  • classical: 320 files

  • electronic: 115 files

  • jazz_blues: 26 files

  • metal_punk: 45 files

  • rock_pop: 101 files

  • world: 122 files

The evaluation set consists of 729 tracks with a similar distribution.
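Because the classes are heavily imbalanced, it can help to keep the class proportions and the majority-class baseline in mind when reading accuracy figures. The sketch below uses only the counts listed above; the function names are illustrative and not part of the dataset.

```python
# File counts per class for the training and development sets, as listed above.
CLASS_COUNTS = {
    "classical": 320,
    "electronic": 115,
    "jazz_blues": 26,
    "metal_punk": 45,
    "rock_pop": 101,
    "world": 122,
}


def class_proportions(counts):
    """Return the fraction of files that belongs to each class."""
    total = sum(counts.values())
    return {name: n / total for name, n in counts.items()}


def majority_baseline(counts):
    """Accuracy obtained by always predicting the most frequent class."""
    return max(counts.values()) / sum(counts.values())


if __name__ == "__main__":
    print("total files:", sum(CLASS_COUNTS.values()))
    for name, share in class_proportions(CLASS_COUNTS).items():
        print(f"{name}: {share:.1%}")
    print(f"majority-class baseline: {majority_baseline(CLASS_COUNTS):.1%}")
```

The counts sum to 729, and a classifier that always answers "classical" would already be right on 320 of 729 files, roughly 44%. A class-weighted metric may therefore be more informative than raw accuracy.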

Metadata

Each folder of audio has a corresponding folder containing metadata for the files in that folder. The metadata is stored in a file, tracklist.csv, which has the following headers:

class, artist, album, track, track number, file path

The evaluation tracklist file has an additional column containing the Magnatune track ID of the recording.

Due to the way that the data was collected and distributed for the challenge, the metadata for the development subset is anonymised.
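A tracklist file can be read with Python's standard csv module. This is a minimal sketch that assumes the file is comma-separated, UTF-8 encoded, and starts with a header row using the column names listed above; none of these details are confirmed here, so adjust them after inspecting the actual file. The function names are illustrative.

```python
import csv
from collections import Counter


def load_tracklist(path):
    """Read a tracklist.csv file into a list of dicts keyed by header name."""
    # Assumption: UTF-8 text with a header row; change the encoding if needed.
    with open(path, newline="", encoding="utf-8") as f:
        # skipinitialspace tolerates "class, artist, ..." style separators.
        reader = csv.DictReader(f, skipinitialspace=True)
        return [
            {key.strip(): value for key, value in row.items() if key is not None}
            for row in reader
        ]


def count_classes(rows):
    """Count how many tracks are listed for each class."""
    return Counter(row["class"] for row in rows)
```

For example, `count_classes(load_tracklist("tracklist.csv"))` gives a per-class file count that can be compared with the figures in the Audio section. The path is a placeholder for wherever the archive was extracted.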

Licensing

The audio is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 1.0 license (https://creativecommons.org/licenses/by-nc-sa/1.0/).

Using this dataset

We would highly appreciate it if scientific publications of work partly based on this dataset cite the technical report listed above.

We are interested in knowing if you find our datasets useful! If you use our dataset, please email us at mtg-info@upf.edu and tell us about your research.

Notes

Licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 1.0 Generic license

Files

ismir04_genre.zip (8.6 GB)
md5:2b890b8cbebcd080dd02416cfce0dfba
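Given the size of the archive, it is worth checking the download against the published MD5 checksum before extracting it. A minimal sketch using Python's hashlib follows; the function names and the local file path are assumptions, while the checksum is the one listed above.

```python
import hashlib

# MD5 checksum published for ismir04_genre.zip on this page.
EXPECTED_MD5 = "2b890b8cbebcd080dd02416cfce0dfba"


def md5_of_file(path, chunk_size=1 << 20):
    """Compute the MD5 hex digest of a file, reading it in 1 MiB chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as f:
        # Read in chunks so the 8.6 GB archive never has to fit in memory.
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_download(path, expected=EXPECTED_MD5):
    """Return True if the file at `path` matches the expected checksum."""
    return md5_of_file(path) == expected.lower()
```

Calling `verify_download("ismir04_genre.zip")` returns False for a truncated or corrupted download, in which case the file should be fetched again.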

Additional details

Related works

Cites
10230/34013 (Handle)