MSD-I: Million Song Dataset with Images for Multimodal Genre Classification

Sergio Oramas

doi:10.5281/zenodo.1240485

Published May 3, 2018 | Version v1

Dataset Open

MSD-I: Million Song Dataset with Images for Multimodal Genre Classification

Sergio Oramas¹

1. Universitat Pompeu Fabra

The Million Song Dataset (https://labrosa.ee.columbia.edu/millionsong/) is a collection of metadata and precomputed audio features for 1 million songs. Along with this dataset, a dataset with annotations of 15 top-level genres with a single label per song was released. In our work, we combine the CD2c version of this genre datase (http://www.tagtraum.com/msd_genre_datasets.html) with a collection of album cover images.

The final dataset contains 30,713 tracks from the MSD and their related album cover images, each annotated with a unique genre label among 15 classes. Based on an initial analysis on the images, we identified that this set of tracks is associated to 16,753 albums, yielding an average of 1.8 songs per album.

We randomly divide the dataset into three parts: 70% for training, 15% for validation, and 15% for test, with no artist and album overlap across these sets. This is crucial to avoid possible overfitting, as the classifier may learn to predict the artist instead of the genre.

Content:

MSD-I dataset (mapping, metadata, annotations and links to images)
Data splits and feature vectors for TISMIR single-label classification experiments

These data can be used together with the Tartarus deep learning python module https://github.com/sergiooramas/tartarus.

Scientific References:

Please cite the following paper if using MSD-I dataset or Tartarus software.

Oramas, S., Barbieri, F., Nieto, O., and Serra, X (2018). Multimodal Deep Learning for Music Genre Classification, Transactions of the International Society for Music Information Retrieval, V(1).

Files

README.txt

Files (294.3 MB)

Name	Size	Download all
MSD-I_dataset.tsv md5:de600ecf50df88924f580cfa513caab7	4.2 MB	Download
msdi-data.tar.gz md5:67a74bbe8621f808020c5085e512ab9c	290.1 MB	Download
README.txt md5:aca88f7b7c21b923228d62213016277d	1.7 kB	Preview Download

	All versions	This version
Views	3,341	3,323
Downloads	1,257	1,239
Data volume	175.9 GB	174.1 GB

MSD-I: Million Song Dataset with Images for Multimodal Genre Classification

Authors/Creators

Description

Files

README.txt

Files (294.3 MB)