User-aware music auto-tagging with contextual tags

doi:10.5281/zenodo.3961560

Published October 12, 2020 | Version v1

Dataset Open

User-aware music auto-tagging with contextual tags

1. Telecom Paris
2. Deezer

This is a user-aware music dataset labeled with the contextual use of each track according to each user. The dataset is composed of 10 contextual tags extracted based on user's usage through created playlists in the Deezer catalog. The tags are: " car, gym, happy, night, relax, running, sad, summer, work, workout". For each track/user pair, a contextual tag is associated with it indicating that the user listens to the track in the associated context. Additionally, the users are represented as embeddings based on their listening history computed through the matrix factorization of the user/track matrix.

The creation of the dataset and the baseline of our auto-tagging model is described in the paper: Karim M. Ibrahim, Elena V. Epure, Geoffroy Peeters, and Gaël Richard. "Should we consider the users in contextual music auto-tagging models?" 21st International Society for Music Information Retrieval Conference (ISMIR). 2020. The source code of the paper is available here: https://github.com/KarimMibrahim/user-aware-music-autotagging

The dataset is composed of the SONG_ID which is the ID of the track in the Deezer catalog. Each track/user pair is labeled with each tag as either 1 (indicating a track's presence in the context) or 0 (indicating a track's absence). The 30 seconds track previews used to train the model in the paper can be accessed through the Deezer API: https://developers.deezer.com/api. Each user is represented with an anonymized USER_ID which is associated with the user embedding available in the user_embeddings.csv file.

Files

test_set.csv

Files (370.8 MB)

Name	Size	Download all
test_set.csv md5:a5b3230da305906f8eabf1f345e1dfb7	1.8 MB	Preview Download
train_set.csv md5:78ebc69ed919f3c23700bc145196d499	3.6 MB	Preview Download
user_embeddings.csv md5:920abe59c3b2b7b57e79246554057d4f	364.3 MB	Preview Download
validation_set.csv md5:96c6e6a0728ba53cf3788250bcab1ba5	1.1 MB	Preview Download

Additional details

MIP-Frontiers – New Frontiers in Music Information Processing 765068: European Commission

	All versions	This version
Views	630	629
Downloads	202	202
Data volume	12.1 GB	12.1 GB

User-aware music auto-tagging with contextual tags

Creators

Description

Files

test_set.csv

Files (370.8 MB)

Additional details

Funding