Dataset Open Access

User-aware music auto-tagging with contextual tags

Karim M. Ibrahim; Elena V. Epure; Geoffroy Peeters; Gaël Richard

This is a user-aware music dataset labeled with the contextual use of each track according to each user. The dataset is composed of 10 contextual tags extracted based on user's usage through created playlists in the Deezer catalog. The tags are: " car, gym, happy, night, relax, running, sad, summer, work, workout". For each track/user pair, a contextual tag is associated with it indicating that the user listens to the track in the associated context. Additionally, the users are represented as embeddings based on their listening history computed through the matrix factorization of the user/track matrix.

The creation of the dataset and the baseline of our auto-tagging model is described in the paper: Karim M. Ibrahim, Elena V. Epure, Geoffroy Peeters, and Gaël Richard. "Should we consider the users in contextual music auto-tagging models?" 21st International Society for Music Information Retrieval Conference (ISMIR). 2020. The source code of the paper is available here: https://github.com/KarimMibrahim/user-aware-music-autotagging

The dataset is composed of the SONG_ID which is the ID of the track in the Deezer catalog. Each track/user pair is labeled with each tag as either 1 (indicating a track's presence in the context) or 0 (indicating a track's absence). The 30 seconds track previews used to train the model in the paper can be accessed through the Deezer API: https://developers.deezer.com/api. Each user is represented with an anonymized USER_ID which is associated with the user embedding available in the user_embeddings.csv file. 

Files (370.8 MB)
Name Size
test_set.csv
md5:a5b3230da305906f8eabf1f345e1dfb7
1.8 MB Download
train_set.csv
md5:78ebc69ed919f3c23700bc145196d499
3.6 MB Download
user_embeddings.csv
md5:920abe59c3b2b7b57e79246554057d4f
364.3 MB Download
validation_set.csv
md5:96c6e6a0728ba53cf3788250bcab1ba5
1.1 MB Download
179
153
views
downloads
All versions This version
Views 179179
Downloads 153153
Data volume 7.5 GB7.5 GB
Unique views 139139
Unique downloads 102102

Share

Cite as