Conference paper Open Access

#nowplaying-RS: A New Benchmark Dataset for Building Context-Aware Music Recommender Systems

Asmita Poddar; Eva Zangerle; Yi-Hsuan Yang

Music recommender systems can offer users personalized and contextualized recommendations and are therefore important for music information retrieval. An increasing number of datasets have been compiled to facilitate research on different topics, such as content-based, context-based, or next-song recommendation. However, these topics are usually addressed separately using different datasets, because no unified dataset contains a large variety of feature types such as item features, user contexts, and timestamps. To address this issue, we propose a large-scale benchmark dataset called #nowplaying-RS, which contains 11.6 million music listening events (LEs) of 139K users and 346K tracks collected from Twitter. The dataset comes with a rich set of item content features and user context features, as well as the timestamps of the LEs. Moreover, some of the user context features imply the cultural origin of the users, and others, such as hashtags, give clues to the emotional state of a user underlying an LE. In this paper, we provide statistics that give insight into the dataset and outline directions in which the dataset can be used for music recommendation. We also provide standardized training and test sets for experimentation, and baseline results obtained using factorization machines.

Files (844.8 kB)
smc-2018-nowplaying.pdf (844.8 kB), md5:799833f20b30e78762207513e1279558
