Deezer listening events dataset
Creators
Contributors
Researchers:
Description
What does this dataset contain?
This dataset contains over 700 million time-stamped listening events collected from 3.4M anonymised users on the music streaming service Deezer, occurred between March and August 2022. It includes 50k anonymised songs, among the most popular ones on the service as well as their pre-trained embedding vectors, calculated by our internal model. All files are in parquet format which could be read by using pandas.read_parquet
function.
What could this dataset be used for?
This dataset could be used for collaborative filtering as well as sequential recommendation (including both next-item and next-session recommendations).
Citation
If you use this dataset, please cite following paper:
@inproceedings{tran-recsys2024, title={Transformers Meet ACT-R: Repeat-Aware and Sequential Listening Session Recommendation}, author={Viet-Anh Tran, Guillaume Salha-Galvan, Bruno Sguerra and Romain Hennequin}, booktitle = {Proceedings of the 18th ACM Conference on Recommender Systems}, year = {2024} }
Files
Files
(7.8 GB)
Name | Size | Download all |
---|---|---|
md5:7e2990b694e056d260139f09c9c72243
|
7.8 GB | Download |