Datasets from the KDD 2021 article "A Semi-Personalized System for User Cold Start Recommendation on Music Streaming Apps"
- 1. Deezer Research
Description
We publicly release the anonymized song_embeddings.parquet user_embeddings.parquet user_features_test.parquet user_features_train.parquet user_features_validation.parquet datasets, with each of the TT-SVD or UT-ALS versions of embeddings, from the music streaming platform Deezer, as described in the article "A Semi-Personalized System for User Cold Start Recommendation on Music Streaming Apps" published in the proceedings of the 27TH ACM SIGKDD conference on knowledge discovery and data mining (KDD 2021). The paper is available here.
These datasets are used in the GitHub repository deezer/semi_perso_user_cold_start to reproduce experiments from the article.
Please cite our paper if you use our code or data in your work.
Files
Files
(3.5 GB)
Name | Size | Download all |
---|---|---|
md5:b430c50686c0e2dfb4c0aadbc916f636
|
129.7 MB | Download |
md5:c5f8843ea95bbedd1c36b64da55b8afd
|
427.2 MB | Download |
md5:825213114a7ba070af520cd584619264
|
161.4 MB | Download |
md5:c192166a5e4b4a4fd742e6ec03415785
|
82.5 MB | Download |
md5:b71349d6c756bb929e3a7803688df7d0
|
1.4 GB | Download |
md5:59a1f3e85e8cfd6903491741386807fd
|
733.9 MB | Download |
md5:bb1965628b4054526c2c7c6df83b26bd
|
320.1 MB | Download |
md5:6a84bea5d9f3332cefee0fe3ac0c7f9d
|
163.4 MB | Download |