Dataset Open Access
This is a subset of the LFM-1b LastFM dataset (http://www.cp.jku.at/datasets/LFM-1b/), which consists of the listening events (user_events.txt) for three user groups: 1,000 low-mainstream users (low_main_users.txt), 1,000 medium-mainstream users (medium_main_users.txt) and 1,000 high-mainstream users (high_main_users.txt). The mainstreaminess definition used here is the "M_global_R_APC" one from this paper: https://arxiv.org/ftp/arxiv/papers/1912/1912.06933.pdf
The format of the three user files is "user , mainstreaminess_value"
The format of the user-events file is "user \t artist \t album \t track \t timestamp"
Example Python-code for analyzing this dataset as well as more information on the user groups based on mainstreaminess can be found here: https://github.com/domkowald/LFM_processing
Name | Size | |
---|---|---|
high_main_users.txt
md5:a51b31fc832d94a6addd5e7bca5ddad9 |
20.7 kB | Download |
low_main_users.txt
md5:b186bffd4a44fa67bdff0cedb7169d39 |
21.0 kB | Download |
medium_main_users.txt
md5:7cf8f20d1f2e9322ce5cbfcf898c2f60 |
20.6 kB | Download |
user_events.txt
md5:8777a465c6acc943e608b4cd865954ad |
1.1 GB | Download |
All versions | This version | |
---|---|---|
Views | 1,107 | 1,107 |
Downloads | 1,219 | 1,219 |
Data volume | 702.3 GB | 702.3 GB |
Unique views | 977 | 977 |
Unique downloads | 687 | 687 |