Published September 25, 2020
| Version v1
Dataset
Open
Last-fm User and Artist Gender Repository
Description
Dataset consisting of Last.fm listening events data with annotated gender labels for both users and artists. The dataset is generted to acompony the paper 'Exploring Artist Gender Bias in Music Recommendation' submitted to the 2nd Workshop on the Impact of Recommender Systems with ACM RecSys 2020.
The dataset is formed from two Last.fm datasets:
- Schedl's Lfm-1b - LFM-1b-Le75.csv, LFM1b-MB-artists.txt
- Celma's Lfm-360k - LastFM360k-Le75.txt, LastFM360k-MB-artists.txt
Artist gender data is recovered via a datawrangler configured to retrieve data from a locally configured version of the music ensicolopedia, MusicBrainz. Code repositories are made openly availible at the following link to elicit reproducibility: https://github.com/dshakes90/LFM-1b-MusicBrainz-Gender-Wrangler