Published September 25, 2020 | Version v1
Dataset Open

Last-fm User and Artist Gender Repository

Authors/Creators

  • 1. Universitat Pompeu Fabra

Description

Dataset of Last.fm listening events annotated with gender labels for both users and artists. The dataset was generated to accompany the paper 'Exploring Artist Gender Bias in Music Recommendation', submitted to the 2nd Workshop on the Impact of Recommender Systems with ACM RecSys 2020.

The dataset is formed from two Last.fm datasets:

  • Schedl's LFM-1b: LFM-1b-Le75.csv, LFM1b-MB-artists.txt
  • Celma's LFM-360k: LastFM360k-Le75.txt, LastFM360k-MB-artists.txt

Artist gender data is retrieved via a data wrangler configured to query a locally hosted copy of the music encyclopedia MusicBrainz. The code repository is openly available at the following link to support reproducibility: https://github.com/dshakes90/LFM-1b-MusicBrainz-Gender-Wrangler

Files


Files (1.2 GB)

Checksum                                Size
md5:f9dc8b8e19c48a6ed0a68ca643c7ae9d    99.5 MB
md5:76295f58494ec3cf3a872369c3a324e7    16.5 MB
md5:32d6fef1d97347eee378513355c4402e    954.7 MB
md5:0cbda3182982d053e5d1e1bb98b51daf    131.6 MB
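
The MD5 checksums listed above can be used to confirm that a download completed without corruption. The following is a minimal sketch in Python using only the standard library. The helper names `md5_of_file` and `matches_listed_checksum` are hypothetical and are not part of the dataset or its code repository.

```python
import hashlib


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, read in chunks to limit memory use."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # Read fixed-size chunks until read() returns an empty bytes object.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listed_checksum(path, listed):
    """Compare a file's MD5 against a listed value such as 'md5:<hex digest>'.

    The 'md5:' prefix is optional; comparison ignores case and whitespace.
    """
    expected = listed.strip().split(":", 1)[-1].strip().lower()
    return md5_of_file(path) == expected
```

To use it, call `matches_listed_checksum(path_to_downloaded_file, listed_value)` with the checksum shown for that file on the record page. The listing above does not show which checksum belongs to which file, so the pairing has to be taken from the record page itself.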