Published October 11, 2020 | Version v1
Dataset Open

MGD: Music Genre Dataset

Description

MGD: Music Genre Dataset

Over recent years, the world has seen a dramatic change in the way people consume music, moving from physical records to streaming services. Since 2017, such services have become the main source of revenue within the global recorded music market.
Therefore, this dataset is built by using data from Spotify. It provides a weekly chart of the 200 most streamed songs for each country and territory it is present, as well as an aggregated global chart. 

Considering that countries behave differently when it comes to musical tastes, we use chart data from global and regional markets from January 2017 to December 2019, considering eight of the top 10 music markets according to IFPI: United States (1st), Japan (2nd), United Kingdom (3rd), Germany (4th), France (5th), Canada (8th), Australia (9th), and Brazil (10th).

We also provide information about the hit songs and artists present in the charts, such as all collaborating artists within a song (since the charts only provide the main ones) and their respective genres, which is the core of this work. MGD also provides data about musical collaboration, as we build collaboration networks based on artist partnerships in hit songs. Therefore, this dataset contains:

  • Genre Networks: Success-based genre collaboration networks
  • Genre Mapping: Genre mapping from Spotify genres to super-genres
  • Artist Networks: Success-based artist collaboration networks
  • Artists: Some artist data
  • Hit Songs: Hit Song data and features
  • Charts: Enhanced data from Spotify Weekly Top 200 Charts

This dataset was originally built for a conference paper at ISMIR 2020. If you make use of the dataset, please also cite the following paper:

Gabriel P. Oliveira, Mariana O. Silva, Danilo B. Seufitelli, Anisio Lacerda, and Mirella M. Moro. Detecting Collaboration Profiles in Success-based Music Genre Networks. In Proceedings of the 21st International Society for Music Information Retrieval Conference (ISMIR 2020), 2020.

@inproceedings{ismir/OliveiraSSLM20,
  title = {Detecting Collaboration Profiles in Success-based Music Genre Networks},
  author = {Gabriel P. Oliveira and 
            Mariana O. Silva and 
            Danilo B. Seufitelli and 
            Anisio Lacerda and
            Mirella M. Moro},
  booktitle = {21st International Society for Music Information Retrieval Conference}
  pages = {726--732},
  year = {2020}
}

 

Files

artist_data.zip

Files (16.5 MB)

Name Size Download all
md5:bcd46bed2c0d0ed833c88ededb0e678c
473.0 kB Preview Download
md5:d708b2e60d3e4ff2d5269268c75628fe
422.6 kB Preview Download
md5:6bcab1ccdb027198b6128e89bacf0db9
13.6 MB Preview Download
md5:b4e6dd67eaee2b6a86553bc350d1517c
6.1 kB Preview Download
md5:33c447eb5fece9b4ab937a573a34f661
611.6 kB Preview Download
md5:3a42f89f95acef3939638e8876bbe7cb
1.4 MB Preview Download