Published October 29, 2018 | Version v1
Dataset Open

MediaEval AcousticBrainz Genre

  • 1. Music Technology Group, Universitat Pompeu Fabra
  • 2. Multimedia Computing Group, Delft University of Technology
  • 3. tagtraum industries incorporated

Description

The AcousticBrainz Genre Dataset consists of four datasets of genre annotations and music features extracted from audio suited for evaluation of hierarchical multi-label genre classification systems.

The datasets are used within the MediaEval AcousticBrainz Genre Task. The task is focused on content-based music
genre recognition using genre annotations from multiple sources and large-scale music features data available in the AcousticBrainz database. The goal of our task is to explore how the same music pieces can be annotated differently by different communities following different genre taxonomies, and how this should be addressed by content-based genre recognition systems.

We provide four datasets containing genre and subgenre annotations extracted from four different online metadata sources:

  • AllMusic and Discogs are based on editorial metadata databases maintained by music experts and enthusiasts. These sources contain explicit genre/subgenre annotations of music releases (albums) following a predefined genre namespace and taxonomy. We propagated release-level annotations to recordings (tracks) in AcousticBrainz to build the datasets.

  • Lastfm and Tagtraum are based on collaborative music tagging platforms with large amounts of genre labels provided by their users for music recordings (tracks). We have automatically inferred a genre/subgenre taxonomy and annotations from these labels.

For details on format and contents, please refer to the data webpage.

Note, that the AllMusic ground-truth annotations are distributed separately at https://zenodo.org/record/2554044.

 

Citation

If you use the MediaEval AcousticBrainz Genre dataset or part of it, please cite our ISMIR 2019 overview paper:

Bogdanov, D., Porter A., Schreiber H., Urbano J., & Oramas S. (2019).
The AcousticBrainz Genre Dataset: Multi-Source, Multi-Level, Multi-Label, and Large-Scale. 
20th International Society for Music Information Retrieval Conference (ISMIR 2019).

 

Acknowledgements

This work is partially supported by the European Union’s Horizon 2020 research and innovation programme under grant agreement No 688382 AudioCommons.

 

Files

Files (36.8 GB)

Name Size Download all
md5:062fc31d1d74ee9f36718eb2a934be79
37.4 MB Download
md5:1b9ae2055c3b4b32c5219ee93992de9e
7.8 MB Download
md5:db7157b5112022d609652dd21c632090
3.8 GB Download
md5:79581967a1be5c52e83be21261d1ef6c
3.8 GB Download
md5:0e48fa319fa48e5cf95eea8118d2e882
3.8 GB Download
md5:22ca7f1fea8a86459b7fda4530f00070
3.8 GB Download
md5:c6e4a2ef1b0e8ed535197b868f8c7302
3.8 GB Download
md5:513d5f306dd4f3799c137423ee444051
3.8 GB Download
md5:422d75d70d583decec0b2761865092a7
3.7 GB Download
md5:021ab25a5fd1b020521824e7fce9c775
3.7 GB Download
md5:f21f9c5e398713139cca9790b656faf9
3.3 GB Download
md5:34f47394ac6d8face4399f48e2b98ebe
3.2 GB Download
md5:797a4d6dfcdbe9c405f336225a97fd9e
23.1 MB Download
md5:9fc1ced2b4317993ad91fd59e718f7ef
4.7 MB Download
md5:91c1f0a325e8972dc63a02f8632b52be
19.8 MB Download
md5:3ff8d4dec1d43bdc791b8a502eedba5b
4.0 MB Download

Additional details

Related works

Is documented by
10230/35744 (Handle)

Funding

AudioCommons – Audio Commons: An Ecosystem for Creative Reuse of Audio Content 688382
European Commission