Published November 30, 2022 | Version v1
Dataset Open

Learning Unsupervised Hierarchies of Audio Concepts

  • 1. Deezer Research
  • 2. MLIA, Sorbonne-Université

Description

Dataset of our paper "Learning Unsupervised Hierarchies of Audio Concepts" published at the ISMIR 2022 conference.

For usage examples, please refer to our code repository at github.com/deezer/concept_hierarchy.

Paper abstract

Music signals are difficult to interpret from their low-level features, perhaps even more than images: e.g. highlighting part of a spectrogram or an image is often insufficient to convey high-level ideas that are genuinely relevant to humans. In computer vision, concept learning was therein proposed to adjust explanations to the right abstraction level (e.g. detect clinical concepts from radiographs). These methods have yet to be used for MIR.
In this paper, we adapt concept learning to the realm of music, with its particularities. For instance, music concepts are typically non-independent and of mixed nature (e.g. genre, instruments, mood), unlike previous work that assumed disentangled concepts. We propose a method to learn numerous music concepts from audio and then automatically hierarchise them to expose their mutual relationships. We conduct experiments on datasets of playlists from a music streaming service, serving as a few annotated examples for diverse concepts. Evaluations show that the mined hierarchies are aligned with both ground-truth hierarchies of concepts -- when available -- and with proxy sources of concept similarity in the general case.

Citation

If you use this material, please consider citing our work with the following:

@inproceedings{afchar2022learning,
  title={Learning Unsupervised Hierarchies of Audio Concepts},
  author={Afchar, Darius and Hennequin, Romain and Guigue, Vincent},
  booktitle={International Society of Music Information Retrieval Conference (ISMIR)},
  year={2022}
}

How can I send you a love letter?

You can contact us at research@deezer.com.

Files

deezer_anonymised_tracks.zip

Files (2.6 GB)

Name Size Download all
md5:4343b7c10ca617f2a75b5dcf55a8738c
2.6 GB Preview Download
md5:0e1a1af2a781d66b644855cda32533ac
5.7 MB Download
md5:f0c20a3dfbf967ef2c51e21c3acdc2b4
1.4 kB Download

Additional details

References

  • arXiv:2207.11231