Multi-Task Learning of Graph-based Inductive Representations of Music Content

Antonia Saravanou; Federico Tomasi; Rishabh Mehrotra; Mounia Lalmas

doi:10.5281/zenodo.5624379

Published November 7, 2021 | Version v1

Conference paper | Open access

Music streaming platforms rely heavily on learning meaningful representations of tracks to surface apt recommendations to users across a number of different use cases. In this work, we consider the task of learning music track representations by leveraging three rich, heterogeneous sources of information: (i) organizational information (e.g., playlist co-occurrence), (ii) content information (e.g., audio and acoustics), and (iii) music stylistics (e.g., genre). We advocate for a multi-task formulation of graph representation learning and propose MUSIG: Multi-task Sampling and Inductive learning on Graphs. MUSIG allows us to derive generalized track representations that combine the benefits offered by (i) the inductive graph-based framework, which generates embeddings by sampling and aggregating features from a node's local neighborhood, and (ii) multi-task training of aggregation functions, which ensures that the learnt functions perform well on a number of important tasks. We present large-scale empirical results for track recommendation on the playlist completion task and compare different classes of representation learning approaches, including collaborative filtering, word2vec, node embeddings, and graph embedding approaches. Our results demonstrate that considering content information (i.e., audio and acoustic features) is useful and that multi-task supervision helps learn better representations.
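The sample-and-aggregate idea mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' MUSIG implementation: the toy graph, the feature vectors, the function names (`sample_neighbors`, `mean_vector`, `embed`) and the choice of a plain mean aggregator are all assumptions made for illustration, and the learned aggregation weights and multi-task supervision described in the paper are omitted.

```python
import random


def sample_neighbors(graph, node, k, rng):
    """Return up to k neighbors of `node`, sampled without replacement."""
    neighbors = graph.get(node, [])
    if len(neighbors) <= k:
        return list(neighbors)
    return rng.sample(neighbors, k)


def mean_vector(vectors, dim):
    """Element-wise mean of equal-length vectors (zeros if there are none)."""
    if not vectors:
        return [0.0] * dim
    return [sum(v[i] for v in vectors) / len(vectors) for i in range(dim)]


def embed(graph, features, node, k=2, seed=0):
    """One-hop embedding: the node's own features concatenated with the
    mean of the features of its sampled neighbors."""
    rng = random.Random(seed)
    own = features[node]
    sampled = sample_neighbors(graph, node, k, rng)
    aggregated = mean_vector([features[n] for n in sampled], len(own))
    return own + aggregated


# Hypothetical toy data: edges stand in for playlist co-occurrence,
# feature vectors stand in for audio/acoustic content features.
graph = {"t1": ["t2", "t3"], "t2": ["t1"], "t3": ["t1"], "t4": ["t1", "t2"]}
features = {
    "t1": [1.0, 0.0],
    "t2": [0.0, 1.0],
    "t3": [1.0, 1.0],
    "t4": [0.5, 0.5],
}

print(embed(graph, features, "t1"))
print(embed(graph, features, "t4"))
```

Because the embedding is computed from a node's features and sampled neighborhood rather than looked up in a fixed table, a track added after training (here, hypothetically, `t4`) can be embedded with the same function, which is what makes the approach inductive.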

Files

Name	Size
000075.pdf md5:7aafcbd422dbe6cfcfee95d3d151dfd8	604.5 kB

	All versions	This version
Views	2,859	2,830
Downloads	2,304	2,287
Data volume	1.6 GB	1.6 GB

Resource type

Conference paper

Publisher

ISMIR

Imprint

Proceedings of the 22nd International Society for Music Information Retrieval Conference, 602-609. Online.

Conference

International Society for Music Information Retrieval Conference (ISMIR 2021), Online, November 7-12, 2021

License: Creative Commons Attribution 4.0 International

The Creative Commons Attribution license allows re-distribution and re-use of a licensed work on the condition that the creator is appropriately credited.

Technical metadata

Created: October 30, 2021
Modified: July 17, 2024
