A Cross-Version Approach to Audio Representation Learning for Orchestral Music

Michael Krause; Christof Weiß; Meinard Müller

doi:10.5281/zenodo.10265419

Published November 4, 2023 | Version v1

Conference paper (open access)

Deep learning systems have become popular for tackling a variety of music information retrieval tasks. However, these systems often require large amounts of labeled data for supervised training, which can be very costly to obtain. To alleviate this problem, recent papers on learning music audio representations employ alternative training strategies that utilize unannotated data. In this paper, we introduce a novel cross-version approach to audio representation learning that can be used with music datasets containing several versions (performances) of a musical work. Our method exploits the correspondences that exist between two versions of the same musical section. We evaluate our proposed cross-version approach qualitatively and quantitatively on complex orchestral music recordings and show that it can better capture aspects of instrumentation compared to techniques that do not use cross-version information.

Resource type: Conference paper
Publisher: ISMIR
Imprint: Proceedings of the 24th International Society for Music Information Retrieval Conference, 832-839. Milan, Italy.
Conference: International Society for Music Information Retrieval Conference (ISMIR 2023), Milan, Italy, November 5-9, 2023

License: Creative Commons Attribution 4.0 International

The Creative Commons Attribution license allows re-distribution and re-use of a licensed work on the condition that the creator is appropriately credited.

Technical metadata

Created: December 5, 2023
Modified: July 10, 2024
