Using Activation Functions for Improving Measure-Level Audio Synchronization

Yigitcan Özer; Matej Ištvánek; Vlora Arifi-Müller; Meinard Müller

doi:10.5281/zenodo.7316772

There is a newer version of the record available.

Published December 4, 2022 | Version v1

Conference paper Open

Audio synchronization aims to align multiple recordings of the same piece of music. Traditional synchronization approaches are often based on dynamic time warping (DTW) using chroma features as the input representation. Previous work has shown how onset cues can be integrated into this pipeline to improve the alignment's temporal accuracy. Furthermore, recent work based on deep neural networks has led to significant improvements in learning onset, beat, and downbeat activation functions. However, for music with soft onsets and abrupt tempo changes, these functions may be unreliable, leading to unstable results. As the main contribution of this paper, we introduce a combined approach that integrates activation functions into the synchronization pipeline. We show that this approach improves temporal accuracy thanks to the activation cues while inheriting the robustness of the traditional synchronization approach. In experiments on string quartet recordings, we evaluate the combined approach by transferring measure annotations from a reference recording to a target recording.
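
To make the traditional baseline mentioned in the abstract concrete, the following is a minimal sketch of DTW-based alignment and annotation transfer. It is not the paper's implementation and does not include the activation-function integration. The function names (`cosine_cost`, `dtw_path`, `transfer_annotations`), the 10 Hz feature rate, and the synthetic one-hot "chroma" frames are all assumptions made for illustration.

```python
import numpy as np

def cosine_cost(X, Y):
    """Pairwise cosine distance between the columns of X (d x N) and Y (d x M)."""
    Xn = X / (np.linalg.norm(X, axis=0, keepdims=True) + 1e-9)
    Yn = Y / (np.linalg.norm(Y, axis=0, keepdims=True) + 1e-9)
    return 1.0 - Xn.T @ Yn

def dtw_path(C):
    """Classic DTW with step sizes (1,1), (1,0), (0,1); returns the optimal path."""
    N, M = C.shape
    D = np.full((N + 1, M + 1), np.inf)  # accumulated cost with an inf border
    D[0, 0] = 0.0
    for n in range(1, N + 1):
        for m in range(1, M + 1):
            D[n, m] = C[n - 1, m - 1] + min(D[n - 1, m - 1], D[n - 1, m], D[n, m - 1])
    # Backtrack from the last cell to the first cell.
    n, m = N, M
    path = [(n - 1, m - 1)]
    while (n, m) != (1, 1):
        candidates = [
            (D[n - 1, m - 1], n - 1, m - 1),
            (D[n - 1, m], n - 1, m),
            (D[n, m - 1], n, m - 1),
        ]
        _, n, m = min(candidates)
        path.append((n - 1, m - 1))
    return np.array(path[::-1])

def transfer_annotations(path, ref_times, feature_rate):
    """Map annotation times (seconds) from the reference to the target recording."""
    # Keep one target frame per reference frame so the mapping is strictly increasing.
    ref_idx, first = np.unique(path[:, 0], return_index=True)
    tgt_idx = path[first, 1]
    return np.interp(ref_times, ref_idx / feature_rate, tgt_idx / feature_rate)

# Synthetic example (hypothetical data): the target is the reference at half tempo.
feature_rate = 10.0                      # assumed frames per second
pitch_classes = [0, 2, 4, 5, 7, 9]
ref = np.zeros((12, len(pitch_classes)))
ref[pitch_classes, np.arange(len(pitch_classes))] = 1.0  # one-hot chroma-like frames
tgt = np.repeat(ref, 2, axis=1)          # every reference frame lasts twice as long

path = dtw_path(cosine_cost(ref, tgt))
ref_measures = np.array([0.0, 0.2, 0.4])  # assumed measure positions in the reference
transferred = transfer_annotations(path, ref_measures, feature_rate)
```

In this toy setting each reference frame maps to the first of its two target frames, so the transferred measure positions are twice the reference positions. Real recordings would require actual chroma features and, per the abstract, additional activation cues to sharpen the temporal accuracy.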

Files

Name	Size
000090.pdf md5:a564b81438b3465feed6eee0643a5647	948.2 kB

	All versions	This version
Views	163	45
Downloads	216	58
Data volume	214.6 MB	55.9 MB

Resource type

Conference paper

Publisher

ISMIR

Imprint

Proceedings of the 23rd International Society for Music Information Retrieval Conference, 749-755. Bengaluru, India.

Conference

International Society for Music Information Retrieval Conference (ISMIR 2022), Bengaluru, India, December 4-8, 2022

License: Creative Commons Attribution 4.0 International

The Creative Commons Attribution license allows re-distribution and re-use of a licensed work on the condition that the creator is appropriately credited.

Technical metadata

Created: November 13, 2022
Modified: July 15, 2024
