Improving Peak-picking Using Multiple Time-step Loss Functions

Carl Southall; Ryan Stables; Jason Hockman

doi:10.5281/zenodo.1492409

Published September 23, 2018 | Version v1

Conference paper | Open access

The majority of state-of-the-art methods for music information retrieval (MIR) tasks now utilise deep learning methods reliant on minimisation of loss functions such as cross entropy. For tasks that include framewise binary classification (e.g., onset detection, music transcription), classes are derived from output activation functions by identifying points of local maxima, or peaks. However, the operating principles behind peak-picking differ from those of the cross entropy loss function, which minimises the absolute difference between the output and target values for a single frame. To generate activation functions more suited to peak-picking, we propose two versions of a new loss function that incorporates information from multiple time-steps: 1) multi-individual, which uses multiple individual time-step cross entropies; and 2) multi-difference, which directly compares the difference between sequential time-step outputs. We evaluate the newly proposed loss functions alongside standard cross entropy in the popular MIR tasks of onset detection and automatic drum transcription. The results highlight the effectiveness of these loss functions in improving overall system accuracy for both MIR tasks. Additionally, directly comparing the output from sequential time-steps in the multi-difference approach achieves the highest performance.
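
The contrast the abstract draws between single-frame cross entropy and peak-picking can be illustrated with a small sketch. The abstract does not give the paper's formulas, so everything below is an assumption made for illustration only: the function names, the squared-difference penalty used in `multi_difference_loss`, the `weight` parameter, and the simple thresholded local-maximum rule in `pick_peaks` are hypothetical and should not be read as the authors' formulation.

```python
import math

EPS = 1e-7  # keeps log() away from zero


def cross_entropy(y, t):
    """Binary cross entropy for one frame: output y in (0, 1), target t in {0, 1}."""
    y = min(max(y, EPS), 1.0 - EPS)
    return -(t * math.log(y) + (1 - t) * math.log(1.0 - y))


def single_step_loss(outputs, targets):
    """Standard framewise loss: mean cross entropy, each frame scored in isolation."""
    return sum(cross_entropy(y, t) for y, t in zip(outputs, targets)) / len(outputs)


def multi_difference_loss(outputs, targets, weight=1.0):
    """Hypothetical multi time-step loss (an assumption, not the paper's formula):
    the single-step loss plus a penalty on how far the change between
    sequential output frames is from the change between sequential targets."""
    diff_penalty = sum(
        ((outputs[n] - outputs[n - 1]) - (targets[n] - targets[n - 1])) ** 2
        for n in range(1, len(outputs))
    ) / (len(outputs) - 1)
    return single_step_loss(outputs, targets) + weight * diff_penalty


def pick_peaks(activation, threshold=0.5):
    """Return indices of interior frames that are local maxima at or above threshold."""
    return [
        n
        for n in range(1, len(activation) - 1)
        if activation[n] >= threshold
        and activation[n] > activation[n - 1]
        and activation[n] >= activation[n + 1]
    ]


# One onset at frame 2; a sharply peaked activation and a flat, plateau-like one.
targets = [0, 0, 1, 0, 0]
sharp = [0.1, 0.1, 0.9, 0.1, 0.1]
flat = [0.1, 0.6, 0.6, 0.6, 0.1]

sharp_peaks = pick_peaks(sharp)
flat_peaks = pick_peaks(flat)
```

In this toy case the plateau in `flat` yields a peak one frame early, and the difference term penalises that plateau in addition to the framewise error, which is the kind of behaviour a loss spanning multiple time-steps is meant to encourage.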

Files

25_Paper.pdf (4.7 MB, md5:612bbdec851966ab6769b5dce0ea445f)

Resource type

Conference paper

Publisher

ISMIR

Imprint

Proceedings of the 19th International Society for Music Information Retrieval Conference, 313-320. Paris, France.

Conference

International Society for Music Information Retrieval Conference (ISMIR 2018), Paris, France, September 23-27, 2018

Creative Commons Attribution 4.0 International

The Creative Commons Attribution license allows re-distribution and re-use of a licensed work on the condition that the creator is appropriately credited.

Technical metadata

Created: November 20, 2018
Modified: August 2, 2024
