Main Melody Estimation with Source-Filter NMF and CRNN

Dogac Basaran; Slim Essid; Geoffroy Peeters

doi:10.5281/zenodo.1492349

Published September 23, 2018 | Version v1

Conference paper Open

Main Melody Estimation with Source-Filter NMF and CRNN

Estimating the main melody of a polyphonic audio recording remains a challenging task. We approach the task from a classification perspective and adopt a convolutional recurrent neural network (CRNN) architecture that relies on a particular form of pretraining by source-filter nonnegative matrix factorisation (NMF). The source-filter NMF decomposition is chosen for its ability to capture the pitch and timbre content of the leading voice/instrument, providing a better initial pitch salience than standard timefrequency representations. Starting from such a musically motivated representation, we propose to further enhance the NMF-based salience representations with CNN layers, then to model the temporal structure by an RNN network and to estimate the dominant melody with a final classification layer. The results show that such a system achieves state-of-the-art performance on the MedleyDB dataset without any augmentation methods or large training sets.

Files

273_Paper.pdf

Files (385.0 kB)

Name	Size	Download all
273_Paper.pdf md5:a22be8b6b73b37fb1065c7c69e2a2407	385.0 kB	Preview Download

296

Views

207

Downloads

Show more details

	All versions	This version
Views	296	296
Downloads	207	207
Data volume	83.9 MB	83.9 MB

More info on how stats are collected....

DOI

Resource type

Conference paper

Publisher

ISMIR

Imprint

Proceedings of the 19th International Society for Music Information Retrieval Conference, 82-89. Paris, France.

Conference

International Society for Music Information Retrieval Conference (ISMIR 2018) , Paris, France, September 23-27, 2018

License: Creative Commons Attribution 4.0 International

The Creative Commons Attribution license allows re-distribution and re-use of a licensed work on the condition that the creator is appropriately credited. Read more

Technical metadata

Created: November 20, 2018
Modified: August 2, 2024

Main Melody Estimation with Source-Filter NMF and CRNN

Creators

Description

Files

273_Paper.pdf

Files (385.0 kB)