VOCANO: A note transcription framework for singing voice in polyphonic music

Jui-Yang Hsu; Li Su

doi:10.5281/zenodo.5624383

Published November 7, 2021 | Version v1

Conference paper | Open

The high variability of the singing voice and the scarcity of note event annotations present a major bottleneck in singing voice transcription (SVT). In this paper, we present VOCANO, an open-source VOCAl NOte transcription framework built upon robust neural networks with multi-task and semi-supervised learning. Building on a state-of-the-art SVT method, we further apply virtual adversarial training (VAT), a semi-supervised learning (SSL) method, to SVT on both clean and accompanied singing voice data, the latter being pre-processed with a singing voice separation (SVS) technique. The proposed framework outperforms the state of the art on public benchmarks across a wide variety of evaluation metrics. We also discuss how the types of training models and the sizes of the unlabeled datasets affect SVT performance.

Files

Name	Size
000036.pdf (md5:3649cec177dc05cffb96335d5e165e7f)	662.6 kB

	All versions	This version
Views	379	378
Downloads	293	293
Data volume	209.4 MB	209.4 MB

DOI

10.5281/zenodo.5624383

Resource type

Conference paper

Publisher

ISMIR

Imprint

Proceedings of the 22nd International Society for Music Information Retrieval Conference, 293-300. Online.

Conference

International Society for Music Information Retrieval Conference (ISMIR 2021), Online, November 7-12, 2021

License: Creative Commons Attribution 4.0 International

The Creative Commons Attribution license allows re-distribution and re-use of a licensed work on the condition that the creator is appropriately credited.

Technical metadata

Created: October 30, 2021
Modified: July 17, 2024
