Knowledge-based Probabilistic Modeling for Tracking Lyrics in Music Audio Signals

doi:10.5281/zenodo.841980

Published June 28, 2017 | Version v1

Thesis | Open access

Dzhambazov, Georgi¹

1. Music Technology Group, Universitat Pompeu Fabra, Spain

Supervisor:

Serra, Xavier¹

1. Universitat Pompeu Fabra, Barcelona, Spain

In this thesis, we devise computational models for tracking sung lyrics in multi-instrumental music recordings. We consider not only the low-level acoustic characteristics that represent the timbre of the sung phonemes, but also higher-level music knowledge that is complementary to the lyrics. We build probabilistic models based on dynamic Bayesian networks (DBNs) that represent the relation of phoneme transitions to two facets of music knowledge: the temporal structure of a lyrics line and the structure of the metrical cycle. In one model, we exploit the fact that the expected syllable durations depend on their position within a lyrics line. In another model, we propose how to estimate vocal onsets by simultaneously tracking the position in the metrical cycle, and how these estimated onsets influence the transitions between consecutive phonemes. Using the proposed models, sung lyrics are automatically aligned to written lyrics on datasets of Ottoman Turkish makam and Beijing opera, taking into account principles specific to these music traditions. Both models improve on a baseline that is unaware of music-specific knowledge. This confirms that music-specific knowledge is an important stepping stone for computationally tracking lyrics, especially in the challenging case of singing with instrumental accompaniment.

Notes

Funded by the CompMusic project (European Research Council, grant agreement 267583) and the Catalan Scholarship by the Agència de Gestió d'Ajuts Universitaris i de Recerca (AGAUR).

Files

Name	Size	Checksum
PhDThesis_Georgi_Knowledge_based_Lyrics_Tracking.pdf	6.0 MB	md5:ebbf00ee068e41fe66d5ae5ebd59474e

Additional details

Is new version of: 10803/404681 (Handle)

	All versions	This version
Views	402	400
Downloads	405	404
Data volume	2.8 GB	2.8 GB
