A Study on the Data Distribution Gap in Music Emotion Recognition
Description
Music Emotion Recognition (MER) is a task deeply connected to human perception, relying heavily on subjective annotations collected from contributors. Prior studies tend to focus on specific musical styles rather than incorporating a diverse range of genres, such as rock and classical, within a single framework. In this paper, we address the task of recognizing emotion from audio content by investigating five datasets with dimensional emotion annotations (EmoMusic, DEAM, PMEmo, WTC, and WCMED) that span a variety of musical styles. We demonstrate the problem of out-of-distribution generalization in a systematic experiment. By examining multiple datasets and feature sets, we provide insight into genre-emotion relationships in existing data and identify potential genre dominance and dataset biases in certain feature representations. Based on these experiments, we arrive at a simple yet effective framework that combines embeddings extracted from the Jukebox model with chroma features, and we demonstrate how, together with a combination of several diverse training sets, this allows us to train models with substantially improved cross-dataset generalization.
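The combination described above can be sketched as a simple feature concatenation followed by a lightweight regression probe. The snippet below is a minimal illustration, not the paper's implementation: the embedding and chroma vectors are random stand-ins with reduced dimensionality (real Jukebox activations are much higher-dimensional, and chroma would be averaged from actual audio), and the valence/arousal targets are synthetic.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical stand-ins: in practice the embedding would be a pooled
# Jukebox layer activation and the 12-dim vector time-averaged chroma.
n_tracks = 100
emb_dim = 64                                     # reduced for illustration
jukebox_emb = rng.normal(size=(n_tracks, emb_dim))
chroma_mean = rng.random(size=(n_tracks, 12))    # 12 pitch classes

# Concatenate the two representations per track.
features = np.concatenate([jukebox_emb, chroma_mean], axis=1)

# Closed-form ridge regression to (valence, arousal) targets, a common
# lightweight probe on top of frozen audio representations.
targets = rng.normal(size=(n_tracks, 2))
lam = 1.0
gram = features.T @ features + lam * np.eye(features.shape[1])
weights = np.linalg.solve(gram, features.T @ targets)
preds = features @ weights

print(features.shape)  # (100, 76)
print(preds.shape)     # (100, 2)
```

Training such a probe on a pooled, genre-diverse training set rather than any single dataset is what the study finds to improve cross-dataset generalization.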
Files

CMMR2025_O7_4.pdf (3.0 MB)
md5:cff863ffe2bb4a04f41072dfb151861e