There is a newer version of the record available.

Published September 21, 2025 | Version v1
Conference paper Open

Beyond Genre: Diagnosing Bias in Music Embeddings Using Concept Activation Vectors

Description

Music representation models are widely used for tasks such as tagging, retrieval, and music understanding. Yet, their potential to encode cultural bias remains underexplored. In this paper, we apply Concept Activation Vectors (CAVs) to investigate whether non-musical singer attributes—such as gender and language—influence genre representations in unintended ways. We analyze four state-of-the-art models (MERT, Whisper, MuQ, MuQ-MuLan) using the STraDa dataset, carefully balancing training sets to control for genre confounds. Our results reveal significant model-specific biases, aligning with disparities reported in MIR and music sociology. Furthermore, we propose a post-hoc debiasing strategy using concept vector manipulation, demonstrating its effectiveness in mitigating these biases. These findings highlight the need for bias-aware model design and show that conceptualized interpretability methods offer practical tools for diagnosing and mitigating representational bias in MIR.

Files

000032.pdf

Files (648.6 kB)

Name Size Download all
md5:b3898219f5aeed9c80457facccc08630
648.6 kB Preview Download