Published September 21, 2025
| Version v1
Conference paper
Open
An Evaluation Strategy for Local Key Estimation: Exploiting Cross-Version Consistency
Authors/Creators
Description
Local key estimation (LKE) is an important yet challenging task in music information retrieval since it involves a high level of musical abstraction, which entails ambiguity and low inter-annotator agreement. Relying on limited (small) datasets with a single annotation may introduce not only dataset bias but also annotator bias. To address such problems, we propose in this paper a novel, annotation-free evaluation strategy for LKE. To this end, we exploit datasets where multiple versions of the same musical work are available. We investigate the models' consistency across versions, expecting an effective and robust model to output similar predictions on different versions of the same work. In our experiments, we study the behavior of the proposed cross-version consistency measure at the example of different models and datasets, indicating a strong correlation between cross-version consistency and the models' effectiveness on in-domain data as well as their generalization to out-of-domain data. Our further studies show that, while being correlated to common evaluation metrics, cross-version consistency is also capturing different aspects of model behavior, thus serving as an additional figure of merit for evaluating LKE models.
Files
000019.pdf
Files
(1.7 MB)
| Name | Size | Download all |
|---|---|---|
|
md5:4fbaa36721e6fcddfcaa73e0dc1e4467
|
1.7 MB | Preview Download |