SoundDesc: Cleaned and Group-Filtered Splits
Creators
- 1. Huawei Technologies, Munich Research Center, Germany
- 2. Universitat Pompeu Fabra, Music Technology Group, Barcelona, Spain
Description
This upload contains dataset splits of SoundDesc [1] and other supporting material for our paper:
Data leakage in cross-modal retrieval training: A case study [arXiv] [ieeexplore]
In our paper, we demonstrated that a data leakage problem in the previously published splits of SoundDesc leads to overly optimistic retrieval results.
Using an off-the-shelf audio fingerprinting software, we identified that the data leakage stems from duplicates in the dataset.
We define two new splits for the dataset: a cleaned split to remove the leakage and a group-filtered to avoid other kinds of weak contamination of the test data.
SoundDesc is a dataset which was automatically sourced from the BBC Sound Effects web page [2]. The results from our paper can be reproduced using clean_split01 and group_filtered_split01.
If you use the splits, please cite our work:
Benno Weck, Xavier Serra, "Data Leakage in Cross-Modal Retrieval Training: A Case Study," ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Rhodes Island, Greece, 2023, pp. 1-5, doi: 10.1109/ICASSP49357.2023.10094617.
@INPROCEEDINGS{10094617,
author={Weck, Benno and Serra, Xavier},
booktitle={ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)},
title={Data Leakage in Cross-Modal Retrieval Training: A Case Study},
year={2023},
volume={},
number={},
pages={1-5},
doi={10.1109/ICASSP49357.2023.10094617}}
References:
[1] A. S. Koepke, A. -M. Oncescu, J. Henriques, Z. Akata and S. Albanie, "Audio Retrieval with Natural Language Queries: A Benchmark Study," in IEEE Transactions on Multimedia, doi: 10.1109/TMM.2022.3149712.
[2] https://sound-effects.bbcrewind.co.uk/
Files
splits.zip
Files
(1.3 MB)
Name | Size | Download all |
---|---|---|
md5:030f61918ac65c377660b27c8f908dc5
|
1.3 MB | Preview Download |
md5:7ef362532a4d8542429309e1dfaf20d2
|
22.6 kB | Preview Download |
Additional details
Related works
- Is derived from
- 10.1109/TMM.2022.3149712 (DOI)
- Is supplement to
- Conference paper: 10.1109/ICASSP49357.2023.10094617 (DOI)
- Preprint: 10.48550/arXiv.2302.12258 (DOI)