Language-based audio retrieval DCASE 2022 evaluation dataset
Description
This is the evaluation dataset for Task 6 (Subtask B), Language-based Audio Retrieval, in DCASE 2022 Challenge.
This evaluation dataset is meant to be used for the purposes of the Subtask B in the Task 6 at the scientific challenge 2022. This dataset is not meant to be used for developing language-based audio retrieval methods. For developing language-based audio retrieval methods, you should use the development dataset, i.e., the Clotho v2.1 dataset, which can be found also in Zenodo, at: https://zenodo.org/record/4783391.
== License ==
The audio files in the archives:
- retrieval_audio.7z
and the associated meta-data in the CSV file:
- retrieval_audio_metadata.csv
are under the corresponding licenses of Freesound [1] platform, mentioned explicitly in the CSV file for each of the audio files. That is, each audio file in the 7z archives is listed in the CSV file with the meta-data. The meta-data for each file are:
- File name
- Keywords
- URL for the orignal audio file
- Start and end samples for the excerpt that is used in the dataset
- Uploader/user in the Freesound platform (manufacturer)
- Link to the license of the file
The caption queries in the file:
- retrieval_captions.csv
are under the Tampere University license, described in the LICENSE file.
==References==
[1] Frederic Font, Gerard Roma, and Xavier Serra. 2013. Freesound technical demo. In Proceedings of the 21st ACM international conference on Multimedia (MM '13). ACM, New York, NY, USA, 411-412. DOI: https://doi.org/10.1145/2502081.2502245
Files
retrieval_audio_metadata.csv
Additional details
References
- Frederic Font, Gerard Roma, and Xavier Serra. 2013. Freesound technical demo. In Proceedings of the 21st ACM international conference on Multimedia (MM '13). ACM, New York, NY, USA, 411-412. DOI: https://doi.org/10.1145/2502081.2502245