Published May 29, 2022 | Version 1.0
Dataset Open

Language-based audio retrieval DCASE 2022 evaluation dataset

  • 1. Tampere University

Description

This is the evaluation dataset for Task 6 (Subtask B), Language-based Audio Retrieval, in DCASE 2022 Challenge.

This evaluation dataset is meant to be used for the purposes of the Subtask B in the Task 6 at the scientific challenge 2022. This dataset is not meant to be used for developing language-based audio retrieval methods. For developing language-based audio retrieval methods, you should use the development dataset, i.e., the Clotho v2.1 dataset, which can be found also in Zenodo, at: https://zenodo.org/record/4783391.

 

== License ==

The audio files in the archives:

  • retrieval_audio.7z

 and the associated meta-data in the CSV file:

  • retrieval_audio_metadata.csv

 are under the corresponding licenses of Freesound [1] platform, mentioned explicitly in the CSV file for each of the audio files. That is, each audio file in the 7z archives is listed in the CSV file with the meta-data. The meta-data for each file are:

  • File name
  • Keywords
  • URL for the orignal audio file
  • Start and end samples for the excerpt that is used in the dataset
  • Uploader/user in the Freesound platform (manufacturer)
  • Link to the license of the file

 The caption queries in the file:

  • retrieval_captions.csv

 are under the Tampere University license, described in the LICENSE file.

 

==References==

[1] Frederic Font, Gerard Roma, and Xavier Serra. 2013. Freesound technical demo. In Proceedings of the 21st ACM international conference on Multimedia (MM '13). ACM, New York, NY, USA, 411-412. DOI: https://doi.org/10.1145/2502081.2502245

Files

retrieval_audio_metadata.csv

Files (1.1 GB)

Name Size Download all
md5:e153e024065bdb45a0d5f8fde274b8bd
1.8 kB Download
md5:24102395fd757c462421a483fba5c407
1.1 GB Download
md5:1301db07acbf1e4fabc467eb54e0d353
212.4 kB Preview Download
md5:f9e810118be00c64ea8cd7557816d4fe
63.2 kB Preview Download

Additional details

References

  • Frederic Font, Gerard Roma, and Xavier Serra. 2013. Freesound technical demo. In Proceedings of the 21st ACM international conference on Multimedia (MM '13). ACM, New York, NY, USA, 411-412. DOI: https://doi.org/10.1145/2502081.2502245