Language-based audio retrieval DCASE 2022 evaluation dataset

Lipping, Samuel

doi:10.5281/zenodo.6590983

Published May 29, 2022 | Version 1.0

Dataset Open

Lipping, Samuel¹

1. Tampere University

This is the evaluation dataset for Task 6 (Subtask B), Language-based Audio Retrieval, in the DCASE 2022 Challenge.

This evaluation dataset is intended only for Subtask B of Task 6 in the DCASE 2022 Challenge. It is not meant to be used for developing language-based audio retrieval methods. For development, use the development dataset, i.e., the Clotho v2.1 dataset, which is also available on Zenodo at: https://zenodo.org/record/4783391.

== License ==

The audio files in the archive:

retrieval_audio.7z

and the associated meta-data in the CSV file:

retrieval_audio_metadata.csv

are under the corresponding licenses of the Freesound [1] platform, which are stated explicitly in the CSV file for each audio file. That is, each audio file in the 7z archive is listed in the CSV file together with its meta-data. The meta-data for each file are:

File name
Keywords
URL for the original audio file
Start and end samples for the excerpt that is used in the dataset
Uploader/user in the Freesound platform (manufacturer)
Link to the license of the file
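
The per-file meta-data listed above can be read with Python's standard csv module. The following is a minimal sketch, not official tooling for the dataset: the column names (file_name, keywords, sound_link, start_end_samples, manufacturer, license) and the "[start, end]" formatting of the sample range are assumptions modeled on the list above, so check them against the actual header of retrieval_audio_metadata.csv before relying on them. The example row is made up and is not taken from the dataset.

```python
import ast
import csv
import io

# Hypothetical column name; verify against the real CSV header.
START_END_COLUMN = "start_end_samples"


def read_metadata(csv_file):
    """Return one dict per audio file from an open CSV file object."""
    rows = []
    for row in csv.DictReader(csv_file):
        raw_range = row.get(START_END_COLUMN, "")
        try:
            # Assumed format: "[start, end]", both given in samples.
            start, end = ast.literal_eval(raw_range)
            row["excerpt_samples"] = (int(start), int(end))
        except (ValueError, SyntaxError, TypeError):
            # Missing or differently formatted range: leave it unparsed.
            row["excerpt_samples"] = None
        rows.append(row)
    return rows


# Tiny in-memory example with made-up values (not real dataset rows).
sample_csv = io.StringIO(
    "file_name,keywords,sound_link,start_end_samples,manufacturer,license\n"
    'example.wav,rain;street,https://example.org/sound,"[0, 441000]",'
    "some_user,https://example.org/license\n"
)
metadata = read_metadata(sample_csv)
print(metadata[0]["file_name"], metadata[0]["excerpt_samples"])
```

To use the sketch with the downloaded file, open it with open("retrieval_audio_metadata.csv", newline="", encoding="utf-8") and pass the file object to read_metadata. The encoding is also an assumption.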

The caption queries in the file:

retrieval_captions.csv

are under the Tampere University license, described in the LICENSE file.

== References ==

[1] Frederic Font, Gerard Roma, and Xavier Serra. 2013. Freesound technical demo. In Proceedings of the 21st ACM international conference on Multimedia (MM '13). ACM, New York, NY, USA, 411-412. DOI: https://doi.org/10.1145/2502081.2502245

Files (1.1 GB)

Name	Size	MD5
LICENSE	1.8 kB	e153e024065bdb45a0d5f8fde274b8bd
retrieval_audio.7z	1.1 GB	24102395fd757c462421a483fba5c407
retrieval_audio_metadata.csv	212.4 kB	1301db07acbf1e4fabc467eb54e0d353
retrieval_captions.csv	63.2 kB	f9e810118be00c64ea8cd7557816d4fe
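
Each file in the list above comes with an MD5 checksum, which can be used to confirm that a download completed without corruption. A minimal Python sketch follows. The helper names md5_of_file and verify_downloads are illustrative, and the sketch assumes the files were saved under their listed names in a single directory.

```python
import hashlib
from pathlib import Path

# Checksums copied from the file list above.
EXPECTED_MD5 = {
    "LICENSE": "e153e024065bdb45a0d5f8fde274b8bd",
    "retrieval_audio.7z": "24102395fd757c462421a483fba5c407",
    "retrieval_audio_metadata.csv": "1301db07acbf1e4fabc467eb54e0d353",
    "retrieval_captions.csv": "f9e810118be00c64ea8cd7557816d4fe",
}


def md5_of_file(path, chunk_size=1 << 20):
    """Compute the MD5 hex digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_downloads(directory="."):
    """Map each expected file name to 'ok', 'mismatch', or 'missing'."""
    results = {}
    for name, expected in EXPECTED_MD5.items():
        path = Path(directory) / name
        if not path.is_file():
            results[name] = "missing"
        elif md5_of_file(path) == expected:
            results[name] = "ok"
        else:
            results[name] = "mismatch"
    return results


if __name__ == "__main__":
    # Assumes the downloads are in the current working directory.
    for name, status in verify_downloads().items():
        print(f"{name}: {status}")
```

Reading in chunks keeps memory use low for the 1.1 GB archive. A "mismatch" result usually means the download should be repeated.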
