Published July 22, 2021 | Version 1.0.0
Dataset Open

DCASE2021 UAD-S UMAP Data

  • 1. University of Surrey

Description

Support data for our paper:

USING UMAP TO INSPECT AUDIO DATA FOR UNSUPERVISED ANOMALY DETECTION UNDER DOMAIN-SHIFT CONDITIONS

An arXiv preprint of the paper is available. Code for the experiment software pipeline described in the paper is available in the GitHub repository (https://github.com/andres-fr/dcase2021_umaps). The pipeline requires and generates different forms of data. Here we provide the following:

  1. AudioSet_wav_fragments.zip: This is a custom selection of 39437 wav files (32 kHz, mono, 10 seconds) randomly extracted from AudioSet (originally released under CC-BY). In addition to this custom subset, the paper also uses the following datasets, which can be downloaded from their respective websites:
    1. DCASE2021 Task 2 Development Dataset
    2. DCASE2021 Task 2 Additional Training Dataset
    3. Fraunhofer's IDMT-ISA-ELECTRIC-ENGINE Dataset
  2. dcase2021_uads_umaps.zip: Computing the UMAPs requires first extracting the log-STFT, log-mel and L3 representations and then running UMAP on them, which can take a substantial amount of time and resources. For convenience, we provide here the 72 UMAPs discussed in the paper.
  3. dcase2021_uads_umap_plots.zip: Also for convenience, we provide here the 198 high-resolution scatter plots rendered from the UMAPs.
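
After extracting AudioSet_wav_fragments.zip, it can be useful to confirm that the fragments match the format stated above (32 kHz, mono, 10 seconds). The following is a minimal sketch, not part of the published pipeline. It assumes the files are uncompressed PCM WAV (which Python's standard-library `wave` module can read) and that they end in `.wav`; the function names and the duration tolerance are illustrative choices.

```python
import wave
from pathlib import Path

# Expected format, taken from the dataset description.
EXPECTED_RATE_HZ = 32000
EXPECTED_CHANNELS = 1
EXPECTED_SECONDS = 10.0


def describe_wav(path):
    """Return (sample_rate, channels, duration_seconds) for a PCM WAV file."""
    with wave.open(str(path), "rb") as wav:
        rate = wav.getframerate()
        channels = wav.getnchannels()
        duration = wav.getnframes() / rate
    return rate, channels, duration


def find_unexpected(folder, tolerance_s=0.01):
    """List WAV files under `folder` whose format differs from the description.

    `tolerance_s` is an assumed allowance for small duration differences.
    """
    unexpected = []
    for path in sorted(Path(folder).rglob("*.wav")):
        rate, channels, duration = describe_wav(path)
        if (rate != EXPECTED_RATE_HZ
                or channels != EXPECTED_CHANNELS
                or abs(duration - EXPECTED_SECONDS) > tolerance_s):
            unexpected.append((path, rate, channels, duration))
    return unexpected
```

Calling `find_unexpected` on the extraction folder returns an empty list if every fragment matches the stated format; `len(list(Path(folder).rglob("*.wav")))` can likewise be compared against the stated count of 39437 files.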

For a comprehensive visual inspection of the computed representations, it is sufficient to download the plots only. Users interested in exploring the plots interactively will need to download all the audio datasets and compute the log-STFT, log-mel and L3 representations as well as the UMAPs themselves (code provided in the GitHub repository). UMAPs for further representations can also be computed and plotted.

Files

Files (22.4 GB)

Size      Checksum
21.3 GB   md5:a9915126513213920347a79f4400d452
948.7 MB  md5:6f8b8106a299459d114ab04c41c16abc
210.4 MB  md5:6ca83bf4e7f82a231643af4e7cc4076c
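
Given the size of the archives, it may be worth checking a finished download against its listed MD5 checksum before extracting it. The sketch below uses only Python's standard library; the function names are illustrative, and which checksum belongs to which archive should be confirmed on the record page, since this listing does not pair them with file names.

```python
import hashlib


def md5_of_file(path, chunk_size=1 << 20):
    """Compute the MD5 hex digest of a file, reading it in 1 MiB chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # Read until an empty bytes object signals end of file.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_checksum(path, expected):
    """Compare a file's MD5 with a listed value, with or without 'md5:' prefix."""
    expected_hex = expected.strip().lower().split(":", 1)[-1]
    return md5_of_file(path) == expected_hex
```

For example, `matches_checksum("AudioSet_wav_fragments.zip", "md5:...")` with the checksum copied from the listing returns `True` only if the download is intact. Reading in chunks keeps memory use low even for the 21.3 GB archive.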

Additional details

Related works

Is supplement to
Software: https://github.com/andres-fr/dcase2021_umaps (URL)

Funding

AI for Sound EP/T019751/1
UK Research and Innovation