AudioSet Strong Ensemble Logits

Schmid, Florian

doi:10.5281/zenodo.14626113

Published January 10, 2025 | Version v1

Dataset Open

AudioSet Strong Ensemble Logits

Schmid, Florian (Data curator)¹

1. Johannes Kepler University of Linz

This upload contains one HDF5 file that stores ensemble predictions on AudioSet Strong audio files. It is supplementary material for the ICASSP'25 paper Effective Pre-Training of Audio Transformers for Sound Event Detection. The corresponding code can be found in this GitHub repository.

The HDF5 file contains filenames (Youtube IDs) matched with ensembled logits of multiple transformer models. The corresponding keys are "filenames" and "strong_logits". Ensemble Logits for one file are of shape 447 x 250 (number of classes x timeframes at 40 ms resolution). Ensemble Logits are stored in float16 format to save space. Check out the GitHub repository for information on how to use the ensemble logits.

Files

Files (22.6 GB)

Name	Size	Download all
audioset_strong_ensemble_logits.hdf5 md5:2e34dd1fc30a084bff9234e4cbd89b53	22.6 GB	Download

Additional details

Repository URL: https://github.com/fschmid56/PretrainedSED
Development Status: Active

263

Views

145

Downloads

Show more details

	All versions	This version
Views	263	263
Downloads	145	145
Data volume	3.7 TB	3.7 TB

More info on how stats are collected....

DOI

Resource type

Dataset

Publisher

Zenodo

Conference

2025 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2025) , Hyderabad, India, 6-11 April 2025

Languages

English

License: Creative Commons Attribution 4.0 International

The Creative Commons Attribution license allows re-distribution and re-use of a licensed work on the condition that the creator is appropriately credited. Read more

Technical metadata

Created: January 10, 2025
Modified: January 10, 2025

AudioSet Strong Ensemble Logits

Authors/Creators

Description

Files

Files (22.6 GB)

Additional details

Software