Published April 11, 2023 | Version 0.0.1
Dataset Open

Anechoic and IR Convolution-based Auralization Data Compilation Ensemble (AIRCADE)

  • 1. Universidade Federal de Santa Maria
  • 2. University of Campinas

Description

AIRCADE is a data-compilation ensemble, primarily intended to serve as a resource for researchers in the field of dereverberation, particularly for data-driven approaches. It comprises speech and song samples, together with acoustic guitar sounds, with original annotations pertinent to emotion recognition and Music Information Retrieval (MIR). Moreover, it includes a selection of Impulse Response (IR) samples with varying Reverberation Time (RT) values, providing a wide range of conditions for evaluation. This data-compilation can be used together with provided Python scripts (available on GitHub), for generating auralized data ensembles in different sizes: tinysmallmedium and large. Additionally, the provided metadata annotations also allow for further analysis and investigation of the performance of dereverberation algorithms under different conditions. All data is licensed under Creative Commons Attribution 4.0 International License.

About the sizeable versions:

The data-compilation is hosted here at Zenodo, with an approximate total file size of 1.3 GB. For simplicity, all samples in our data-compilation were renamed, e.g., guitar_0000rir_0000song_0000speech_0000, and so on. The ensemble versions are available in different sizes, from a tiny version, with limited data, to a large version, with almost 300,000 samples. This allows users to choose the most suitable version for their specific research needs. The following table illustrates the differences between all versions, detailing the number of song, speech, guitar, IR and auralized samples in each one, together with their respective total file size and duration.

Number of anechoic, IR and resultant auralized data samples, together with their respective total duration and file size for each ensemble version
Version Tiny Small Medium Large
Song samples 100 500 1,012 1,012
Speech samples 100 500 1,012 1,440
Guitar samples 100 500 1,012 2,004
IR samples 5 9 33 65
Auralized samples 1,500 13,500 100,188 289,640
Total duration 3.2 h 30.41 h 221.77 h 658.08 h
Total file size (required) 1.1 GB 10.5 GB 76.6 GB 227.5 GB

For more information, please refer to our data paper on ArXiv.

Citation:

If you find AIRCADE useful in your research, please cite:

@misc{chiodi2023aircade,
    title={AIRCADE: an Anechoic and IR Convolution-based Auralization Data-compilation Ensemble},
    author={Túlio Chiodi and Arthur dos Santos and Pedro Martins and Bruno Masiero},
    year={2023},
    eprint={2304.09318},
    archivePrefix={arXiv},
    primaryClass={eess.AS}
}

Acknowledgement:

This work was partially supported by the São Paulo Research Foundation (FAPESP), grants #2017/08120-6 and #2019/22795-1.

Files

guitar.zip

Files (1.3 GB)

Name Size Download all
md5:8b81661ea4ee432d264fbaa39413ec2f
808.8 MB Preview Download
md5:8457cf41dded6bce86463b1bdfe0a34e
41.0 kB Preview Download
md5:74cfdb5b4548d0957f22819b73090de8
5.9 MB Preview Download
md5:2aaaa07ba22235b6aee5b6eb0033155c
230.1 MB Preview Download
md5:fdc21b3cb1d527bf90311bc4d1476946
214.6 MB Preview Download