Low-Complexity Acoustic Scene Classification in DCASE 2022 Challenge
Creators
- 1. Computing Sciences, Tampere University, Finland
- 2. Fondazione Bruno Kessler, Italy
Description
This paper presents an analysis of the Low-Complexity Acoustic Scene Classification task in DCASE 2022 Challenge. The task was a continuation from the previous years, but the low-complexity requirements were changed to the following: the maximum number of allowed parameters, including the zero-valued ones, was 128 K, with parameters being represented using INT8 numerical for- mat; and the maximum number of multiply-accumulate operations at inference time was 30 million. Despite using the same previous year dataset, the audio samples have been shortened to 1 second instead of 10 second for this year challenge. The provided baseline system is a convolutional neural network which employs post-training quantization of parameters, resulting in 46.5 K parameters, and 29.23 million multiply-and-accumulate operations (MMACs). Its performance on the evaluation data is 44.2% accuracy and 1.532 log-loss. In comparison, the top system in the challenge obtained an accuracy of 59.6% and a log loss of 1.091, having 121 K parameters and 28 MMACs. The task received 48 submissions from 19 different teams, most of which outperformed the baseline system.
Files
Low_complexity_acoustic_scene_classification_in_dcase2022_challenge.pdf
Files
(215.4 kB)
Name | Size | Download all |
---|---|---|
md5:db76c39bf7cc8f4b81db1c59124a8930
|
215.4 kB | Preview Download |
Additional details
Related works
- Is published in
- Conference paper: https://dcase.community/documents/workshop2022/proceedings/DCASE2022Workshop_Martin-Morato_32.pdf (URL)
- Is supplemented by
- Software: https://github.com/marmoi/dcase2022_task1_baseline (URL)
- Dataset: 10.5281/zenodo.6337421 (DOI)
- Dataset: 10.5281/zenodo.6591203 (DOI)