Published August 5, 2023 | Version 1.0
Conference paper Open

AudRandAug: Random Image Augmentations for Audio Classification

  • 1. CRT-AI Centre, School of Computing, Dublin City University, Ireland
  • 2. Norwegian University of Science and Technology (NTNU), Gjovik, Norway
  • 3. INSIGHT Research Centre, School of Computing, Dublin City University, Ireland
  • 4. Lero Research Centre, School of Computer Science, University of Galway, Ireland

Description

Data augmentation has proven to be effective in training neural networks. Recently, a method called RandAug was proposed, randomly selecting data augmentation techniques from a predefined search space. RandAug has demonstrated significant performance improvements for image-related tasks while imposing minimal computational overhead. However, no prior research has explored the application of RandAug specifically for audio data augmentation, which converts audio into an image-like pattern. To address this gap, we introduce AudRandAug, an adaptation of RandAug for audio data. AudRandAug selects data augmentation policies from a dedicated audio search space. To evaluate the effectiveness of AudRandAug, we conducted experiments using various models and datasets. Our findings indicate that AudRandAug outperforms other existing data augmentation methods regarding accuracy performance. 

Files

IMVIP_2023-4.pdf

Files (514.1 kB)

Name Size Download all
md5:b8de7f88908e5cd11b15890217ac7dec
514.1 kB Preview Download

Additional details

Related works

Is obsoleted by
10.5281/zenodo.8217347 (Handle)

Funding

Science Foundation Ireland
SFI Centre for Research Training in Artificial Intelligence 18/CRT/6223
Science Foundation Ireland
Lero_Phase 2 13/RC/2094_P2