Published March 21, 2023 | Version v1
Conference paper Open

Data Augmentation Does Not Necessarily Beat a Smart Algorithm

Authors/Creators

  • 1. Jožef Stefan Institute, Ljubljana, Slovenia

Description

According to a “widely acknowledged truth”, more training data beats algorithmic improvements in machine learning tasks. We challenge this claim in the context of image data augmentation and image recognition tasks. Our observations show that real training data may be much more valuable than augmented (i.e., artificially generated) data and, most importantly, that the advantage of a sophisticated algorithm over a simple one may not be easily compensated by data augmentation.

Files

JH2023_buza.pdf (286.4 kB)
md5:37edf97a02e9af7587b51065626ad9eb

Additional details

Funding

European Commission
enRichMyData - Enabling Data Enrichment Pipelines for AI-driven Business Products and Services (grant 101070284)