Published January 10, 2021 | Version v1
Conference paper Open

Defending Neural ODE Image Classifiers from Adversarial Attacks with Tolerance Randomization

  • 1. CNR-ISTI
  • 2. CNIT

Description

Deep learned models are now largely adopted in different fields, and they generally provide superior performances with respect to classical signal-based approaches. Notwithstanding this, their actual reliability when working in an unprotected environment is far enough to be proven. In this work, we consider a novel deep neural network architecture, named Neural Ordinary Differential Equations (N-ODE), that is getting particular attention due to an attractive property--a test-time tunable trade-off between accuracy and efficiency. This paper analyzes the robustness of N-ODE image classifiers when faced against a strong adversarial attack and how its effectiveness changes when varying such a tunable trade-off. We show that adversarial robustness is increased when the networks operate in different tolerance regimes during test time and training time. On this basis, we propose a novel adversarial detection strategy for N-ODE nets based on the randomization of the adaptive ODE solver tolerance. Our evaluation performed on standard image classification benchmarks shows that our detection technique provides high rejection of adversarial examples while maintaining most of the original samples under white-box attacks and zero-knowledge adversaries.

Files

defending_preprint.pdf

Files (1.0 MB)

Name Size Download all
md5:db6b65e109e63cb0015b95db6dd76fa0
1.0 MB Preview Download

Additional details

Funding

European Commission
AI4Media - A European Excellence Centre for Media, Society and Democracy 951911
European Commission
AI4EU - A European AI On Demand Platform and Ecosystem 825619