Data from "Benchmark Generation Framework with Customizable Distortions for Image Classifier Robustness"

Sarkar, Soumyendu; Ramesh Babu, Ashwin; Mousavi, Sajad; Gundecha, Vineet; Ghorbanpour, Sahand; Carmichael, Zachariah; Guillen, Antonio; Luna Gutierrez, Ricardo; Naug, Avisek

doi:10.5281/zenodo.8034833

Published June 13, 2023 | Version v1

Dataset Open

1. Hewlett Packard Enterprise

This repository contains the data from the paper, "Benchmark Generation Framework with Customizable Distortions for Image Classifier Robustness."

Relevant URLs:

https://hewlettpackard.github.io/trust-ml/

https://github.com/HewlettPackard/trust-ml/

Abstract:

We present a novel framework for generating adversarial benchmarks to evaluate the robustness of image classification models. The RLAB framework allows users to customize the types of distortions to be optimally applied to images, which helps address the specific distortions relevant to their deployment. The benchmark can generate datasets at various distortion levels to assess the robustness of different image classifiers. Our results show that the adversarial samples generated by our framework with any of the image classification models, like ResNet-50, Inception-V3, and VGG-16, are effective and transferable to other models causing them to fail. These failures happen even when these models are adversarially retrained using state-of-the-art techniques, demonstrating the generalizability of our adversarial samples. Our framework also allows the creation of adversarial samples for non-ground truth classes at different levels of intensity, enabling tunable benchmarks for the evaluation of false positives. We achieve competitive performance in terms of net $L_2$ distortion compared to state-of-the-art benchmark techniques on CIFAR-10 and ImageNet; however, we demonstrate our framework achieves such results with simple distortions like Gaussian noise without introducing unnatural artifacts or color bleeds. This is made possible by a model-based reinforcement learning (RL) agent and a technique that reduces a deep tree search of the image for model sensitivity to perturbations, to a one-level analysis and action. The flexibility of choosing distortions and setting classification probability thresholds for multiple classes makes our framework suitable for algorithmic audits.
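For readers unfamiliar with the metric named in the abstract, the following is a minimal sketch of how an $L_2$ distortion between an original and a distorted image could be measured when the distortion is Gaussian noise. It is not the RLAB framework: the RL agent that decides where and how much distortion to apply is not reproduced here, and the noise below is applied uniformly and unguided. The function names `l2_distortion` and `add_gaussian_noise`, the `[0, 1]` pixel range, and the random stand-in image are all assumptions made for illustration.

```python
import numpy as np


def l2_distortion(original, distorted):
    """Euclidean (L2) norm of the pixel-wise difference between two images."""
    diff = distorted.astype(np.float64) - original.astype(np.float64)
    return float(np.linalg.norm(diff.ravel()))


def add_gaussian_noise(image, sigma, rng):
    """Add zero-mean Gaussian noise and clip back to the assumed [0, 1] range."""
    noisy = image.astype(np.float64) + rng.normal(0.0, sigma, size=image.shape)
    return np.clip(noisy, 0.0, 1.0)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    # Hypothetical stand-in for a CIFAR-10 sized image (32x32 RGB, values in [0, 1]).
    image = rng.random((32, 32, 3))
    for sigma in (0.01, 0.05, 0.1):
        noisy = add_gaussian_noise(image, sigma, rng)
        print(f"sigma={sigma}: L2 distortion = {l2_distortion(image, noisy):.3f}")
```

Larger `sigma` values should give larger distortion; a benchmark in the spirit of the abstract would aim for the smallest distortion that still changes the classifier's prediction.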

Files

Name	Size
cifar_and_imagenet_distorted_data.zip (md5:f5d23c78f2c4b835c8ddd2d4bc97ec22)	1.3 GB
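Since the record lists an MD5 checksum for the archive, a download can be checked against it before unpacking. The sketch below uses only the Python standard library; the helper names `md5_of_file` and `verify_archive` are hypothetical, and the script assumes the archive was saved under its listed name in the current directory.

```python
import hashlib
from pathlib import Path

# Checksum as listed on the record for cifar_and_imagenet_distorted_data.zip.
EXPECTED_MD5 = "f5d23c78f2c4b835c8ddd2d4bc97ec22"


def md5_of_file(path, chunk_size=1 << 20):
    """Compute the MD5 hex digest of a file, reading it in 1 MiB chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_archive(path, expected=EXPECTED_MD5):
    """Return True if the file's MD5 digest matches the expected value."""
    return md5_of_file(path) == expected.lower()


if __name__ == "__main__":
    # Assumed local file name; adjust to wherever the archive was saved.
    archive = Path("cifar_and_imagenet_distorted_data.zip")
    if archive.exists():
        print("checksum OK" if verify_archive(archive) else "checksum MISMATCH")
    else:
        print(f"{archive} not found; download it first")
```

Reading in chunks keeps memory use flat for the 1.3 GB file. A mismatch usually indicates an incomplete or corrupted download.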

