Improving the Perturbation-Based Explanation of Deepfake Detectors Through the Use of Adversarially-Generated Samples

Tsigos, Konstantinos; Apostolidis, Evlampios; Mezaris, Vasileios

doi:10.5281/zenodo.14995960

Published February 2025 | Version v1

Conference paper | Open access

1. Centre for Research and Technology Hellas

In this paper, we introduce the idea of using adversarially-generated samples of the input images that were classified as deepfakes by a detector to form perturbation masks, which are used to infer the importance of different input features and produce visual explanations. We generate these samples using Natural Evolution Strategies, aiming to flip the original deepfake detector's decision so that the samples are classified as real. We apply this idea to four perturbation-based explanation methods (LIME, SHAP, SOBOL and RISE) and evaluate the performance of the resulting modified methods using a state-of-the-art deepfake detection model, a benchmarking dataset (FaceForensics++) and a corresponding explanation evaluation framework. Our quantitative assessments document the mostly positive contribution of the proposed perturbation approach to the performance of the explanation methods. Our qualitative analysis shows the capacity of the modified explanation methods to demarcate the manipulated image regions more accurately, and thus to provide more useful explanations.
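
The idea described above can be illustrated with a minimal sketch. It is not the authors' implementation and it does not reproduce LIME, SHAP, SOBOL or RISE. It assumes a hypothetical toy detector (toy_detector) that only reacts to one image patch, a simple NES-style black-box attack with antithetic Gaussian sampling (nes_attack), and an occlusion-style importance estimate (region_importance) in which each grid cell of the input is replaced with the corresponding adversarial content. All function names, parameters and values are assumptions made for illustration.

```python
import numpy as np


def toy_detector(img):
    """Hypothetical stand-in for a deepfake detector: returns P(fake).

    It only reacts to the mean brightness of the top-left 4x4 patch.
    """
    logit = 8.0 * (img[:4, :4].mean() - 0.5)
    return 1.0 / (1.0 + np.exp(-logit))


def nes_attack(img, score_fn, steps=200, pop=30, sigma=0.05,
               lr=0.02, eps=0.6, seed=0):
    """Black-box attack that tries to push score_fn below 0.5 using an
    NES-style gradient estimate with antithetic Gaussian sampling."""
    rng = np.random.default_rng(seed)
    adv = img.copy()
    for _step in range(steps):
        if score_fn(adv) < 0.5:  # decision flipped to "real"
            break
        grad = np.zeros_like(img)
        for _sample in range(pop):
            u = rng.standard_normal(img.shape)
            plus = score_fn(np.clip(adv + sigma * u, 0.0, 1.0))
            minus = score_fn(np.clip(adv - sigma * u, 0.0, 1.0))
            grad += (plus - minus) * u
        grad /= 2.0 * pop * sigma
        # Signed descent step on the estimated gradient of P(fake).
        adv = adv - lr * np.sign(grad)
        # Keep the sample within eps of the input and in the valid range.
        adv = np.clip(adv, img - eps, img + eps)
        adv = np.clip(adv, 0.0, 1.0)
    return adv


def region_importance(img, adv, score_fn, cell=4):
    """Score each grid cell by how much P(fake) drops when that cell of
    the original image is replaced with the adversarial content."""
    base = score_fn(img)
    rows, cols = img.shape[0] // cell, img.shape[1] // cell
    heat = np.zeros((rows, cols))
    for r in range(rows):
        for c in range(cols):
            mixed = img.copy()
            block = (slice(r * cell, (r + 1) * cell),
                     slice(c * cell, (c + 1) * cell))
            mixed[block] = adv[block]
            heat[r, c] = base - score_fn(mixed)
    return heat


# Toy 8x8 "image" whose bright top-left patch plays the manipulated region.
image = np.full((8, 8), 0.3)
image[:4, :4] = 0.9
adversarial = nes_attack(image, toy_detector)
heatmap = region_importance(image, adversarial, toy_detector)
```

In this toy setting only the top-left cell of heatmap receives a non-zero score, because the assumed detector ignores every other pixel. With a real detector, the adversarial content would replace the fixed occlusion value or noise that a perturbation-based explainer normally uses.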

Files

Name	Size
WACV2025_AI4MFDD_zenodo.pdf (md5:29ce41a7f9abf64d1e9979d2fe1e8128)	9.9 MB

Additional details

Funding
European Commission: AI4TRUST - AI-based-technologies for trustworthy solutions against disinformation (grant 101070190)
