Published October 2, 2025 | Version v1
Journal article Open

Benchmarking and Cross-Platform Evaluation of Public Deepfake Detection Models on Viral Real-World Media

  • 1. Glentree Academy Whitefield

Description

Abstract: This study evaluates the performance of publicly accessible deepfake detection tools on 20 viral political and celebrity videos. Deepfakes pose serious risks to public trust and information integrity, yet the reliability of off-the-shelf detection tools for identifying real-world deepfakes remains unclear. We hypothesised that publicly available detectors would show inconsistent accuracy and produce both false positives and false negatives when applied to in-the-wild videos. To test this, we evaluated 20 viral clips (10 confirmed deepfakes, 10 authentic controls) using two public detection platforms: Deepware AI Scanner and UB Media Forensics Lab's DeepFake-O-Meter. We recorded ensemble and per-model likelihoods across more than ten detectors. Results revealed substantial cross-platform disagreement and significant inconsistencies, including frequent false positives and false negatives. One platform's ensemble flagged only a minority of confirmed deepfakes while the research platform produced extreme per-model score variance, so that sensitivity depended strongly on how an intermediate "Suspicious" label was treated. Depending on the binary mapping used, measured sensitivity varied widely while specificity remained high for this sample. Our results demonstrate the shortcomings of existing detection technologies and the pressing need for more reliable, transparent, and strong deepfake forensic techniques. We conclude that current public detectors provide useful signals but are not yet reliable as sole arbiters of authenticity for viral content. We recommend publishing full per-video numeric outputs, versioned model identifiers, and pairing automated screening with human expert review.

Files

Benchmarking and Cross-Platform Evaluation of Public Deepfake Detection Models on Viral Real-World Media.pdf

Additional details

Related works

Has metadata
Dataset: 10.5281/zenodo.17208948 (DOI)