Published April 14, 2026 | Version v1
Publication Open

PhantomRed: An Autonomous AI-Powered Penetration Testing Platform with a Consent-First Ethical Framework

  • 1. Independent Researcher

Description

Penetration testing remains a cornerstone of modern cybersecurity practice, yet its adoption is hindered by high cost, scarce expertise, and time-intensive manual workflows. We present PhantomRed, an autonomous penetration testing platform that combines a ReAct-based AI agent loop, a multi-tool reconnaissance and vulnerability scanning pipeline, and an AI-driven analysis layer to deliver end-to-end security assessments with minimal human effort. PhantomRed integrates industry-standard open-source tools—Nmap, Nuclei, FFUF, and SQLMap—with a locally hosted Llama 3 8B language model to reason over findings and dispatch targeted follow-up probes. A central design principle is a consent-first ethical framework: every scan requires explicit target pre-authorization via a scope.json manifest, a hard confirmation gate, and a blocklist preventing scans of critical infrastructure. Evaluation on the publicly authorized target scanme.nmap.org demonstrates that PhantomRed surfaces six distinct findings—including CVE-2023-48795 (CVSS 5.9)—in approximately four minutes, compared to an estimated 30–45 minutes for an experienced manual tester. PhantomRed is publicly accessible at phantomred.com under a free tier requiring no payment information.

Files

phantomred_paper_final.pdf

Files (253.3 kB)

Name Size Download all
md5:fd7732615abe2f49fca4487c4f8ef120
253.3 kB Preview Download