Quantifying the Irreducible: A Systematic Survey of Adversarial Jailbreak Vectors, Red Team Methodologies, and Enterprise Threat Exposure in Large Language Model Deployments
Description
This paper presents a systematic survey of adversarial jailbreak attacks against large language models (LLMs), covering 65 peer-reviewed publications from 2022 to 2025 across NeurIPS, ICML, ICLR, USENIX Security, ACM CCS, and EMNLP. We introduce the Enterprise Threat Exposure Model (ETEM), a risk quantification framework comprising three indices: the Adversarial Penetration Index (API), Defense Residual Vulnerability Score (DRVS), and Regulatory Exposure Quotient (REQ). A five-dimensional taxonomy classifies 23 distinct attack methodologies. Cross-model vulnerability analysis covers six model families including Claude, GPT-4, Gemini, LLaMA, DeepSeek, and Mistral. Emergent threats from agentic architectures, MCP tooling exploitation, and chain-of-thought hijacking in reasoning models are examined. A prescriptive five-layer defense-in-depth architecture and seven open research challenges are identified.
Files
Zenodo_Quantifying_the_Irreducible_Final.pdf
Files
(188.4 kB)
| Name | Size | Download all |
|---|---|---|
|
md5:ba2b76a5daaf4c21ad9d3d32a041ddbf
|
188.4 kB | Preview Download |
Additional details
Funding
- Fundação para a Ciência e Tecnologia
- IEETA - Institute of Electronics and Informatics Engineering of Aveiro UID/00127/2025
- Fundação para a Ciência e Tecnologia
- CIIC - Computer Science and Communications Research Centre UID/04524/2025
- European Commission
- NeuralTrust - LEADING CYBERSECURITY IN THE GENERATIVE AI ERA 101247606
- Fundação para a Ciência e Tecnologia
- IEETA - Institute of Electronics and Informatics Engineering of Aveiro UID/PRR/00127/2025
- European Commission
- CyberSANE - Cyber Security Incident Handling, Warning and Response System for the European Critical Infrastructures 833683
- European Commission
- CyberSure - CYBER Security InSURancE — A Framework for Liability Based Trust 734815
Software
- Repository URL
- https://github.com/sunilgentyala