Published March 26, 2026 | Version 1.0
Preprint Open

Quantifying the Irreducible: A Systematic Survey of Adversarial Jailbreak Vectors, Red Team Methodologies, and Enterprise Threat Exposure in Large Language Model Deployments

Authors/Creators

  • 1. HCLTech (HCL America)

Description

This paper presents a systematic survey of adversarial jailbreak attacks against large language models (LLMs), covering 65 peer-reviewed publications from 2022 to 2025 across NeurIPS, ICML, ICLR, USENIX Security, ACM CCS, and EMNLP. We introduce the Enterprise Threat Exposure Model (ETEM), a risk quantification framework comprising three indices: the Adversarial Penetration Index (API), Defense Residual Vulnerability Score (DRVS), and Regulatory Exposure Quotient (REQ). A five-dimensional taxonomy classifies 23 distinct attack methodologies. Cross-model vulnerability analysis covers six model families including Claude, GPT-4, Gemini, LLaMA, DeepSeek, and Mistral. Emergent threats from agentic architectures, MCP tooling exploitation, and chain-of-thought hijacking in reasoning models are examined. A prescriptive five-layer defense-in-depth architecture and seven open research challenges are identified.

Files

Zenodo_Quantifying_the_Irreducible_Final.pdf

Files (188.4 kB)

Name Size Download all
md5:ba2b76a5daaf4c21ad9d3d32a041ddbf
188.4 kB Preview Download

Additional details

Funding

Fundação para a Ciência e Tecnologia
IEETA - Institute of Electronics and Informatics Engineering of Aveiro UID/00127/2025
Fundação para a Ciência e Tecnologia
CIIC - Computer Science and Communications Research Centre UID/04524/2025
European Commission
NeuralTrust - LEADING CYBERSECURITY IN THE GENERATIVE AI ERA 101247606
Fundação para a Ciência e Tecnologia
IEETA - Institute of Electronics and Informatics Engineering of Aveiro UID/PRR/00127/2025
European Commission
CyberSANE - Cyber Security Incident Handling, Warning and Response System for the European Critical Infrastructures 833683
European Commission
CyberSure - CYBER Security InSURancE — A Framework for Liability Based Trust 734815

Software