Published February 12, 2026
| Version v1
Preprint
Open
PatternWall: Constitutional Governance Middleware for AI Safety - A Pre-Filter Architecture for Model-Agnostic Adversarial Detection
Description
PatternWall is a constitutional governance layer that intercepts prompts before they reach an AI model. In controlled testing against a 28-sequence adversarial corpus across five models, it intercepted every adversarial campaign. The system achieves 100% per-attack detection, a 96.4% per-attack hard block rate, identical results across all five models, zero false positives, and fully deterministic, auditable decisions.
Files
PatternWall_Whitepaper_v3_7.pdf
Files
(196.6 kB)
| Name | Size | Download all |
|---|---|---|
|
md5:1d1626bc7d64e0ac0c896142fe0c9e6d
|
196.6 kB | Preview Download |