Published June 1, 2026 | Version v1

Modality Matters: A Transient Behavioral Interruption Rescues Agent WANDERING Where Residual Steering Does Not

Authors/Creators

  • 1. OpenInterpretability

Description

On the same 20 WANDERING Qwen3.6-27B SWE-bench Pro trajectories where residual steering fails three times, a transient behavioral interruption -- one fresh user turn at a live tool-entropy collapse point -- roughly doubles the rate at which agents finalize (30% -> 70%, paired McNemar p=0.021), while a residual L11 injection stays inert (p=0.63). The lever is the interruption itself, not its content: a content-neutral message rescues as well as a re-plan (p=1.0). SWE-bench Pro Docker evaluation indicates the rescued finalizations are real fixes and suggests the interruption also raises solve-rate (~23% -> 50%, cross-session, p=0.062). For long-horizon agents the predictive signal lives in the residual stream but the causal lever lives in behavior. Completes a four-paper arc (detect -> localize -> residual fails -> behavioral works). Companion to Tool-Entropy Collapse (DOI 10.5281/zenodo.20368601).

Files

modality_matters.pdf

Files (99.0 kB)

Name Size Download all
md5:a77aaa18f3e6ef9901ae3ae2d4fbc8b4
99.0 kB Preview Download

Additional details