Published June 1, 2026
| Version v1
Preprint
Open
Modality Matters: A Transient Behavioral Interruption Rescues Agent WANDERING Where Residual Steering Does Not
Description
On the same 20 WANDERING Qwen3.6-27B SWE-bench Pro trajectories where residual steering fails three times, a transient behavioral interruption -- one fresh user turn at a live tool-entropy collapse point -- roughly doubles the rate at which agents finalize (30% -> 70%, paired McNemar p=0.021), while a residual L11 injection stays inert (p=0.63). The lever is the interruption itself, not its content: a content-neutral message rescues as well as a re-plan (p=1.0). SWE-bench Pro Docker evaluation indicates the rescued finalizations are real fixes and suggests the interruption also raises solve-rate (~23% -> 50%, cross-session, p=0.062). For long-horizon agents the predictive signal lives in the residual stream but the causal lever lives in behavior. Completes a four-paper arc (detect -> localize -> residual fails -> behavioral works). Companion to Tool-Entropy Collapse (DOI 10.5281/zenodo.20368601).
Files
modality_matters.pdf
Files
(99.0 kB)
| Name | Size | Download all |
|---|---|---|
|
md5:a77aaa18f3e6ef9901ae3ae2d4fbc8b4
|
99.0 kB | Preview Download |
Additional details
Related works
- Is supplemented by
- https://github.com/OpenInterpretability/openinterp-swebench-harness (URL)
- References
- 10.5281/zenodo.20368601 (DOI)