Published March 21, 2026 | Version 1.0
Preprint Open

Flash-Jacobian: Local Operator Analysis Reveals Depth-Dependent Geometry and Nonlinear Factuality in LLMs

Authors/Creators

  • 1. Kaijo School

Description

The Linear Representation Hypothesis—the assumption that high-level concepts are encoded as linear directions in the representation space of large language models (LLMs)—underpins much of current interpretability research, including sparse autoencoders, representation engineering, and probing classifiers. We present Flash Jacobian, a scalable GPU system for extracting cluster-representative Jacobians (CRJs)—piecewise first-order operators that summarize local layer dynamics over token populations—and use it to probe the validity of this assumption across three architectures (Qwen-3.5-4B, Llama-3.2-3B, Phi-3-mini). We report three findings. (1) A universal depth-dependent geometry: local CRJ fidelity follows a U-shaped (bell-shaped) profile across all three models, peaking at middle layers and collapsing at late layers due to gate anisotropy rather than centroid distance. (2) Semantic entanglement in causal directions: clusters extracted by Flash Jacobian are descriptively clean (capturing morphological roles such as surname prefixes and function words), but causal interventions along their principal directions act primarily as boundary-token suppressors rather than semantic activators, with a $400\times$ imbalance in probability mass change. (3) Nonlinear factuality: linear probes trained on hidden states achieve chance-level AUC ($\approx 0.50$) for hallucination detection on HaluBench across all three models, whereas nonlinear classifiers operating on Flash Jacobian's geometric features (trajectory summaries, residual mismatch, gate statistics) achieve AUC $> 0.99$ within each model—but fail to transfer across models (pooled AUC $\approx 0.50$). Together, these results indicate that key representational structures in LLMs are nonlinearly entangled and model-specific, demonstrating that the linear assumption is fundamentally insufficient for the causal and factual phenomena studied here.

Notes

Notice: This version of the manuscript contains some inaccuracies that are currently being corrected. A revised manuscript will be available shortly. I apologize for the early release without sufficient confirmation.

Files

Flash-Jacobian.pdf

Files (1.4 MB)

Name Size Download all
md5:e93a4b0f0852a250e83abf3440a97aba
1.4 MB Preview Download

Additional details

Dates

Updated
2026-03-17
v0.0
Updated
2026-03-21
v1.0