Observer Depth: Quantifying Reflexive Intelligence in LLMs via Phase Transition Analysis
Authors/Creators
Description
We present ReflexBench, the first benchmark designed to evaluate reflexive reasoning in large language models - the capacity to reason about one's own causal impact on the environment being analyzed. ReflexBench comprises 20 scenarios across 6 domains, each probing four levels of Observer Depth (OD). We evaluate 5 frontier LLMs and find that all exhibit systematic degradation at higher observer depths (mean Delta = -0.50). We propose the Soros Test as a practical standard for evaluating observer-participant readiness and document that reflexive capabilities emerge through a phase transition during multi-reward GRPO training.
Files
paper2_reflexbench.pdf
Files
(482.5 kB)
| Name | Size | Download all |
|---|---|---|
|
md5:4c64d1fed0e2ef657ab2fa4821ad8ca5
|
154.9 kB | Preview Download |
|
md5:7e00fa75a273f31010281524096e9aa1
|
327.5 kB | Preview Download |