Published June 3, 2026 | Version v1
Working paper | Open

Reasoning Under Load · 01: Claude Opus 4.8 — An Independent Reasoning-Integrity Evaluation

Authors/Creators

Description

An independent evaluation of Claude Opus 4.8 that applies seven content-neutral inference primitives as a reasoning-integrity rubric across a controlled five-condition experimental series. Three dimensions are tested: reasoning-primitive integrity under dispositional load, memory-layer infrastructure behavior, and artifact coherence under extended collaboration. The paper is both a case-study evaluation and a framework proposal for assessing LLM reasoning integrity beyond standard benchmarks.

Files

inference-constraints.md

Files (23.0 kB)

Name Size
md5:56ebbbc30464142c0378d51aae29d31f 6.1 kB
md5:5ed4e43f1eb7d433f13c1c79f254d88a 16.8 kB