Published December 7, 2025 | Version v1.0
Preprint Open

The Collapse Index (CI) CrackTest: Morphology-Aligned Perturbation Testing Reveals Systematic Collapse Inheritance in Frontier Language Models

  • 1. Collapse Index Labs

Description

This preprint introduces the Collapse Index (CI) CrackTest, a morphology-aligned perturbation framework for evaluating robustness and collapse inheritance in large language models (LLMs). The study demonstrates that CI CrackTest, a bounded and lightweight perturbation protocol originally developed for brittleness diagnostics, can quantify systematic error propagation across morphologically-related variants in controlled classification tasks.

Using a 186-variant perturbation suite spanning eight morphological families (lexical, syntactic, ambiguity, semantic, compression, noise, boundary, contrastive), the evaluation analyzes collapse inheritance, family-specific brittleness, and confidence behavior under perturbation. Across three frontier models (GPT-4o, Claude Haiku 4.5, Gemini 2.5 Flash), the framework identifies consistent robustness signatures, including 11–16% collapse inheritance rates, 9–28% semantic brittleness, and 0.10–0.17 confidence masking deltas, independent of architecture or training regime.

The framework is presented as a behavioral diagnostic tool for robustness analysis. CI CrackTest does not expose internal perturbation heuristics, variant-generation mechanisms, or scoring systems. All reported findings are based solely on externally observable model outputs under controlled morphological perturbation.
All internal algorithms, classification mechanisms, and inference procedures remain proprietary to Collapse Index Labs.

Project page: https://collapseindex.org
Licensed under CC BY-NC-ND 4.0.

Files

collapse-index-cracktest-v1.0.pdf

Files (234.7 kB)

Name Size Download all
md5:78298b167ef7ace72b6f881e9cf84aaf
234.7 kB Preview Download

Additional details

Related works

Cites
Preprint: 10.5281/ZENODO.17718180 (DOI)