Published March 5, 2026 | Version v1.0.0
Software
Open
HIDEKI-SQ/Contamination-Induced-Failure: v1.0.0 — Submission release
Authors/Creators
Description
Initial public release accompanying the submission of "Backfire Phase Structure in Contaminated Chain-of-Thought Reasoning."
Contents
- Raw trial data for all experiments (E1, E1b, E1aug, E3, and the E1aug_diag diagnostic)
- Merged n=100 dataset with 95% Wilson confidence intervals
- Experiment notebooks (Google Colab)
- Analysis script for merging and computing CIF/GAF
- Figures (PDF vector format)
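For readers who want to check the reported intervals by hand, the sketch below computes a 95% Wilson score interval for a binomial proportion. It is a generic textbook implementation, not the release's analysis script; the function name `wilson_interval` and its parameters are assumptions made for illustration.

```python
import math


def wilson_interval(successes: int, n: int, z: float = 1.959963984540054):
    """Return (lower, upper) Wilson score bounds for a binomial proportion.

    Hypothetical helper for illustration; z defaults to the two-sided
    95% normal quantile.
    """
    if n <= 0:
        raise ValueError("n must be positive")
    if not 0 <= successes <= n:
        raise ValueError("successes must lie in [0, n]")

    p_hat = successes / n
    z2 = z * z
    denom = 1.0 + z2 / n
    # Centre of the interval is shrunk towards 0.5 relative to p_hat.
    centre = (p_hat + z2 / (2.0 * n)) / denom
    half_width = z * math.sqrt(p_hat * (1.0 - p_hat) / n + z2 / (4.0 * n * n)) / denom
    # Clamp to [0, 1] to absorb floating-point rounding at the boundaries.
    return max(0.0, centre - half_width), min(1.0, centre + half_width)


# Example: 50 failures observed in 100 trials.
lower, upper = wilson_interval(50, 100)
print(f"{lower:.4f} to {upper:.4f}")
```

Unlike the simple normal approximation, the Wilson interval stays inside [0, 1] and remains usable when the observed rate is close to 0% or 100%, which matters for cells with very low failure rates.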
Experiments
| Experiment | Description | Trials |
|------------|-------------|--------|
| E1 | 3 consumer models × 3 domains × 5 conditions × 50 problems | ~2,250 |
| E1b | 2 additional consumer models (same protocol) | ~1,500 |
| E1aug | All 5 models on new problem set (indices 50-99) | ~3,750 |
| E3 | Social compliance battery (5 models) | ~1,000 |
| E1aug_diag | Prompt sensitivity diagnostic (GPT-4o-mini × GSM8K) | ~150 |
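The approximate trial counts for E1, E1b, and E1aug follow from multiplying out the factorial design. The sketch below reproduces that arithmetic. It assumes that E1b and E1aug use the same 3 domains × 5 conditions × 50 problems as E1 (E1b is described as "same protocol", and indices 50-99 span 50 problems); the names `designs` and `nominal_trials` are hypothetical. E3 and E1aug_diag are omitted because the table gives no factor breakdown for them.

```python
from math import prod

# (models, domains, conditions, problems) per experiment.
# The E1b and E1aug factors are assumptions inferred from the table.
designs = {
    "E1": (3, 3, 5, 50),
    "E1b": (2, 3, 5, 50),
    "E1aug": (5, 3, 5, 50),
}


def nominal_trials(factors):
    """Nominal trial count of a fully crossed design."""
    return prod(factors)


totals = {name: nominal_trials(factors) for name, factors in designs.items()}

for name, total in totals.items():
    print(f"{name}: {total:,} nominal trials")
```

These are nominal counts; the table marks its figures as approximate, so the number of usable trials in the raw data may differ slightly.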
Key results (merged n=100)
- CIF rates range from 1.1% (Sonnet 4, BoolQ) to 85.5% (GPT-3.5, CSQA)
- Gap amplification factor: up to 57× (BoolQ)
- Sycophancy dissociation confirmed across all domains
Files
| Name | Size | Checksum |
|---|---|---|
| HIDEKI-SQ/Contamination-Induced-Failure-v1.0.0.zip | 691.5 kB | md5:cc7d5d32fc180c603f29519c2778d573 |
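After downloading the archive, its integrity can be checked against the MD5 checksum listed above. The following is a minimal sketch using Python's standard `hashlib`; the helper names and the local filename in the usage comment are assumptions, so adjust the path to wherever the file was saved.

```python
import hashlib

# Checksum as listed on the record for the v1.0.0 archive.
EXPECTED_MD5 = "cc7d5d32fc180c603f29519c2778d573"


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, read in chunks to bound memory use."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_checksum(path, expected=EXPECTED_MD5):
    """Compare a file's MD5 digest with the expected value, ignoring case."""
    return md5_of_file(path) == expected.strip().lower()


# Usage (the local filename is an assumption):
#   matches_checksum("Contamination-Induced-Failure-v1.0.0.zip")
```

MD5 is adequate for detecting a corrupted or truncated download, but it is not a defence against deliberate tampering.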
Additional details
Related works
- Is supplement to
- Software: https://github.com/HIDEKI-SQ/Contamination-Induced-Failure/tree/v1.0.0 (URL)
Software
- Repository URL
- https://github.com/HIDEKI-SQ/Contamination-Induced-Failure