Published March 13, 2026 | Version 1.1.0
Preprint Open

A Bifurcation-Aware Policy Layer for the Learning System Stability Model: Margin-Regulated Exploration Scheduling with Corrected Regret Bounds and Hysteresis

Authors/Creators

Description

This paper derives a bifurcation-aware policy layer for the Learning System Stability Model (LSSM). The central construction is the Margin-Regulated bonus multiplier (MRER): beta(t) = beta_max · sigma(M(t)/M_max), where M = I_cap − L is the LSSM stability margin. Three formal results are established: (T1) MRER-UCB achieves O(ln T) cumulative regret with constant factor 1/beta²_min; (T2) MRER preserves the LSSM stability constraint L ≤ E·S_sys² under an explicit safe-action set condition; (T3) given LSSM bistability, MRER-UCB inherits hysteresis — producing lower exploration bonus on the collapse branch than on the recovery branch at the same nominal load. Version 1.1.0 incorporates corrections following peer review by ChatGPT (OpenAI) and DeepSeek AI. All results are empirically confirmed across 50 Monte Carlo runs in the companion software (DOI: 10.5281/zenodo.19005510).

Files

lssm_policy_layer_v1_1_0-2.pdf

Files (40.1 kB)

Name Size Download all
md5:ba67943d651dc203d25dd33fda64718a
40.1 kB Preview Download

Additional details