A Bifurcation-Aware Policy Layer for the Learning System Stability Model: Margin-Regulated Exploration Scheduling with Corrected Regret Bounds and Hysteresis

Bankuti, Omri

doi:10.5281/zenodo.19005510

Published March 13, 2026 | Version 1.1.0

Preprint Open

This paper derives a bifurcation-aware policy layer for the Learning System Stability Model (LSSM). The central construction is the Margin-Regulated bonus multiplier (MRER): beta(t) = beta_max · sigma(M(t)/M_max), where M = I_cap − L is the LSSM stability margin. Three formal results are established: (T1) MRER-UCB achieves O(ln T) cumulative regret with constant factor 1/beta²_min; (T2) MRER preserves the LSSM stability constraint L ≤ E·S_sys² under an explicit safe-action set condition; (T3) given LSSM bistability, MRER-UCB inherits hysteresis, producing a lower exploration bonus on the collapse branch than on the recovery branch at the same nominal load. Version 1.1.0 incorporates corrections following peer review by ChatGPT (OpenAI) and DeepSeek AI. All results are empirically confirmed across 50 Monte Carlo runs in the companion software (DOI: 10.5281/zenodo.19005510).
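The multiplier formula in the abstract can be illustrated with a minimal Python sketch. Several details here are assumptions rather than statements from the paper: sigma is taken to be the logistic sigmoid, the exploration bonus is taken to have the standard UCB1 form sqrt(2 ln t / n), and beta is assumed to scale that bonus multiplicatively. The function names are hypothetical and do not come from the companion software.

```python
import math


def stability_margin(i_cap: float, load: float) -> float:
    """M = I_cap - L, the LSSM stability margin as defined in the abstract."""
    return i_cap - load


def sigmoid(x: float) -> float:
    """Logistic sigmoid (ASSUMPTION: the abstract does not define sigma)."""
    # Branch on the sign of x so math.exp never receives a large positive argument.
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    e = math.exp(x)
    return e / (1.0 + e)


def mrer_beta(margin: float, m_max: float, beta_max: float) -> float:
    """beta = beta_max * sigma(M / M_max), following the abstract's formula."""
    return beta_max * sigmoid(margin / m_max)


def ucb_index(mean_reward: float, pulls: int, t: int, beta: float) -> float:
    """ASSUMPTION: beta multiplies a UCB1-style bonus; requires pulls >= 1, t >= 1."""
    return mean_reward + beta * math.sqrt(2.0 * math.log(t) / pulls)


if __name__ == "__main__":
    # Illustrative values only; they are not taken from the paper.
    for load in (0.2, 0.5, 0.9):
        m = stability_margin(i_cap=1.0, load=load)
        b = mrer_beta(m, m_max=1.0, beta_max=2.0)
        print(f"load={load:.1f} margin={m:.1f} beta={b:.3f}")
```

Under these assumptions, a shrinking margin (load approaching I_cap) lowers beta and therefore the exploration bonus, which is the qualitative behaviour the abstract attributes to MRER.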

Files

lssm_policy_layer_v1_1_0-2.pdf (40.1 kB), md5:ba67943d651dc203d25dd33fda64718a

Additional details

Resource type: Preprint
Publisher: Zenodo
Languages: English

License: Creative Commons Attribution 4.0 International

The Creative Commons Attribution license allows re-distribution and re-use of a licensed work on the condition that the creator is appropriately credited.

Technical metadata

Created: March 13, 2026
Modified: March 15, 2026