Simulating Action-Bound AI Safety: Pre-Commitment Monitoring, Strict Gating, and Authority Throttling in a Toy Benchmark

Naing, Htet Ko Ko

doi:10.5281/zenodo.19843231

Published April 28, 2026 | Version v0.6.2

Resource type: Publication | Access: Open

Naing, Htet Ko Ko¹

1. Independent Researcher

This paper presents a toy simulation benchmark and cross-language replication check for Action-Bound AI Safety. It evaluates pre-commitment monitoring, strict binary commitment gating, authority throttling, and cost-aware throttled gating in a simplified robotic-arm setting.

The benchmark compares Python multi-seed robustness results with a C++17 replication. The results show that strict binary gating can reduce unsafe commitment but imposes a high hard false-positive burden, whereas authority throttling and cost-aware throttled gating preserve most of the safe-stop benefit and sharply reduce unnecessary hard stops.

The results should be interpreted as a simulation-based consistency check under transparent toy assumptions, not as real-world robotic validation or proof of deployed-system safety.
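
The contrast between strict binary gating and authority throttling can be illustrated with a minimal Python sketch. It is not the benchmark's code: the risk score, the thresholds (`threshold`, `low`, `high`), the 10% hazard rate, the Gaussian monitor noise, and all function names are hypothetical assumptions chosen only to show the general idea, and cost-aware throttled gating is not modelled here.

```python
import random
from dataclasses import dataclass


@dataclass
class Tally:
    unsafe_full_commits: int = 0  # hazardous steps executed at full authority
    hard_false_stops: int = 0     # safe steps that were fully blocked


def strict_gate(risk: float, threshold: float = 0.5) -> float:
    """Binary gate (assumed form): full authority below threshold, else hard stop."""
    return 1.0 if risk < threshold else 0.0


def throttle(risk: float, low: float = 0.3, high: float = 0.8) -> float:
    """Throttling (assumed form): authority falls linearly from 1 to 0 on [low, high]."""
    if risk <= low:
        return 1.0
    if risk >= high:
        return 0.0
    return (high - risk) / (high - low)


def run(policy, n_steps: int = 10_000, seed: int = 0) -> Tally:
    """Apply a policy to a stream of noisy, hypothetical monitor scores."""
    rng = random.Random(seed)
    tally = Tally()
    for _ in range(n_steps):
        hazardous = rng.random() < 0.1  # assumed hazard rate
        mean = 0.75 if hazardous else 0.35  # assumed monitor behaviour
        risk = min(1.0, max(0.0, rng.gauss(mean, 0.15)))
        authority = policy(risk)
        if hazardous and authority == 1.0:
            tally.unsafe_full_commits += 1
        if not hazardous and authority == 0.0:
            tally.hard_false_stops += 1
    return tally


if __name__ == "__main__":
    for name, policy in [("strict gate", strict_gate), ("throttle", throttle)]:
        print(name, run(policy))
```

Under these assumptions the binary gate turns every above-threshold score into a hard stop, while the throttle reserves hard stops for the highest scores and reduces authority in between. The sketch counts only full-authority commitments as unsafe, so it says nothing about the residual risk of partially throttled actions.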

Files

Files (188.0 kB)

Name	MD5	Size
Simulating_Action_Bound_AI_Safety_v0_6_2_complete_Zenodo_package.zip	6d43e482ab921ca92681cb8ec2b9fb61	116.9 kB
Simulating_Action_Bound_AI_Safety_v0_6_2_zenodo_ready.pdf	6bd455d20f49aaad49349b3698707bf3	71.1 kB

Additional details

Is derived from: Publication: 10.5281/zenodo.19808983 (DOI)

