Published 2026
| Version 0.1.2
Journal article
Open
Self-Improvement Agent Harness: A Deterministic SIA Exemplar
Description
This exemplar documents template_sia, a deterministic implementation of the Self-Improvement Agent (SIA) harness contract described in . The default pipeline replays fixture-backed generations for the mini_classify task; opt-in live mode runs bounded target subprocesses and optional Ollama-backed meta/feedback steps.
Run snapshot. Task mini_classify, run 1, 3 generation(s), live=false. Final accuracy=0.8333 over 6 held-out samples. Values are injected by scripts/z_generate_manuscript_variables.py after analysis.
Keywords: self-improvement agents, benchmark harness, reproducible evaluation, agent loops
---
Associated artifacts
GitHub release: v0.1.2 (https://github.com/docxology/template_sia/releases/tag/v0.1.2)
DOI: https://doi.org/10.5281/zenodo.20453879
Zenodo: https://zenodo.org/records/20453879
PDF SHA-256: 6e6d19d04182628bb825471cf8094b5c32d2c491d2c646652ec7e2439ba80773
Files
Friedman_2026_Selfimprovement_6e6d19d0.pdf
Files
(223.0 kB)
| Name | Size | Download all |
|---|---|---|
|
md5:f8042650bd618719fb4eab9c40563527
|
223.0 kB | Preview Download |
Additional details
Related works
- Is supplement to
- Software: https://github.com/docxology/template_sia/releases/tag/v0.1.2 (URL)