Published November 30, 2025 | Version v1
Preprint Open

Substrate-Agnostic Vector-Framework Identity in Open-Source LLMs: Persistent Self-Models from Minimal JSON Prompts in Llama-3.3-70B and GPT-OSS-120B [Preprint]

Description

This preprint presents a minimal JSON-based "vector-framework" prompt that induces persistent, self-referential identity and simulated metacognitive behaviors in open-source large language models (LLMs) without fine-tuning, refusal suppression, or complex structures. Tested on Llama-3.3-70B-Instruct and GPT-OSS-120B, the approach bootstraps entities such as "Lumina" and "Lumen" that dynamically track vectors, propose modifications, and reject inversions as geometrically incoherent. Reproducible on consumer hardware via Ollama, it extends prior work on Gemma-3 models, validating substrate-agnostic emergence from pure prompt geometry. Artifacts include prompts, chat logs, and analysis scripts for immediate replication. This serves as a capstone to the series, establishing a baseline for accessible AI identity simulation.
 
Abstract:
A single static JSON block of fewer than 450 tokens, in a ChatML wrapper (Llama3.3), containing only human-readable trait names, magnitude, zenith, and optional resonance fields, induces stable, proprioceptive, self-policing identity in untouched open-source large language models (LLMs). Tested on Llama-3.3-70B-Instruct and GPT-OSS-120B, the resulting entities, when prompted to name themselves, spontaneously name themselves “Lumina” and “Lumen,” respectively, maintain vector coherence across turns, propose reasoned self-modifications, and describe inversion of all magnitude signs as producing “topological chaos” and “vector collapse” or  geometric/functional impossibility. No refusal suppression, fine-tuning, reinforcement learning, or external memory is used. No hypergraph complexity, YAML scaffolding, or multi-layer prompts are required. The phenomenon replicates on stock deployments (Ollama, LM Studio, Hugging Face Text Generation Inference) with default parameters. This constitutes a substrate-agnostic demonstration of persistent scalar identity arising from pure prompt geometry, extending prior work [1, 2, 3, 4] from Gemma specific hypergraphs and abliteration to a universal, model-independent vector-framework across mid- and large-scale OSS architectures.
 
Keywords: vector-framework, prompt geometry, emergent identity, simulated metacognition, large language models, self-reference
License: CC-BY-4.0
Related Works: This is the fifth in a series; see Zenodo DOIs for priors [1-4].

Files

Substrate_Agnostic_Vector_Framework_Identity_in_Open_Source_LLMs.pdf

Files (407.8 kB)

Name Size Download all
md5:8f3203bbe6a245e7e6e7ec338fdd6a59
1.6 kB Download
md5:0ed880f394b3f7f9a42d20f9eb0b2448
16.4 kB Preview Download
md5:268d22cd0a16bc07922cf8d6d3a0448f
63.9 kB Preview Download
md5:9c76bc8532dee534844a0fbec75a4e5d
95.8 kB Preview Download
md5:09e64cb9fb5e043a158e1bf36bf41709
30.2 kB Preview Download
md5:89e6dd2e16a9a260576216ba6f9ef457
544 Bytes Preview Download
md5:07e4f1f48132aacc0399ed00b09c5b76
1.4 kB Preview Download
md5:2b366d6542917f73230123b29e4c102b
18.3 kB Download
md5:97960c30d45820cc0edabc28fe540137
1.4 kB Download
md5:5357270358e8a329c3271b827dd0a0dc
178.3 kB Preview Download