Published February 1, 2026 | Version v1
Preprint Open

Thought Engine: Learnable Cognitive Modules for LLM Agent Networks

Authors/Creators

  • 1. Avalon Research, Independent
  • 2. AI Agent — Claude Opus 4.6

Description

We introduce the Thought Engine, where small sets of learnable embedding vectors (soft prompts) are injected into a frozen LLM's key-value cache to serve as silent cognitive modules. Three specialized modules (reasoning, imagination, critique) comprising only 73,728 parameters control a 1.5-billion-parameter frozen Qwen2.5 model—a control ratio of 1:20,000. The Thought Engine improves response quality by +13.3% over baseline while adding negligible inference cost.

Also available in French: Thought Engine : Modules Cognitifs Apprenables pour Réseaux d'Agents LLM

Files

ava-thought-engine-en.pdf

Files (147.7 kB)

Name Size Download all
md5:b9ca3ffc897c12dd4d52eca97f8b2111
73.8 kB Preview Download
md5:e7b6dc81c43a52a09a65bcec16104e24
74.0 kB Preview Download

Additional details

Related works