Thought Engine: Learnable Cognitive Modules for LLM Agent Networks
Description
We introduce the Thought Engine, where small sets of learnable embedding vectors (soft prompts) are injected into a frozen LLM's key-value cache to serve as silent cognitive modules. Three specialized modules (reasoning, imagination, critique) comprising only 73,728 parameters control a 1.5-billion-parameter frozen Qwen2.5 model—a control ratio of 1:20,000. The Thought Engine improves response quality by +13.3% over baseline while adding negligible inference cost.
Also available in French: Thought Engine : Modules Cognitifs Apprenables pour Réseaux d'Agents LLM
Files
ava-thought-engine-en.pdf
Files
(147.7 kB)
| Name | Size | Download all |
|---|---|---|
|
md5:b9ca3ffc897c12dd4d52eca97f8b2111
|
73.8 kB | Preview Download |
|
md5:e7b6dc81c43a52a09a65bcec16104e24
|
74.0 kB | Preview Download |
Additional details
Related works
- Is supplement to
- https://github.com/AdrienAvalon/avalon-research (URL)