There is a newer version of the record available.

Published February 5, 2026 | Version v3
Preprint Open

Statistical Mechanics of Large Language Models: Free Energy, Order Parameters, and Collective Behavior

Authors/Creators

Description

This paper develops a statistical-mechanical framework for understanding the collective behavior of large language models (LLMs). Building on companion work in topology, information geometry, and many-body physics, the study interprets internal representations as configurations in an empirical ensemble and analyzes their macroscopic organization through free energy, order parameters, entropy, and correlation structure.

The paper introduces effective free-energy functionals for internal representations, clarifies how modular specialization and rank collapse emerge as phase-like transitions, and defines order parameters that distinguish weakly and strongly specialized regimes. The curvature of empirical scaling laws is interpreted through free-energy saturation, while the geometry of the effective energy landscape is analyzed via its Hessian spectrum, revealing flat directions and reduced effective dimensionality.

The appendix provides a unified treatment of entropy, Bayesian evidence, partition functions, and singular learning theory, showing how Fisher degeneracy, spectral collapse, and Watanabe’s asymptotic results naturally align with statistical-mechanical principles. Together, these results offer a coherent theoretical framework for interpreting the structural organization of LLMs and for guiding future research on scaling, architecture, and representation geometry.

Files

llm_statistical_mechanics.pdf

Files (292.4 kB)

Name Size Download all
md5:2037468252fcefa9893bf6a808ebdf0a
275.4 kB Preview Download
md5:c7f5d0cf1a15c6572ac34e9a0a4e371e
17.1 kB Preview Download