There is a newer version of the record available.

Published December 19, 2025 | Version v3
Preprint Open

DAWN: Dynamic Subspace Composition for Structural Interpretability

Authors/Creators

  • 1. Independent Researcher

Description

DAWN replaces static weight matrices with dynamic subspace composition over shared neuron pools. Each token dynamically constructs its own projection matrix by selecting sparse neuron subsets, enabling structural interpretability through traceable routing paths.

With 39M parameters, DAWN achieves comparable performance to a 40M vanilla Transformer (perplexity 42.2 vs 41.2) on C4 (5B tokens) while providing interpretable internal structure. Analysis reveals emergent neuron specialization: Query/Key neurons develop distinct roles (correlation r=-0.75), and neurons align with part-of-speech categories without explicit supervision.

This upload contains the paper (PDF) and pretrained model weights (PyTorch).

Code: https://github.com/madst0614/DAWN

Files

dawn.pdf

Files (158.3 MB)

Name Size Download all
md5:502e09d7403e17d5518346fa9a6c179d
1.2 MB Preview Download
md5:e0f59ba273ff3759bcdc9a9c4aa699fd
157.0 MB Download

Additional details

Software