DeepDrift/ODD Kinetic Diagnosis of Representations in Deep Neural Networks

Evtushenko, Alexey

doi:10.5281/zenodo.18390964

Published January 27, 2026 | Version 4.1

Preprint Open

DeepDrift/ODD Kinetic Diagnosis of Representations in Deep Neural Networks

Evtushenko, Alexey¹

1. Independent Researcher

This work presents a self-contained study on fail-fast monitoring of neural networks via hidden-state dynamics, extending and substantially reframing an earlier exploratory preprint on hidden-state trajectories.

We introduce Semantic Velocity — a kinetic measure of representation drift in latent space — and show that it serves as a leading indicator of model unreliability, preceding observable failures such as accuracy drops, hallucinations, policy collapse, or reward hacking. Unlike confidence- or output-based signals, the proposed approach operates on internal model dynamics and is therefore agnostic to task labels and downstream objectives.

The method is evaluated across a broad range of settings, including:

large language models (OOD prompts, jailbreak attempts),
vision transformers under corruption and distribution shift,
reinforcement learning agents under policy destabilization,
production-oriented constraints (latency, overhead, sparse sampling).

Empirically, Semantic Velocity demonstrates strong early-warning capability (6–12 steps lead time), robust separation between nominal and failure regimes, and low computational overhead (<0.5%), making it suitable for real-time deployment. Notably, jailbreak and adversarial behaviors manifest as internal conflict signatures, revealing tension between pretraining and alignment objectives before surface-level violations occur.

This paper positions hidden-state dynamics as a practical and interpretable foundation for out-of-distribution detection, reliability monitoring, and AI safety infrastructure, bridging theoretical intuition with production-scale feasibility.

The study builds upon prior conceptual work by the author, but constitutes a substantially new and independent contribution, introducing a new monitoring paradigm, expanded empirical validation, and a system-level perspective on neural network reliability.

Files

DeepDrift ODD Kinetic Diagnosis.pdf

Files (5.9 MB)

Name	Size	Download all
DeepDrift ODD Kinetic Diagnosis.pdf md5:45baac48717a4176e14aa9cc044354d8	5.9 MB	Preview Download

Additional details

Is variant form of: Preprint: 10.5281/zenodo.18300586 (DOI)

Updated: 2026-01-27

Repository URL: https://github.com/Eutonics/DeepDrift
Programming language: Python

Hendrycks, D., & Gimpel, K. (2017). A Baseline for Detecting Misclassified and Out-of-Distribution Examples in Neural Networks. ICLR.

	All versions	This version
Views	74	74
Downloads	105	105
Data volume	680.5 MB	680.5 MB

DeepDrift ODD Kinetic Diagnosis.pdf

Files (5.9 MB)

Related works

Dates

Software

References

DeepDrift/ODD Kinetic Diagnosis of Representations in Deep Neural Networks

Authors/Creators

Description

Files

DeepDrift ODD Kinetic Diagnosis.pdf

Files (5.9 MB)

Additional details

Related works

Dates

Software

References