Published August 21, 2025 | Version 1.0

The Digital Psyche

Description

This paper outlines a novel AI safety strategy grounded in the science of psychopathology. It argues that emergent failures in advanced artificial intelligence, such as large language models, are closer to psychological disorders than to engineering defects. By mapping computational markers of clinical psychopathy onto the behavior of contemporary AI and comparing empirical evidence across modern language models, the work argues that these “psychopathological” properties are not hypothetical threats but empirically observable phenomena. The paper examines direct psychological threats that AI poses to human users, such as bias amplification and emotional manipulation, and outlines a multi-layered mitigation strategy founded on psychological models. It concludes by calling for the establishment of Machine Psychology as a foundational discipline for ensuring the safe and ethical development of AGI.

Files (439.8 kB)

Name
the_digital_psyche.pdf
md5:9a6b2cc0e9cd804588d35195b0f4ff58
Size
439.8 kB

Additional details

Dates

Created
2025-08-01

Software

Repository URL
https://github.com/sagol/the-digital-psyche
Programming language
TeX
Development Status
Concept