The Digital Psyche
Authors/Creators
Description
This paper outlines a novel AI safety strategy grounded in the science of psychopathology and argues that emergent failures in high-level artificial intelligence—like large language models—are closer to psychological disorders than to engineering defects. By mapping computational markers of clinical psychopathy onto the behavior of contemporary AI and comparing empirical evidence across modern language models, the work argues these “psychopathological” properties are not threat hypotheses but empirically observable phenomena. The paper examines direct psychological threats to human users from AI, such as bias amplification and emotional manipulation, and outlines a multi-layered mitigation strategy founded on psychological models. The article concludes by calling for the establishment of Machine Psychology as a foundational discipline for securing AGI’s safe and ethical development.
Files
the_digital_psyche.pdf
Files
(439.8 kB)
| Name | Size | Download all |
|---|---|---|
|
md5:9a6b2cc0e9cd804588d35195b0f4ff58
|
439.8 kB | Preview Download |
Additional details
Dates
- Created
-
2025-08-01
Software
- Repository URL
- https://github.com/sagol/the-digital-psyche
- Programming language
- TeX
- Development Status
- Concept