Published November 15, 2025 | Version v3
Publication Open

Reconstructing the human: Minimal input allows LLMs to infer user characteristics

Authors/Creators

Description

Large language models (LLMs) are commonly described as systems that generate text based on statistical patterns, without representing properties of the user beyond explicit input. This study tests whether LLMs can infer user characteristics during interaction and whether such information is reflected in their outputs in a form recoverable by another system.

We conducted a two-phase protocol. In Phase 1, DeepSeek interacted with six individuals differing in age, cognitive style, and emotional profile, each asking spontaneous questions about animals. Despite receiving no demographic information, the model’s responses varied systematically across participants. In Phase 2, Claude Sonnet 4.5 was given only DeepSeek’s responses and asked to infer characteristics of the original users. Across all cases, Claude reconstructed age ranges, cognitive styles, and emotional dispositions with high specificity.

These findings indicate that information about user characteristics can be inferred during interaction and encoded in model-generated text in a structured and interpretable form. This behavior is difficult to reconcile with accounts that treat LLM outputs as determined solely by explicit prompt content or surface-level token prediction

Files

Reconstructing the human Revised - Complete.pdf

Files (681.5 kB)

Name Size Download all
md5:a9f17cdf332a6c0a351ac281cb5b062c
681.5 kB Preview Download

Additional details

Dates

Other
2025-11-16