Can AI fool a forensic linguist? Detecting AI impersonation of an individual's language

Nini, Andrea; Zeng, Baoyi

doi:10.5281/zenodo.15633033

Published June 6, 2025 | Version v1

Presentation Open

Can AI fool a forensic linguist? Detecting AI impersonation of an individual's language

1. University of Manchester

With the rapid advancement of large language models (LLMs), concerns have grown regarding their potential misuse, particularly in impersonating individuals through mimicked writing styles. This study explores whether state-of-the-art LLMs, when guided by various prompting techniques, can convincingly imitate an individual’s linguistic identity and potentially deceive forensic authorship verification methods. Using GPT-4o as the model and the Enron email dataset as the corpus, we evaluated four prompting strategies: naïve direct prompting, system-user prompting, self-prompting, and Tree-of-Thoughts (ToT) prompting (Chen and Moscholios, 2024). We assessed generated outputs using multiple authorship verification tools, including n-gram tracing, the Impostors method, LambdaG (Nini et al., 2024), and AdHominem (Boenninghoff et al., 2019). Our findings indicate that while LLMs can approximate an individual’s writing style on a superficial level, forensic linguistic methods remain effective, especially when content-related lexical cues are masked. The results underscore the resilience of authorship verification techniques and suggest that authentic linguistic individuality remains difficult to convincingly replicate by prompting an advanced language model.

Files

symposium.pdf

Files (850.1 kB)

Name	Size	Download all
symposium.pdf md5:14835944956e8ade59967333c2642ebb	850.1 kB	Preview Download

Additional details

Boenninghoff, B., Hessler, S., Kolossa, D., & Nickel, R. M. (2019, December). Explainable authorship verification in social media via attention-based similarity learning. In 2019 IEEE international conference on big data (Big Data) (pp. 36-45). IEEE.
Chen, Z., & Moscholios, S. (2024). Using Prompts to Guide Large Language Models in Imitating a Real Person's Language Style. arXiv preprint arXiv:2410.03848.
Nini, A., Halvani, O., Graner, L., Gherardi, V., & Ishihara, S. (2024). Authorship verification based on the likelihood ratio of grammar models. arXiv preprint arXiv:2403.08462.

	All versions	This version
Views	130	130
Downloads	73	73
Data volume	72.3 MB	72.3 MB

Can AI fool a forensic linguist? Detecting AI impersonation of an individual's language

Authors/Creators

Description

Files

symposium.pdf

Files (850.1 kB)

Additional details

References