Tracking Linguistic Shifts in Russian Propaganda: A Diachronic Analysis Using KLD
Description
Language, being a powerful tool, can be manipulated by malicious actors to shape public opinion. In this study, we examine the diachronic evolution of Russian propaganda about the Russo-Ukrainian War. Since linguistic shifts are shaped by social, political, and cultural factors, analyzing linguistic change in propaganda helps advance media literacy and enhance disinformation detection. This is evident in the evolving nature of fake news (Adriani, 2019), with Russian propaganda serving as a key example used to justify the invasion and mobilize public support (Solopova et al., 2023).
We use Kullback-Leibler Divergence (KLD; Kullback & Leibler, 1951) as our method, which offers an interpretable way to identify distinctive linguistic features. In previous work, we have demonstrated that KLD is suitable for detecting divergences in propaganda strategies between state-controlled and social media covering the Russo-Ukrainian War (Vestel & Degaetano-Ortlieb, 2025). This study extends prior work by applying KLD to edits in a Russian Wikipedia Fork (RWFork; Trokhymovych et al., 2025), thus adding another text type and a diachronic dimension to the analysis.
RWFork, created in June 2023, is a copy of Russian Wikipedia revised to comply with Russia’s legislation (Cohen, 2023). We use Trokhymovych et al.'s (2025) dataset, containing edits from 1.9M page titles between May and September 2023; most of the changes involve knowledge manipulation linked to the 2022 Russian invasion of Ukraine. KLD is employed to identify contrasting language between Russian Wikipedia and RWFork. Specifically, KLD quantifies the divergence between two probability distributions by measuring how many more bits of information are needed to represent one using the other, highlighting distinctive features (e.g., words) contributing to linguistic differences.
We hypothesize that differences between Russian Wikipedia and RWFork resemble those between state and social media. In particular, direct terms like war, distinctive for social media, are often replaced by euphemisms in state media (Vestel & Degaetano-Ortlieb, 2025); similarly, Trokhymovych et al. (2025) have shown that the words war and invasion are removed from RWFork. Moreover, RWFork replaces names of occupied Ukrainian regions with Kremlin-aligned terms, which is also characteristic of state media (Trokhymovych et al., 2025; Vestel & Degaetano-Ortlieb, 2025). However, since Wikipedia is stylistically different from both media types, we also expect to gain insights into the specificities of Russian propaganda in this genre.
Files
Abstract_Vestel_Degaetano-Ortlieb.pdf
Files
(679.1 kB)
| Name | Size | Download all |
|---|---|---|
|
md5:1146279a7745ddf7aeaa390270363d4b
|
181.3 kB | Preview Download |
|
md5:7a6df2357e9b18deed64917a1485de65
|
497.7 kB | Preview Download |
Additional details
Funding
Dates
- Submitted
-
2025-09-24
- Accepted
-
2025-10-31
References
- Adriani, R. (2019). The Evolution of Fake News and the Abuse of Emerging Technologies. European Journal of Social Sciences, 2(1), Article 1. https://doi.org/10.26417/ejss-2019.v2i1-53
- Cohen, N. (2023, July 12). Russian Wikipedia's Top Editor Leaves to Launch a Putin-Friendly Clone. Bloomberg.Com. https://www.bloomberg.com/news/articles/2023-07-12/russian-wikipedia-editor-leaves-to-launch-a-putin-friendly-clone
- Kullback, S., & Leibler, R. A. (1951). On Information and Sufficiency. The Annals of Mathematical Statistics, 22(1), Article 1. https://doi.org/10.1214/aoms/1177729694
- Solopova, V., Benzmüller, C., & Landgraf, T. (2023). The Evolution of Pro-Kremlin Propaganda From a Machine Learning and Linguistics Perspective. In M. Romanyshyn (Ed.), Proceedings of the Second Ukrainian Natural Language Processing Workshop (UNLP) (pp. 40–48). Association for Computational Linguistics. https://doi.org/10.18653/v1/2023.unlp-1.5
- Trokhymovych, M., Kosovan, O., Forrester, N., Aragón, P., Saez-Trumper, D., & Baeza-Yates, R. (2025). Characterizing Knowledge Manipulation in a Russian Wikipedia Fork (No. arXiv:2504.10663). arXiv. https://doi.org/10.48550/arXiv.2504.10663
- Vestel, A., & Degaetano-Ortlieb, S. (2025). From War to Special Military Operation: Interpretable Detection of Linguistic Propaganda Framing in Russian Media. Workshop Proceedings of the 19th International AAAI Conference on Web and Social Media, 2025, 50. https://doi.org/10.36190/2025.50