Published 2025 – 2026 | Version v2
Journal article Open

The Genzie Code: A Diachronic and Multi-Modal Analysis of the Structure, and Social Function of Gen-Z Digital Language

  • 1. ROR icon National Research University Higher School of Economics

Description

The emergence of a distinct Gen-Z sociolect, often termed "Genzie" or "Internet Slang," represents one of the most rapid and transformative linguistic developments of the digital age. This language is not a random collection of slang but a complex, rule-governed system born from the intersection of technology, social change, and identity formation. A comprehensive, data-driven analysis of its historical evolution, structural properties, and socio-pragmatic functions is critical to understanding contemporary communication, as it reflects fundamental shifts in how a generation conceptualizes interaction, community, and self-expression. This study aims to deconstruct the Gen-Z sociolect by tracing its historical development over a key 36-month period (2021-2023), analyzing its core structural components (lexical, semantic, syntactic, multimodal), and explaining its social functions within digital communities. The research seeks to move beyond anecdotal description to provide a rigorous, empirical account of this dynamic linguistic phenomenon, thereby establishing a benchmark for the academic study of internet-native dialects. We position this sociolect not as a degradation of Standard English, but as a legitimate linguistic innovation worthy of serious scholarly attention, with its own internal logic and systemic coherence. A mixed-methods, diachronic approach was employed, integrating the scale of computational linguistics with the nuance of qualitative discourse analysis. A large-scale corpus of approximately 6000 posts was compiled from three core platforms—Twitter/X, Instagram, and TikTok—across the 12-month timeframe, ensuring a representative sample of public-facing Gen-Z communication. Computational linguistics methods were used for quantitative analysis, including time-series modeling for lexical diffusion, diachronic word embeddings for semantic shift, and supervised machine learning for stylometric identification. This was complemented by qualitative discourse and pragmatic analysis of a stratified sample of posts to understand language-in-use, focusing on the interplay between text, image, and platform-specific conventions. The analysis reveals a clear, platform-influenced historical trajectory for Gen-Z language, with terms originating on niche, visually-driven forums like TikTok and Twitch before achieving mass diffusion on the text-centric environment of Twitter and finally being normalized on the broader social canvas of Instagram. We identified and modeled three primary mechanisms of lexical creation: neologism (e.g., "skibidi," "gyatt"), semantic reappropriation (e.g., "cap," "based," "fire"), and phono-semantic matching from online cultures (e.g., "ratio," "L + RIP bozo"). Gen-Z language is a legitimate and sophisticated dialect of the digital era, a natural linguistic adaptation to a hyper-connected, attention-economy-driven world. Its evolution is not chaotic but follows predictable patterns of cultural transmission that are dramatically amplified and accelerated by social media algorithms. Its structure efficiently manages cognitive load in fast-paced digital environments while its primary functions are the performance of a specific digital identity, the creation and policing of digital community boundaries, and a form of resistance to traditional linguistic and social norms.

Files

Files (30.8 kB)

Name Size Download all
md5:138279a2d813b21686c81a5389172e3d
30.8 kB Download

Additional details

References

  • Blank, A. (1999). Why do new meanings occur? A cognitive typology of the motivations for lexical semantic change. In A. Blank & P. Koch (Eds.), Historical Semantics and Cognition. De Gruyter. boyd, d. (2010). Social network sites as networked publics: Affordances, dynamics, and implications. In Z. Papacharissi (Ed.), A Networked Self. Routledge. Brown, P., & Levinson, S. C. (1987). Politeness: Some universals in language usage. Cambridge University Press. Chen, M. (1972). The time dimension: Contribution toward a theory of sound change. Foundations of Language, 8(4), 457–498. Eckert, P., & McConnell-Ginet, S. (1992). Think practically and look locally: Language and gender as community-based practice. Annual Review of Anthropology, 21, 461-490. Eisenstein, J. (2013). What to do about bad language on the internet. Proceedings of the 2013 Conference of the North American Chapter of the Association for Computational Linguistics. Kress, G., & van Leeuwen, T. (2001). Multimodal discourse: The modes and media of contemporary communication. Arnold.