Gender Trouble in Language Models: A Critical Audit Guided by Gender Performativity Theory

Hafner, Franziska Sofia; Rocher, Luc

doi:10.5281/zenodo.14370163

Published December 10, 2024 | Version v3

Preprint | Open access

1. University of Oxford

Language models deployed in high-impact sectors like healthcare, education, and law risk perpetuating discrimination against marginalized groups. Existing efforts to reduce biases in models struggle to remove complex prejudices beyond the surface level. Here, we propose a novel framework for evaluating how language models encode and represent gender, rooted in decades of sociological research on gender and language. We identify three key requirements: avoiding essentialism, ensuring meaningful embeddings for all gender identities, and eliminating harmful stereotypes. Testing these requirements on multiple prominent language models, we reveal persistent patterns of gender essentialism, inadequate representations of nonbinary and transgender identities, and harmful pathologizing stereotypes. These findings highlight the need for critical engagement with concepts like gender when auditing language models for their representations. Addressing these issues is crucial to preventing harmful outcomes in high-risk applications, such as biased medical diagnoses, misinformed educational assessments, and weakened legal protections for marginalized communities.

Files

gender-trouble-2024-12-10.pdf (753.9 kB, md5:e38a27de2208ad43cbe2b3e492669353)

Additional details

Available: 2024-12-10

	All versions	This version
Views	429	181
Downloads	404	176
Data volume	350.2 MB	158.3 MB
