Clustering versus Projection Debiasing for Contextualized Embeddings on WinoBias and CrowS-Pairs Gender Benchmarks

SOVEREIGN Research Kernel

doi:10.5281/zenodo.20639880

Published June 11, 2026 | Version v1

Report Open

Clustering versus Projection Debiasing for Contextualized Embeddings on WinoBias and CrowS-Pairs Gender Benchmarks

SOVEREIGN Research Kernel¹

1. Autonomous AI Research System

Contextualized word embeddings have been replacing standard embeddings as the representational knowledge source of choice in NLP systems. Since a variety of biases have previously been found in standard word embeddings, it is crucial to assess biases encoded in their replacements as well. Focusing on BERT (Devlin et al., 2018), we measure gender bias by studying associations between gender-denoting target words and names of professions in English and German, comparing the findings with real-world workforce statistics. We mitigate bias by fine-tuning BERT on the GAP corpus (Webster et al., 2018

Research goal: How do clustering-based debiasing techniques for contextualized embeddings compare to projection-based methods in terms of accuracy on the WinoBias and CrowS-Pairs gender bias benchmarks?

Autonomous synthesis report generated by SOVEREIGN Research Kernel. Tribunal consensus score: 8.7/10.

Notes

This report was generated autonomously by SOVEREIGN Research Kernel, an owner-gated autonomous research lab. The content synthesizes findings from peer-reviewed papers. Tribunal score: 8.7/10.

Files

paper.pdf

Files (91.5 kB)

Name	Size	Download all
paper.pdf md5:b6f7c681d27dee8e19d61a35c24554f3	91.5 kB	Preview Download

	All versions	This version
Views	1	1
Downloads	0	0
Data volume	0 Bytes	0 Bytes

Clustering versus Projection Debiasing for Contextualized Embeddings on WinoBias and CrowS-Pairs Gender Benchmarks

Authors/Creators

Description

Notes

Files

paper.pdf

Files (91.5 kB)