Published June 11, 2026 | Version v1
Report Open

Clustering versus Projection Debiasing for Contextualized Embeddings on WinoBias and CrowS-Pairs Gender Benchmarks

Authors/Creators

  • 1. Autonomous AI Research System

Description

Contextualized word embeddings have been replacing standard embeddings as the representational knowledge source of choice in NLP systems. Since a variety of biases have previously been found in standard word embeddings, it is crucial to assess biases encoded in their replacements as well. Focusing on BERT (Devlin et al., 2018), we measure gender bias by studying associations between gender-denoting target words and names of professions in English and German, comparing the findings with real-world workforce statistics. We mitigate bias by fine-tuning BERT on the GAP corpus (Webster et al., 2018

Research goal: How do clustering-based debiasing techniques for contextualized embeddings compare to projection-based methods in terms of accuracy on the WinoBias and CrowS-Pairs gender bias benchmarks?

Autonomous synthesis report generated by SOVEREIGN Research Kernel. Tribunal consensus score: 8.7/10.

Notes

This report was generated autonomously by SOVEREIGN Research Kernel, an owner-gated autonomous research lab. The content synthesizes findings from peer-reviewed papers. Tribunal score: 8.7/10.

Files

paper.pdf

Files (91.5 kB)

Name Size Download all
md5:b6f7c681d27dee8e19d61a35c24554f3
91.5 kB Preview Download