Book section Open Access

Linguistic Bias in Crowdsourced Biographies: A Cross-lingual Examination

Jahna Otterbacher; Ioannis Katakis; Pantelis Agathangelou

Dublin Core Export

<?xml version='1.0' encoding='utf-8'?>
<oai_dc:dc xmlns:dc="" xmlns:oai_dc="" xmlns:xsi="" xsi:schemaLocation="">
  <dc:creator>Jahna Otterbacher</dc:creator>
  <dc:creator>Ioannis Katakis</dc:creator>
  <dc:creator>Pantelis Agathangelou</dc:creator>
  <dc:description>Biographies make up a significant portion of Wikipedia entries and are a source of information and inspiration for the public. We examine a threat to their objectivity, linguistic biases, which are pervasive in human communication. Linguistic bias, the systematic asymmetry in the language used to describe people as a function of their social groups, plays a role in the perpetuation of stereotypes. Theory predicts that we describe people who are expected – because they are members of our own in-groups or are stereotype-congruent – with more abstract, subjective language, as compared to others. Abstract language has the power to sway our impressions of others as it implies stability over time. Extending our monolingual work, we consider biographies of intellectuals at the English- and Greek-language Wikipedias. We use our recently introduced sentiment analysis tool, DidaxTo, which extracts domain-specific opinion words to build lexicons of subjective words in each language and for each gender, and compare the extent to which abstract language is used. Contrary to expectation, we find evidence of gender-based linguistic bias, with women being described more abstractly as compared to men. However, this is limited to English-language biographies. We discuss the implications of using DidaxTo to monitor linguistic bias in texts produced via crowdsourcing.

  <dc:description>This work has been partly supported by the project that has received funding from the European Union's Horizon 2020 research and innovation programme under grant agreement No 739578 (RISE – Call: H2020-WIDESPREAD-01-2016-2017-TeamingPhase2)  and the Government of the Republic of Cyprus through the Directorate General for European Programmes, Coordination and Development.

Electronic version of a book chapter article published as Multilingual Text Analysis Challenges, Models, and Approaches, 2019, 411–440,] © 2019 World Scientific Publishing Company, .</dc:description>
  <dc:publisher>World Scientific</dc:publisher>
  <dc:subject>Linguistic Bias</dc:subject>
  <dc:subject>Crowdsourced Biographies</dc:subject>
  <dc:title>Linguistic Bias in Crowdsourced Biographies: A Cross-lingual Examination</dc:title>
Views 157
Downloads 108
Data volume 50.0 MB
Unique views 136
Unique downloads 104


Cite as