Book section Open Access

Linguistic Bias in Crowdsourced Biographies: A Cross-lingual Examination

Jahna Otterbacher; Ioannis Katakis; Pantelis Agathangelou

Citation Style Language JSON Export

  "publisher": "World Scientific", 
  "DOI": "10.1142/9789813274884_0012", 
  "ISBN": "978-981-3274-87-7", 
  "container_title": "Multilingual Text Analysis Challenges, Models, and Approaches", 
  "language": "eng", 
  "title": "Linguistic Bias in Crowdsourced Biographies: A Cross-lingual Examination", 
  "issued": {
    "date-parts": [
  "abstract": "<p>Biographies make up a significant portion of Wikipedia entries and are a source of information and inspiration for the public. We examine a threat to their objectivity, linguistic biases, which are pervasive in human communication. Linguistic bias, the systematic asymmetry in the language used to describe people as a function of their social groups, plays a role in the perpetuation of stereotypes. Theory predicts that we describe people who are expected &ndash; because they are members of our own in-groups or are stereotype-congruent &ndash; with more abstract, subjective language, as compared to others. Abstract language has the power to sway our impressions of others as it implies stability over time. Extending our monolingual work, we consider biographies of intellectuals at the English- and Greek-language Wikipedias. We use our recently introduced sentiment analysis tool, DidaxTo, which extracts domain-specific opinion words to build lexicons of subjective words in each language and for each gender, and compare the extent to which abstract language is used. Contrary to expectation, we find evidence of gender-based linguistic bias, with women being described more abstractly as compared&nbsp;to men. However, this is limited to English-language biographies. We discuss the implications of using DidaxTo to monitor linguistic bias in texts produced via crowdsourcing.</p>\n\n<p>&nbsp;</p>", 
  "author": [
      "family": "Jahna Otterbacher"
      "family": "Ioannis Katakis"
      "family": "Pantelis Agathangelou"
  "page": "411\u2013440", 
  "note": "This work has been partly supported by the project that has received funding from the European Union's Horizon 2020 research and innovation programme under grant agreement No 739578 (RISE \u2013 Call: H2020-WIDESPREAD-01-2016-2017-TeamingPhase2)  and the Government of the Republic of Cyprus through the Directorate General for European Programmes, Coordination and Development.\n\nElectronic version of a book chapter article published as Multilingual Text Analysis Challenges, Models, and Approaches, 2019, 411\u2013440,] \u00a9 2019 World Scientific Publishing Company, .", 
  "version": "Accepted pre-print", 
  "type": "chapter", 
  "id": "2671672"