DIBIMT: A Novel Benchmark for Measuring Word Sense Disambiguation Biases in Machine Translation

Niccolò Campolungo*; Federico Martelli*; Franesco Saia; Roberto Navigli

doi:10.5281/zenodo.6625312

Published June 8, 2022 | Version v1

Conference paper Open

DIBIMT: A Novel Benchmark for Measuring Word Sense Disambiguation Biases in Machine Translation

1. Sapienza University of Rome
2. SSML Carlo Bo

Lexical ambiguity poses one of the greatest challenges in the field of Machine Translation. Over the last few decades, multiple efforts have been undertaken to investigate incorrect translations caused by the polysemous nature of words. Within this body of research, some studies have posited that models pick up semantic biases existing in the training data, thus producing translation errors. In this paper, we present DIBIMT, the first entirely manuallycurated evaluation benchmark which enables an extensive study of semantic biases in Machine Translation of nominal and verbal words in five different language combinations, namely, English and one or other of the following languages: Chinese, German, Italian, Russian and Spanish. Furthermore, we test state-of-the-art Machine Translation systems, both commercial and non-commercial ones, against our new test bed and provide a thorough statistical and linguistic analysis of the results. We release DIBIMT at https:// nlp.uniroma1.it/dibimt as a closed benchmark with a public leaderboard.

Files

dibimt_acl2022.pdf

Files (625.7 kB)

Name	Size	Download all
dibimt_acl2022.pdf md5:af6d143b0cf06ac42327dbb29d4d6141	625.7 kB	Preview Download

Additional details

European Commission
ELEXIS - European Lexicographic Infrastructure 731015

	All versions	This version
Views	132	132
Downloads	136	136
Data volume	88.8 MB	88.8 MB

DIBIMT: A Novel Benchmark for Measuring Word Sense Disambiguation Biases in Machine Translation

Authors/Creators

Description

Files

dibimt_acl2022.pdf

Files (625.7 kB)

Additional details

Funding