Integrating Weakly Supervised Word Sense Disambiguation into Neural Machine Translation

doi:10.5281/zenodo.2275709

Published December 14, 2018 | Version v1

Journal article Open

1. Nuance Communications
2. Idiap Research Institute
3. HEIG-VD / HES-SO

This paper demonstrates that word sense disambiguation (WSD) can improve neural machine translation (NMT) by widening the source context considered when modeling the senses of potentially ambiguous words. We first introduce three adaptive clustering algorithms for WSD, based on k-means, Chinese restaurant processes, and random walks, which are then applied to large word contexts represented in a low-rank space and evaluated on SemEval shared-task data. We then learn word vectors jointly with sense vectors defined by our best WSD method, within a state-of-the-art NMT system. We show that the concatenation of these vectors, and the use of a sense selection mechanism based on the weighted average of sense vectors, outperforms several baselines including sense-aware ones. This is demonstrated by translation on five language pairs. The improvements are above one BLEU point over strong NMT baselines, +4% accuracy over all ambiguous nouns and verbs, or +20% when scored manually over several challenging words.
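As a rough illustration of the two ideas named in the abstract (a weighted average of sense vectors, concatenated with the word vector), the following minimal sketch may help. It is not the authors' implementation: the abstract does not say how the weights are computed, so the softmax over dot products with a context vector is an assumption made here, and the names `select_sense` and `sense_aware_embedding` are hypothetical.

```python
import math


def softmax(scores):
    """Turn raw scores into weights that sum to 1 (numerically stable)."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def dot(u, v):
    """Dot product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


def select_sense(context_vec, sense_vecs):
    """Weighted average of a word's sense vectors.

    ASSUMPTION: weights come from a softmax over the dot products between
    a context vector and each sense vector. The abstract only states that
    a weighted average is used, not how the weights are obtained.
    """
    weights = softmax([dot(context_vec, s) for s in sense_vecs])
    dim = len(sense_vecs[0])
    averaged = [
        sum(w * s[i] for w, s in zip(weights, sense_vecs)) for i in range(dim)
    ]
    return averaged, weights


def sense_aware_embedding(word_vec, context_vec, sense_vecs):
    """Concatenate the word vector with the averaged sense vector."""
    averaged, _ = select_sense(context_vec, sense_vecs)
    return list(word_vec) + averaged


# Toy example: one word with two candidate senses in a 2-d space.
senses = [[1.0, 0.0], [0.0, 1.0]]
embedding = sense_aware_embedding([2.0, 3.0], [0.0, 0.0], senses)
```

With a zero context vector both senses receive equal weight, so the averaged sense vector lies midway between them; a context vector closer to one sense shifts the weight toward that sense.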

Files (519.0 kB)

Name	Size
1507-Pu-finalversion.pdf md5:22e23bdca719829731e902e4df7f7059	519.0 kB

Additional details

Funding: European Commission, SUMMA – Scalable Understanding of Multilingual Media (688139)

	All versions	This version
Views	79	77
Downloads	89	88
Data volume	48.3 MB	47.7 MB
