Conference paper Open Access

Using Whole Document Context in Neural Machine Translation

Macé, Valentin; Servan, Christophe

In Machine Translation, considering the document as a whole can help to resolve ambiguities and inconsistencies. In this paper, we propose a simple yet promising approach for adding contextual information to Neural Machine Translation. We present a method for adding source context that captures the whole document with accurate boundaries, taking every word into account. We provide this additional information to a Transformer model and study the impact of our method on three language pairs. The proposed approach obtains promising results on the English-German, English-French and French-English document-level translation tasks. We observe interesting cross-sentential behaviors in which the model learns to use document-level information to improve translation coherence.
