Using Whole Document Context in Neural Machine Translation

Macé, Valentin; Servan, Christophe

doi:10.5281/zenodo.3525020

Published November 2, 2019 | Version v1

Conference paper Open

Using Whole Document Context in Neural Machine Translation

1. QWANT RESEARCH - 7 Rue Spontini, 75116 Paris, France

In Machine Translation, considering the document as a whole can help to resolve ambiguities and inconsistencies. In this paper, we propose a simple yet promising approach to add contextual information in Neural Machine Translation. We present a method to add source context that capture the whole document with accurate boundaries, taking every word into account. We provide this additional information to a Transformer model and study the impact of our method on three language pairs. The proposed approach obtains promising results in the English-German, English-French and French-English document-level translation tasks. We observe interesting cross-sentential behaviors where the model learns to use document-level information to improve translation coherence.

Files

IWSLT2019_paper_20.pdf

Files (168.3 kB)

Name	Size	Download all
IWSLT2019_paper_20.pdf md5:3a4a4614e5f77e181163dafd99370339	168.3 kB	Preview Download

Citations

Oops! Something went wrong while fetching results.

171

Views

105

Downloads

Show more details

	All versions	This version
Views	171	170
Downloads	105	105
Data volume	19.0 MB	19.0 MB

More info on how stats are collected....

DOI

Resource type

Conference paper

Publisher

Zenodo

Languages

English

Creative Commons Attribution 4.0 International

The Creative Commons Attribution license allows re-distribution and re-use of a licensed work on the condition that the creator is appropriately credited. Read more

Technical metadata

Created: November 1, 2019
Modified: July 22, 2024

Using Whole Document Context in Neural Machine Translation

Creators

Description

Files

IWSLT2019_paper_20.pdf

Files (168.3 kB)