Lexical Micro-adaptation for Neural Machine Translation

doi:10.5281/zenodo.3524977

Published November 2, 2019 | Version v1

Conference paper Open

Lexical Micro-adaptation for Neural Machine Translation

1. SYSTRAN, 5 rue Feydeau, 75002 Paris (France)

This work is inspired by a typical machine translation industry scenario in which translators make use of in-domain data for facilitating translation of similar or repeating sentences. We introduce a generic framework applied at inference in which a subset of segment pairs are first extracted from training data according to their similarity to the input sentences. These segments are then used to dynamically update the parameters of a generic NMT network, thus performing a lexical micro-adaptation. Our approach demonstrates strong adaptation performance to new and existing datasets including pseudo in-domain data. We evaluate our approach on a heterogeneous English-French training dataset showing accuracy gains on all evaluated domains when compared to strong adaptation baselines.

Files

IWSLT2019_paper_9.pdf

Files (432.3 kB)

Name	Size	Download all
IWSLT2019_paper_9.pdf md5:b145694f9a8fd725c74837681d966e22	432.3 kB	Preview Download

265

Views

145

Downloads

Show more details

	All versions	This version
Views	265	264
Downloads	145	144
Data volume	69.6 MB	69.2 MB

More info on how stats are collected....

DOI

Resource type

Conference paper

Publisher

Zenodo

Languages

English

Creative Commons Attribution 4.0 International

The Creative Commons Attribution license allows re-distribution and re-use of a licensed work on the condition that the creator is appropriately credited. Read more

Technical metadata

Created: November 1, 2019
Modified: July 22, 2024

Lexical Micro-adaptation for Neural Machine Translation

Creators

Description

Files

IWSLT2019_paper_9.pdf

Files (432.3 kB)