Published December 8, 2020 | Version v1
Conference paper Open

Integrating Domain Terminology into Neural Machine Translation

  • 1. SYSTRAN

Description

This paper extends existing work on terminology integration into Neural Machine Translation, a common industrial practice to dynamically adapt translation to a specific domain. Our method, based on the use of placeholders complemented with morphosyntactic annotation, efficiently taps into the ability of the neural network to deal with symbolic knowledge to surpass the surface generalization shown by alternative techniques. We compare our approach to state-of-the-art systems and benchmark them through a well-defined evaluation framework, focusing on actual application of terminology and not just on the overall performance. Results indicate the suitability of our method in the use-case where terminology is used in a system trained on generic data only.

Files

2020.coling-main.348.pdf

Files (268.4 kB)

Name Size Download all
md5:fc8e91b7e16396d94491fe75542f0378
268.4 kB Preview Download

Additional details

Related works

Is derived from
Conference paper: 10.18653/v1/2020.coling-main.348 (DOI)

Funding

ANITA – Advanced tools for fighting oNline Illegal TrAfficking 787061
European Commission