Integration of machine translation in on-line multilingual applications: Domain adaptation

doi:10.5281/zenodo.1291936

Published June 18, 2018 | Version v1

Book chapter Open

Integration of machine translation in on-line multilingual applications: Domain adaptation

Large amounts of bilingual corpora are used in the training process of statistical
machine translation systems. Usually a general domain is used as the training corpus. When the system is tested using data from the same domain, the obtained
results are satisfactory, but if the test set belongs to a different domain, the trans-
lation quality decreases. This is due to insufficient lexical coverage, wrong choice
in case of polysemous words, and differences in discourse style between the two
domains. Thus, the need to adapt the system is an ongoing research task in ma-
chine translation. Some challenges in performing domain adaptation are to decide
which part of the system requires adaptation and to choose what method needs to
be applied. In this paper, we used language model interpolation as a domain adaptation method and proved that it is a fast state of the art method that can be used in
building adapted translation systems even when sparse domain specific material
is available (i.e. especially in the case of low-resourced language pairs). The best
improvement was of 15 bleu points over the baseline system.

Files

7.pdf

Files (395.5 kB)

Name	Size	Download all
7.pdf md5:790b274f1937f8edabee808ff664d731	395.5 kB	Preview Download

132

Views

Downloads

Show more details

	All versions	This version
Views	132	132
Downloads	44	44
Data volume	18.6 MB	18.6 MB

More info on how stats are collected....

DOI

Resource type

Book chapter

Publisher

Language Science Press

Imprint

Language technologies for a multilingual Europe, 103-121. Berlin. ISBN: 978-3-946234-73-9.

Languages

English

Creative Commons Attribution 4.0 International

The Creative Commons Attribution license allows re-distribution and re-use of a licensed work on the condition that the creator is appropriately credited. Read more

Technical metadata

Created: June 29, 2018
Modified: August 2, 2024

Integration of machine translation in on-line multilingual applications: Domain adaptation

Creators

Description

Files

7.pdf

Files (395.5 kB)