CMU's Machine Translation System for IWSLT 2019

Srinivasan, Tejas; Sanabria, Ramon; Metze, Florian

doi:10.5281/zenodo.3525531

Published November 2, 2019 | Version v1

Conference paper Open

1. Language Technologies Institute, Carnegie Mellon University, USA

In Neural Machine Translation (NMT), the use of subwords and characters as source and target units offers a simple and flexible solution for translating rare and unseen words. However, selecting the optimal subword segmentation involves a trade-off between expressiveness and flexibility, and is language- and dataset-dependent. We present Block Multitask Learning (BMTL), a novel NMT architecture that predicts multiple targets of different granularities simultaneously, removing the need to search for the optimal segmentation strategy. Our multi-task model exhibits improvements of up to 1.7 BLEU points on each decoder over single-task baseline models with the same number of parameters on datasets from two language pairs of IWSLT15 and one from IWSLT19. The multiple hypotheses generated at different granularities can also be combined as a post-processing step to give better translations.
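To make the idea of predicting targets at several granularities concrete, the following is a minimal sketch that is not taken from the paper. It derives character-level and word-level target sequences from one reference sentence, so that each decoder in a multi-task model could be trained against its own segmentation. The function names, the `_` space marker, and the unweighted sum of per-decoder losses are all assumptions made for illustration. The abstract does not specify how BMTL builds its targets or weights its objectives.

```python
from typing import Dict, List


def char_units(sentence: str) -> List[str]:
    """Split into characters, marking spaces with '_' (an illustrative convention)."""
    return list(sentence.replace(" ", "_"))


def word_units(sentence: str) -> List[str]:
    """Split into whitespace-delimited words."""
    return sentence.split()


def multi_granularity_targets(sentence: str) -> Dict[str, List[str]]:
    """Return one target sequence per granularity for the same reference sentence."""
    return {
        "char": char_units(sentence),
        "word": word_units(sentence),
    }


def joint_loss(per_decoder_losses: Dict[str, float]) -> float:
    """Combine per-decoder losses.

    Assumption: an unweighted sum. The actual BMTL objective may differ.
    """
    return sum(per_decoder_losses.values())


if __name__ == "__main__":
    targets = multi_granularity_targets("a cat")
    for granularity, units in targets.items():
        print(granularity, units)
    print(joint_loss({"char": 1.5, "word": 2.0}))
```

A real system would typically add one or more subword granularities (for example, byte-pair encodings of different vocabulary sizes) between these two extremes; they are omitted here to keep the sketch dependency-free.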

Files

Name	Size
IWSLT2019_paper_33.pdf md5:c3dcd0b82c35d844e7fcb4a2ae05bde1	844.0 kB

Resource type

Conference paper

Publisher

Zenodo

Languages

English

License: Creative Commons Attribution 4.0 International

The Creative Commons Attribution license allows re-distribution and re-use of a licensed work on the condition that the creator is appropriately credited.

Technical metadata

Created: November 2, 2019
Modified: July 22, 2024
