Robust Neural Machine Translation for Clean and Noisy Speech Transcripts

Di Gangi, Matti; Enyedi, Robert; Brusadin, Alessandra; Federico, Marcello

doi:10.5281/zenodo.3524947

Published November 2, 2019 | Version v1

Conference paper | Open access

1. Fondazione Bruno Kessler, Trento, Italy & University of Trento, Italy
2. Amazon AI, East Palo Alto, USA

Neural machine translation (NMT) models have been shown to achieve high quality when trained on and fed with well-structured, punctuated input text. Unfortunately, this condition is not met in spoken language translation, where the input is generated by an automatic speech recognition (ASR) system. In this paper, we study how to adapt a strong NMT system to make it robust to typical ASR errors. Because transcripts in our application scenarios might be post-edited by human experts, we propose adaptation strategies to train a single system that can translate either clean or noisy input with no supervision on the input type. Our experimental results on a public speech translation data set show that adapting a model on a significant amount of parallel data that includes ASR transcripts is beneficial for test data of the same type, but produces a small degradation when translating clean text. Adapting on both clean and noisy variants of the same data leads to the best results on both input types.
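The last strategy in the abstract, adapting on both clean and noisy variants of the same data, can be illustrated with a minimal sketch. It is not the authors' implementation. The function name `build_mixed_adaptation_set`, its parameters, and the example sentences are hypothetical. It assumes each training example has a clean transcript, an ASR transcript of the same utterance, and one reference translation. It pairs both source variants with the same target and adds no tag marking the input type.

```python
import random
from typing import List, Tuple


def build_mixed_adaptation_set(
    clean_sources: List[str],
    noisy_sources: List[str],
    targets: List[str],
    seed: int = 0,
) -> List[Tuple[str, str]]:
    """Return shuffled (source, target) pairs covering both input variants.

    Hypothetical helper: every target sentence appears twice, once with its
    clean transcript and once with its ASR transcript. No marker of the input
    type is added, so a model fine-tuned on these pairs gets no supervision
    on whether a given source is clean or noisy.
    """
    if not (len(clean_sources) == len(noisy_sources) == len(targets)):
        raise ValueError("clean, noisy, and target lists must be aligned")

    pairs: List[Tuple[str, str]] = []
    for clean, noisy, target in zip(clean_sources, noisy_sources, targets):
        pairs.append((clean, target))  # clean (e.g. post-edited) transcript
        pairs.append((noisy, target))  # ASR output for the same utterance

    # Shuffle with a fixed seed so both input types are interleaved
    # reproducibly across fine-tuning batches.
    random.Random(seed).shuffle(pairs)
    return pairs


if __name__ == "__main__":
    # Invented example data, for illustration only.
    mixed = build_mixed_adaptation_set(
        clean_sources=["Hello, world."],
        noisy_sources=["hello word"],
        targets=["Ciao, mondo."],
    )
    for source, target in mixed:
        print(f"{source!r} -> {target!r}")
```

The resulting pairs would then be passed to whatever fine-tuning routine the NMT toolkit provides. That step is omitted here because the abstract does not name a toolkit.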

Files

Name	MD5	Size
IWSLT2019_paper_3.pdf	3d5686aed1fb4127847cffda9e8a5f5d	139.8 kB

	All versions	This version
Views	1,575	1,569
Downloads	288	288
Data volume	43.2 MB	43.2 MB
