End-to-end Speech Translation System Description of LIT for IWSLT 2019

Tu, Mei; Liu, Wei; Wang, Lijie; Chen, Xiao; Wen, Xue

doi:10.5281/zenodo.3525548

Published November 2, 2019 | Version v1

Journal article, open access

1. Speech Lab & Language Understanding Lab of Language Intelligence Team, Beijing

This paper describes our end-to-end speech translation system for the IWSLT 2019 evaluation task of translating lectures and TED talks from English to German. We propose layer-tied self-attention for end-to-end speech translation. The method shares weights between the speech encoder and the text decoder, so that the representation of the source speech and the representation of the target text are coordinated layer by layer and the model can learn a better speech-text alignment during training. We also use data augmentation to enlarge the parallel speech-text corpus. In the En-De experiments, our best model achieves 17.68 on tst2015. Our ASR system achieves a WER of 6.6% on the TED-LIUM test set. The En-Pt model achieves about 11.83 on the MuST-C dev set.
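As a rough illustration of what tying weights layer by layer can look like, the following toy sketch lets layer i of a "speech" stack and layer i of a "text" stack use the very same parameter object. It is not the authors' implementation: the `AttentionParams` container, the `run_stack` helper, the dimensions, and the single-head attention are all assumptions made for this example, and a real decoder would additionally need causal masking and cross-attention, which are omitted here.

```python
import math
import random


def matmul(a, b):
    """Multiply two matrices stored as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def softmax(row):
    """Numerically stable softmax over one list of scores."""
    peak = max(row)
    exps = [math.exp(v - peak) for v in row]
    total = sum(exps)
    return [e / total for e in exps]


class AttentionParams:
    """Query/key/value projections of one layer (hypothetical container)."""

    def __init__(self, dim, rng):
        def init():
            return [[rng.uniform(-0.1, 0.1) for _ in range(dim)] for _ in range(dim)]

        self.wq, self.wk, self.wv = init(), init(), init()


def self_attention(x, p):
    """Single-head scaled dot-product self-attention over a sequence x."""
    q, k, v = matmul(x, p.wq), matmul(x, p.wk), matmul(x, p.wv)
    scale = math.sqrt(len(x[0]))
    k_t = [list(col) for col in zip(*k)]  # transpose of k
    scores = matmul(q, k_t)
    weights = [softmax([s / scale for s in row]) for row in scores]
    return matmul(weights, v)


def run_stack(x, layers):
    """Apply each layer's self-attention in order (hypothetical helper)."""
    for p in layers:
        x = self_attention(x, p)
    return x


rng = random.Random(0)
dim, num_layers = 4, 3  # toy sizes, chosen only for the example

# One parameter set per depth; both stacks hold references to the same objects,
# so layer i of the encoder and layer i of the decoder are tied.
shared = [AttentionParams(dim, rng) for _ in range(num_layers)]
encoder_layers = list(shared)
decoder_layers = list(shared)

speech = [[rng.uniform(-1, 1) for _ in range(dim)] for _ in range(6)]  # 6 frames
text = [[rng.uniform(-1, 1) for _ in range(dim)] for _ in range(3)]    # 3 tokens

speech_repr = run_stack(speech, encoder_layers)
text_repr = run_stack(text, decoder_layers)
```

Because both lists reference the same objects, any update to a layer's projections through one stack is immediately visible to the other, which is the sense in which the two representations stay coordinated at each depth in this sketch.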

Files

IWSLT2019_paper_36.pdf (441.9 kB), md5:6631d0e8a342fa955195868701bc457f

Resource type: Journal article
Publisher: Zenodo
Languages: English

License: Creative Commons Attribution 4.0 International

The Creative Commons Attribution license allows re-distribution and re-use of a licensed work on the condition that the creator is appropriately credited.

Technical metadata

Created: November 2, 2019
Modified: July 22, 2024
