Conference paper Open Access
Pham, Ngoc-Quan; Nguyen, Thai-Son; Ha, Thanh-Le; Hussain, Juan; Schneider, Felix; Niehues, Jan; Stüker, Sebastian; Waibel, Alexander
This paper describes KIT’s submission to the IWSLT 2019 Speech Translation task on two sub-tasks corresponding to two different datasets. We investigate different end-to-end architectures for the speech recognition module, including our new transformer-based architectures. Overall, our modules in the pipe-line are based on the transformer architecture which has recently achieved great results in various fields. In our systems, using transformer is also advantageous compared to traditional hybrid systems in term of simplicity while still having competent results.