The University of Edinburgh's English-Tamil and English-Inuktitut Submissions to the WMT20 News Translation Task
Description
We describe the University of Edinburgh’s submissions to the WMT20 news translation shared task for the low-resource language pair English-Tamil and the mid-resource language pair English-Inuktitut. All submissions use the Transformer neural machine translation architecture, and we explore a variety of techniques to improve translation quality and compensate for the scarcity of parallel training data. For the very low-resource English-Tamil pair, this involves pretraining with both language model objectives and translation on an unrelated high-resource language pair (German-English), as well as iterative backtranslation. For English-Inuktitut, we explore multilingual systems, which, although not part of our primary submission, would have achieved the best results on the test set.
Files
| Name | Size | Checksum |
|---|---|---|
| 2020.wmt-1.5.pdf | 239.5 kB | md5:e58706613ec60d8cca00a17d1b140eb4 |