Conference paper Open Access

Analysis of Positional Encodings for Neural Machine Translation

Rosendahl, Jan; Tran, Viet Anh Khoa; Wang, Weiyue; Ney, Hermann

Dublin Core Export

<?xml version='1.0' encoding='utf-8'?>
<oai_dc:dc xmlns:dc="" xmlns:oai_dc="" xmlns:xsi="" xsi:schemaLocation="">
  <dc:creator>Rosendahl, Jan</dc:creator>
  <dc:creator>Tran, Viet Anh Khoa</dc:creator>
  <dc:creator>Wang, Weiyue</dc:creator>
  <dc:creator>Ney, Hermann</dc:creator>
  <dc:description>In this work we analyze and compare the behavior of the Transformer architecture when using different positional encoding methods. While absolute and relative positional encoding perform equally well overall, we show that relative positional encoding is vastly superior (4.4% to 11.9% BLEU) when translating a sentence that is longer than any observed training sentence. We further propose and analyze variations of relative positional encoding and observe that the number of trainable parameters can be reduced without a loss in performance, either by using fixed encoding vectors or by removing some of the positional encoding vectors.</dc:description>
  <dc:title>Analysis of Positional Encodings for Neural Machine Translation</dc:title>
</oai_dc:dc>