Analysis of Positional Encodings for Neural Machine Translation

Rosendahl, Jan; Tran, Viet Anh Khoa; Wang, Weiyue; Ney, Hermann

doi:10.5281/zenodo.3525024

Published November 2, 2019 | Version v1

Conference paper Open

Analysis of Positional Encodings for Neural Machine Translation

1. Human Language Technology and Pattern Recognition Group, RWTH Aachen University, Germany

In this work we analyze and compare the behavior of the Transformer architecture when using different positional encoding methods. While absolute and relative positional encoding perform equally strong overall, we show that relative positional encoding is vastly superior (4.4% to 11.9% BLEU) when translating a sentence that is longer than any observed training sentence. We further propose and analyze variations of relative positional encoding and observe that the number of trainable parameters can be reduced without a performance loss, by using fixed encoding vectors or by removing some of the positional encoding vectors.

Files

IWSLT2019_paper_21.pdf

Files (485.7 kB)

Name	Size	Download all
IWSLT2019_paper_21.pdf md5:a2422d4b74a9634b2567ac4e9f630065	485.7 kB	Preview Download

371

Views

247

Downloads

Show more details

	All versions	This version
Views	371	360
Downloads	247	247
Data volume	126.8 MB	126.8 MB

More info on how stats are collected....

DOI

Resource type

Conference paper

Publisher

Zenodo

License: Creative Commons Attribution 4.0 International

The Creative Commons Attribution license allows re-distribution and re-use of a licensed work on the condition that the creator is appropriately credited. Read more

Technical metadata

Created: November 1, 2019
Modified: July 22, 2024

Analysis of Positional Encodings for Neural Machine Translation

Creators

Description

Files

IWSLT2019_paper_21.pdf

Files (485.7 kB)