ANALYZING ARCHITECTURES FOR NEURAL MACHINE TRANSLATION USING LOW COMPUTATIONAL RESOURCES
Description
With recent developments in the field of Natural Language Processing, a variety of architectures have come into use for Neural Machine Translation. Transformer architectures achieve state-of-the-art accuracy, but they are very computationally expensive to train, and not everyone has access to setups with high-end GPUs and other resources. We train our models on low computational resources and investigate the results. As expected, Transformers outperformed the other architectures, but there were some surprising results: Transformers with more encoder and decoder layers took longer to train yet achieved lower BLEU scores. The LSTM performed well in the experiment and took comparatively less time to train than the Transformers, making it suitable for time-constrained settings.
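To give a feel for the kinds of models being compared, the sketch below contrasts a shallow Transformer with an LSTM encoder-decoder in PyTorch and reports their parameter counts. It is a rough illustration only, not the paper's configuration; the layer counts, model dimension, and vocabulary size are assumptions chosen for the example.

```python
# Illustrative sketch (not the paper's exact setup): a small Transformer vs.
# an LSTM encoder-decoder, as contrasted in the study. All hyperparameters
# below are assumptions for illustration.
import torch.nn as nn

VOCAB_SIZE = 32000   # assumed shared source/target vocabulary size
EMB_DIM = 512        # assumed embedding / model dimension

# Shallow Transformer: fewer encoder/decoder layers keep training time down.
transformer = nn.Transformer(
    d_model=EMB_DIM,
    nhead=8,
    num_encoder_layers=2,
    num_decoder_layers=2,
    dim_feedforward=1024,
)

# LSTM encoder-decoder of comparable width.
lstm_encoder = nn.LSTM(EMB_DIM, EMB_DIM, num_layers=2, batch_first=True)
lstm_decoder = nn.LSTM(EMB_DIM, EMB_DIM, num_layers=2, batch_first=True)

def count_params(*modules: nn.Module) -> int:
    """Total number of trainable parameters across the given modules."""
    return sum(p.numel() for m in modules for p in m.parameters() if p.requires_grad)

print("Transformer parameters:", count_params(transformer))
print("LSTM enc+dec parameters:", count_params(lstm_encoder, lstm_decoder))
```

On a low-resource setup, parameter count and layer depth roughly track training time per step, which is why a shallower recurrent model can be the more practical choice under time constraints.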
Files
10521ijnlc02.pdf (540.4 kB)