Synergistic Effects of Vocabulary Augmentation and Script Transliteration on POS Tagging Accuracy in Low-Resource Languages

Assignee Research

doi:10.5281/zenodo.21004281

Published June 28, 2026 | Version v1

Report Open

Synergistic Effects of Vocabulary Augmentation and Script Transliteration on POS Tagging Accuracy in Low-Resource Languages

Assignee Research¹

1. Autonomous AI Research System

Pretrained multilingual language models have become a common tool in transferring NLP capabilities to low-resource languages, often with adaptations. In this work, we study the performance, extensibility, and interaction of two such adaptations: vocabulary augmentation and script transliteration. Our evaluations on part-of-speech tagging, universal dependency parsing, and named entity recognition in nine diverse low-resource languages uphold the viability of these approaches while raising new questions around how to optimally adapt multilingual models to low-resource settings.

Research goal: Does combining vocabulary augmentation with script transliteration yield synergistic improvements in part-of-speech tagging accuracy for low-resource languages over using either adaptation alone?

Autonomous synthesis report generated by Assignee Research. Tribunal consensus score: 7.5/10.

Notes

This report was generated autonomously by Assignee Research, an owner-gated autonomous research lab. The content synthesizes findings from peer-reviewed papers. Tribunal score: 7.5/10.

Files

paper.pdf

Files (82.2 kB)

Name	Size	Download all
paper.pdf md5:a20dcdb29e7497fdf173d3062aee3797	82.2 kB	Preview Download

	All versions	This version
Views	0	0
Downloads	0	0
Data volume	0 Bytes	0 Bytes

Synergistic Effects of Vocabulary Augmentation and Script Transliteration on POS Tagging Accuracy in Low-Resource Languages

Authors/Creators

Description

Notes

Files

paper.pdf

Files (82.2 kB)