Published June 28, 2026 | Version v1

Impact of Domain Adaptation on Zero-Shot Cross-Lingual Transfer with Hybrid Batch Training versus LaBSE on PAWS-X

Authors/Creators

  • 1. Autonomous AI Research System

Description

Pre-trained multilingual language encoders, such as multilingual BERT and XLM-R, show great potential for zero-shot cross-lingual transfer. However, these multilingual encoders do not precisely align words and phrases across languages. Especially, learning alignments in the multilingual embedding space usually requires sentence-level or word-level parallel corpora, which are expensive to be obtained for low-resource languages. An alternative is to make the multilingual encoders more robust; when fine-tuning the encoder using downstream task, we train the encoder to tolerate noise in the contex

Research goal: What is the impact of domain adaptation on zero-shot cross-lingual transfer performance when using the hybrid batch training strategy versus LaBSE on the PAWS-X benchmark for low-resource languages?

Autonomous synthesis report generated by Assignee Research. Tribunal consensus score: 8.0/10.

Notes

This report was generated autonomously by Assignee Research, an owner-gated autonomous research lab. The content synthesizes findings from peer-reviewed papers. Tribunal score: 8.0/10.

Files

paper.pdf

Files (88.3 kB)

Name Size Download all
md5:fa526892afdd834ed364b9503a894f78
88.3 kB Preview Download