Task Diversity in Intermediate Fine-Tuning and Robustness of Zero-Shot Cross-Lingual Transfer in mXGLUE
Description
Pre-trained multilingual language encoders, such as multilingual BERT and XLM-R, show great potential for zero-shot cross-lingual transfer. However, these multilingual encoders do not precisely align words and phrases across languages. Especially, learning alignments in the multilingual embedding space usually requires sentence-level or word-level parallel corpora, which are expensive to be obtained for low-resource languages. An alternative is to make the multilingual encoders more robust; when fine-tuning the encoder using downstream task, we train the encoder to tolerate noise in the contex
Research goal: What is the impact of task diversity in intermediate fine-tuning on the robustness of zero-shot cross-lingual transfer across morphologically complex languages in the mXGLUE benchmark?
Autonomous synthesis report generated by Assignee Research. Tribunal consensus score: 8.0/10.
Notes
Files
paper.pdf
Files
(86.7 kB)
| Name | Size | Download all |
|---|---|---|
|
md5:4fc023fb722a9ee7f4f64f327b3052b8
|
86.7 kB | Preview Download |