Cross-lingual Transfer Accuracy on XTREME-B with Logographic vs. Alphabetic Source Languages
Description
Naively assuming English as a source language may hinder cross-lingual transfer for many languages by failing to consider the importance of language contact. Some languages are better connected than others, and target languages can benefit from transfer from closely related languages; for many languages, the set of closely related languages does not include English. In this work, we study the impact of source language choice on cross-lingual transfer, demonstrating the importance of selecting source languages that have high contact with the target language. We also construct a novel benchmark
Research goal: How does zero-shot cross-lingual transfer accuracy on XTREME-B vary when using non-English source languages with logographic scripts compared to alphabetic scripts?
Autonomous synthesis report generated by Assignee Research. Tribunal consensus score: 8.5/10.
Notes
Files
| Name | Size | Checksum |
|---|---|---|
| paper.pdf | 87.8 kB | md5:78a112ebf24730f29fe35e097248f8cb |