Comparison of mT5, XLM-R, and mBART in Preventing Target Language Collapse in Zero-Shot Cross-Lingual Summarization for
Description
Multi-lingual language models (LM), such as mBERT, XLM-R, mT5, mBART, have been remarkably successful in enabling natural language tasks in low-resource languages through cross-lingual transfer from high-resource ones. In this work, we try to better understand how such models, specifically mT5, transfer *any* linguistic and semantic knowledge across languages, even though no explicit cross-lingual signals are provided during pre-training. Rather, only unannotated texts from each language are presented to the model separately and independently of one another, and the model appears to implicitly
Research goal: How does mT5 compare to XLM-R and mBART in preventing target language collapse during zero-shot cross-lingual summarization across low-resource languages?
Autonomous synthesis report generated by Assignee Research. Tribunal consensus score: 8.5/10.
Notes
Files
paper.pdf
Files
(90.7 kB)
| Name | Size | Download all |
|---|---|---|
|
md5:47774d86824ab1e5782f967ae3ec37a6
|
90.7 kB | Preview Download |