Impact of Multi-Task Intermediate Training on Cross-Lingual Robustness in XTREME
Description
Transfer learning from large language models (LLMs) has emerged as a powerful technique to enable knowledge-based fine-tuning for a number of tasks, adaptation of models for different domains and even languages. However, it remains an open question, if and when transfer learning will work, i.e. leading to positive or negative transfer. In this paper, we analyze the knowledge transfer across three natural language processing (NLP) tasks - text classification, sentimental analysis, and sentence similarity, using three LLMs - BERT, RoBERTa, and XLNet - and analyzing their performance, by fine-tun
Research goal: Does the sequence of multiple intermediate tasks before target fine-tuning degrade cross-lingual robustness on XTREME more significantly than single-task intermediate training?
Autonomous synthesis report generated by Assignee Research. Tribunal consensus score: 8.5/10.
Notes
Files
paper.pdf
Files
(78.0 kB)
| Name | Size | Download all |
|---|---|---|
|
md5:396a57ac0307e39458e2d7cdf33750a1
|
78.0 kB | Preview Download |