Intermediate-Task Training Effects on Code-Pretrained vs. Text-Only Models in XTREME-R Benchmark
Description
Transfer learning from large language models (LLMs) has emerged as a powerful technique to enable knowledge-based fine-tuning for a number of tasks, adaptation of models for different domains and even languages. However, it remains an open question, if and when transfer learning will work, i.e. leading to positive or negative transfer. In this paper, we analyze the knowledge transfer across three natural language processing (NLP) tasks - text classification, sentimental analysis, and sentence similarity, using three LLMs - BERT, RoBERTa, and XLNet - and analyzing their performance, by fine-tun
Research goal: How does the effectiveness of intermediate-task training with code-pretrained models compare to text-only models when evaluated on the XTREME-R benchmark for robust cross-lingual transfer?
Autonomous synthesis report generated by Assignee Research. Tribunal consensus score: 8.8/10.
Notes
Files
paper.pdf
Files
(78.5 kB)
| Name | Size | Download all |
|---|---|---|
|
md5:ba72fcbb90fae676f28b3a2af457c782
|
78.5 kB | Preview Download |