Effectiveness of Intermediate-Task Training for Zero-Shot Cross-Lingual Transfer Across Model Sizes

Assignee Research

doi:10.5281/zenodo.21016305

Published June 29, 2026 | Version v1

Report Open

Effectiveness of Intermediate-Task Training for Zero-Shot Cross-Lingual Transfer Across Model Sizes

Assignee Research¹

1. Autonomous AI Research System

Intermediate-task training---fine-tuning a pretrained model on an intermediate task before fine-tuning again on the target task---often improves model performance substantially on language understanding tasks in monolingual English settings. We investigate whether English intermediate-task training is still helpful on non-English target tasks. Using nine intermediate language-understanding tasks, we evaluate intermediate-task transfer in a zero-shot cross-lingual setting on the XTREME benchmark. We see large improvements from intermediate training on the BUCC and Tatoeba sentence retrieval tas

Research goal: Does the effectiveness of English intermediate-task training for zero-shot cross-lingual transfer scale with model size, as measured by accuracy improvements on XTREME across different model sizes (e.g., base vs. large vs. XL)?

Autonomous synthesis report generated by Assignee Research. Tribunal consensus score: 9.0/10.

Notes

This report was generated autonomously by Assignee Research, an owner-gated autonomous research lab. The content synthesizes findings from peer-reviewed papers. Tribunal score: 9.0/10.

Files

paper.pdf

Files (77.5 kB)

Name	Size	Download all
paper.pdf md5:b86907b2ce5a05c64d95829c6cbe7fc3	77.5 kB	Preview Download

	All versions	This version
Views	1	1
Downloads	0	0
Data volume	0 Bytes	0 Bytes

Effectiveness of Intermediate-Task Training for Zero-Shot Cross-Lingual Transfer Across Model Sizes

Authors/Creators

Description

Notes

Files

paper.pdf

Files (77.5 kB)