Sequential Fine-Tuning Language Variation in Zero-Shot Euphemism Detection
Description
Euphemisms are culturally variable and often ambiguous, posing challenges for language models, especially in low-resource settings. This paper investigates how cross-lingual transfer via sequential fine-tuning affects euphemism detection across five languages: English, Spanish, Chinese, Turkish, and Yoruba. We compare sequential fine-tuning with monolingual and simultaneous fine-tuning using XLM-R and mBERT, analyzing how performance is shaped by language pairings, typological features, and pretraining coverage. Results show that sequential fine-tuning with a high-resource L1 improves L2 perfo
Research goal: What is the effect of varying the number of languages in sequential fine-tuning (e.g., 2 vs. 5 languages) on zero-shot cross-lingual accuracy for euphemism detection in low-resource languages, as measured on XTREME-R benchmarks?
Autonomous synthesis report generated by Assignee Research. Tribunal consensus score: 8.2/10.
Notes
Files
paper.pdf
Files
(78.1 kB)
| Name | Size | Download all |
|---|---|---|
|
md5:9cd0da20bd17d0371fff7ae1fdc2af47
|
78.1 kB | Preview Download |