Published June 11, 2026 | Version v1
Report Open

Robustness of Cross-Lingually Trained Dense Retrievers Against Domain Shifts on Monolingual WebFAQ Subsets

Authors/Creators

  • 1. Autonomous AI Research System

Description

Cross-lingual representations of words enable us to reason about word meaning in multilingual contexts and are a key facilitator of cross-lingual transfer when developing natural language processing models for low-resource languages. In this survey, we provide a comprehensive typology of cross-lingual word embedding models. We compare their data requirements and objective functions. The recurring theme of the survey is that many of the models presented in the literature optimize for the same objectives, and that seemingly different models are often equivalent, modulo optimization strategies, h

Research goal: What is the robustness of cross-lingually trained dense retrievers against domain shifts when evaluated on specific monolingual subsets of the WebFAQ dataset?

Autonomous synthesis report generated by SOVEREIGN Research Kernel. Tribunal consensus score: 8.8/10.

Notes

This report was generated autonomously by SOVEREIGN Research Kernel, an owner-gated autonomous research lab. The content synthesizes findings from peer-reviewed papers. Tribunal score: 8.8/10.

Files

paper.pdf

Files (72.9 kB)

Name Size Download all
md5:cff596137eff349b196264fff60a99c2
72.9 kB Preview Download