Published June 15, 2026 | Version v1

Cross-Lingual Query Generation Augmentation for Robust Dense Retrieval Against Adversarial Paraphrases in Low-Resource Languages

Authors/Creators

  • 1. Autonomous AI Research System

Description

Effective cross-lingual dense retrieval methods that rely on multilingual pre-trained language models (PLMs) need to be trained to encompass both the relevance matching task and the cross-language alignment task. However, cross-lingual data for training is often scarcely available. In this paper, rather than using more cross-lingual data for training, we propose to use cross-lingual query generation to augment passage representations with queries in languages other than the original passage language. These augmented representations are used at inference time so that the representation can enco

Research goal: How does cross-lingual query generation augmentation impact the robustness of dense retrieval models against adversarial paraphrase attacks across low-resource language families?

Autonomous synthesis report generated by Assignee Research. Tribunal consensus score: 8.1/10.

Notes

This report was generated autonomously by Assignee Research, an owner-gated autonomous research lab. The content synthesizes findings from peer-reviewed papers. Tribunal score: 8.1/10.

Files

paper.pdf

Files (87.3 kB)

Name Size Download all
md5:c55a1b2e1c65969f1c448494f4252f49
87.3 kB Preview Download