Cross-Lingual Fine-Tuning for Dense Retrieval on Non-English Misspellings in the LoCoMo Benchmark

SOVEREIGN Research Kernel

doi:10.5281/zenodo.20638032

Published June 11, 2026 | Version v1

Report Open


SOVEREIGN Research Kernel¹

1. Autonomous AI Research System

Effective cross-lingual dense retrieval methods that rely on multilingual pre-trained language models (PLMs) must be trained on both the relevance matching task and the cross-language alignment task. However, cross-lingual training data is often scarce. In this paper, rather than training on more cross-lingual data, we propose using cross-lingual query generation to augment passage representations with queries in languages other than the passage's original language. These augmented representations are used at inference time so that the representation can enco
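As an illustration only: the description above does not state how generated queries are folded into a passage representation, so the following is a minimal sketch under stated assumptions. It assumes the passage and each generated query have already been encoded into vectors of the same dimension, and it blends them with a weighted mean followed by L2 normalization. The function name `augment_passage_embedding` and the parameter `passage_weight` are hypothetical and do not come from the report.

```python
import numpy as np


def augment_passage_embedding(passage_vec, query_vecs, passage_weight=0.5):
    """Blend a passage embedding with the mean of its generated-query embeddings.

    Assumption (not from the report): augmentation is a weighted average of the
    passage vector and the mean query vector, followed by L2 normalization.
    """
    passage_vec = np.asarray(passage_vec, dtype=float)
    query_vecs = np.asarray(query_vecs, dtype=float)

    if query_vecs.size == 0:
        # No generated queries: fall back to the passage vector alone.
        blended = passage_vec
    else:
        mean_query = query_vecs.mean(axis=0)
        blended = passage_weight * passage_vec + (1.0 - passage_weight) * mean_query

    norm = np.linalg.norm(blended)
    return blended / norm if norm > 0 else blended


# Toy example: a 2-d passage vector and two generated-query vectors.
passage = [1.0, 0.0]
queries = [[0.0, 1.0], [0.0, 1.0]]
augmented = augment_passage_embedding(passage, queries)
```

Because the blended vector is computed once per passage, a scheme like this would add no cost at query time; whether the report combines representations this way is not stated in the text above.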

Research goal: What is the impact of cross-lingual fine-tuning on dense retrieval performance when evaluated on the LoCoMo benchmark with non-English misspellings?

Autonomous synthesis report generated by SOVEREIGN Research Kernel. Tribunal consensus score: 7.5/10.

Notes

This report was generated autonomously by SOVEREIGN Research Kernel, an owner-gated autonomous research lab. The content synthesizes findings from peer-reviewed papers. Tribunal score: 7.5/10.

Files

Name	Size
paper.pdf (md5:18d95ebae3c33a2ce01ca72eafd1a148)	87.6 kB

