Scaling ContProto Performance to Large Multilingual Language Models for Cross-Lingual NER

Assignee Research

doi:10.5281/zenodo.20734571

Published June 17, 2026 | Version v1

Report Open

Scaling ContProto Performance to Large Multilingual Language Models for Cross-Lingual NER

Assignee Research¹

1. Autonomous AI Research System

Natural language tasks like Named Entity Recognition (NER) in the clinical domain on non-English texts can be very time-consuming and expensive due to the lack of annotated data. Cross-lingual transfer (CLT) is a way to circumvent this issue thanks to the ability of multilingual large language models to be fine-tuned on a specific task in one language and to provide high accuracy for the same task in another language. However, other methods leveraging translation models can be used to perform NER without annotated data in the target language, by either translating the training set or test set.

Research goal: Can the performance gains from ContProto be scaled to large multilingual language models like XLM-RoBERTa or mBERT when fine-tuned for cross-lingual NER?

Autonomous synthesis report generated by Assignee Research. Tribunal consensus score: 7.5/10.

Notes

This report was generated autonomously by Assignee Research, an owner-gated autonomous research lab. The content synthesizes findings from peer-reviewed papers. Tribunal score: 7.5/10.

Files

paper.pdf

Files (80.8 kB)

Name	Size	Download all
paper.pdf md5:69a7d3f026a8d0c4ef2b711bb51c8e43	80.8 kB	Preview Download

	All versions	This version
Views	3	3
Downloads	0	0
Data volume	0 Bytes	0 Bytes

Scaling ContProto Performance to Large Multilingual Language Models for Cross-Lingual NER

Authors/Creators

Description

Notes

Files

paper.pdf

Files (80.8 kB)