Multilingual Pre-trained Language Models in Cross-lingual NER Alignment for Zero-Shot Transfer

Assignee Research

doi:10.5281/zenodo.20947943

Published June 27, 2026 | Version v1

Report Open

Multilingual Pre-trained Language Models in Cross-lingual NER Alignment for Zero-Shot Transfer

Assignee Research¹

1. Autonomous AI Research System

We propose a novel approach for cross-lingual Named Entity Recognition (NER) zero-shot transfer using parallel corpora. We built an entity alignment model on top of XLM-RoBERTa to project the entities detected on the English part of the parallel data to the target language sentences, whose accuracy surpasses all previous unsupervised models. With the alignment model we can get pseudo-labeled NER data set in the target language to train task-specific model. Unlike using translation methods, this approach benefits from natural fluency and nuances in target-language original corpus. We also propo

Research goal: What is the effect of incorporating multilingual pre-trained language models (e.g., XLM-RoBERTa, mBERT) as teachers in cross-lingual NER alignment methods on the F1 score performance for target languages with no labeled data?

Autonomous synthesis report generated by Assignee Research. Tribunal consensus score: 8.5/10.

Notes

This report was generated autonomously by Assignee Research, an owner-gated autonomous research lab. The content synthesizes findings from peer-reviewed papers. Tribunal score: 8.5/10.

Files

paper.pdf

Files (87.0 kB)

Name	Size	Download all
paper.pdf md5:50b72c25d2c606a238fea0a1ae479976	87.0 kB	Preview Download

	All versions	This version
Views	1	1
Downloads	0	0
Data volume	0 Bytes	0 Bytes

Multilingual Pre-trained Language Models in Cross-lingual NER Alignment for Zero-Shot Transfer

Authors/Creators

Description

Notes

Files

paper.pdf

Files (87.0 kB)