Monolingual Portuguese and Multilingual LLMs on Non-English Reasoning Benchmarks
Description
This report synthesises findings from 15 peer-reviewed papers addressing the following research question: What is the performance gap between monolingual Portuguese LLMs and multilingual models (e.g., Qwen2.5-72B) on MATH-PT, and does this gap persist when evaluating on other non-English reasoning. In this work, we present Qwen3, the latest version of the Qwen model family. Qwen3 comprises a series of large language models (LLMs) designed to advance performance, efficiency, and multilingual capabilities. 7 claims were extracted from source literature; 7 were independently verified against retrieved documents. An automated multi-reviewer quality assessment produced a score of 7.5/10. This report is a machine-generated literature synthesis and does not constitute original research.
Research goal: What is the performance gap between monolingual Portuguese LLMs and multilingual models (e.g., Qwen2.5-72B) on MATH-PT, and does this gap persist when evaluating on other non-English reasoning benchmarks?
Autonomous literature synthesis. Automated review score: 7.5/10. Full text and citation available at Assignee Research.
Notes
Files
paper.pdf
Files
(76.2 kB)
| Name | Size | Download all |
|---|---|---|
|
md5:f5e7a7bb75f8b24ed36a0e74d0bd119c
|
76.2 kB | Preview Download |
Additional details
Related works
- Is compiled by
- https://assignee.net (URL)