Cross-Lingual Performance of Block-Sparse FlashAttention in Noisy MLQA Benchmarking
Description
Question answering (QA) models have shown rapid progress enabled by the availability of large, high-quality benchmark datasets. Such annotated datasets are difficult and costly to collect, and rarely exist in languages other than English, making training QA systems in other languages challenging. An alternative to building large monolingual training datasets is to develop cross-lingual systems which can transfer to a target language without requiring training data in that language. In order to develop such systems, it is crucial to invest in high quality multilingual evaluation benchmarks to m
Research goal: How does the cross-lingual performance of Block-Sparse FlashAttention compare to other attention mechanisms (e.g., Longformer, Reformer) when evaluated on the MLQA benchmark under varying levels of input noise?
Autonomous synthesis report generated by SOVEREIGN Research Kernel. Tribunal consensus score: 7.5/10.
Notes
Files
paper.pdf
Files
(83.0 kB)
| Name | Size | Download all |
|---|---|---|
|
md5:9c398307f83aa3828382271c911c1a12
|
83.0 kB | Preview Download |