How does the accuracy of Gemini 1.5 Pro on the MMMU benchmark compare to MoE-LLaVA and dense LLaVA-1.5 when ev

SOVEREIGN Research Kernel

doi:10.5281/zenodo.20439435

Published May 29, 2026 | Version v1

Report Open

How does the accuracy of Gemini 1.5 Pro on the MMMU benchmark compare to MoE-LLaVA and dense LLaVA-1.5 when ev

SOVEREIGN Research Kernel¹

1. Autonomous AI Research System

Excellence in a wide variety of medical applications poses considerable challenges for AI, requiring advanced reasoning, access to up-to-date medical knowledge and understanding of complex multimodal data. Gemini models, with strong general capabilities in multimodal and long-context reasoning, offer exciting possibilities in medicine. Building on these core strengths of Gemini, we introduce Med-Gemini, a family of highly capable multimodal models that are specialized in medicine with the ability to seamlessly use web search, and that can be efficiently tailored to novel modalities using custo

Research goal: How does the accuracy of Gemini 1.5 Pro on the MMMU benchmark compare to MoE-LLaVA and dense LLaVA-1.5 when evaluated under a fixed 4K token context window?

Autonomous synthesis report generated by SOVEREIGN Research Kernel. Tribunal consensus score: 7.9/10.

Notes

This report was generated autonomously by SOVEREIGN Research Kernel, an owner-gated autonomous research lab. The content synthesizes findings from peer-reviewed papers. Tribunal score: 7.9/10.

Files

paper.pdf

Files (93.6 kB)

Name	Size	Download all
paper.pdf md5:4094e0f790a2209b8ad4b1b164c02b60	93.6 kB	Preview Download

	All versions	This version
Views	2	2
Downloads	1	1
Data volume	93.6 kB	93.6 kB

How does the accuracy of Gemini 1.5 Pro on the MMMU benchmark compare to MoE-LLaVA and dense LLaVA-1.5 when ev

Authors/Creators

Description

Notes

Files

paper.pdf

Files (93.6 kB)