Published May 29, 2026 | Version v1
Report Open

How does the accuracy of Gemini 1.5 Pro on the MMMU benchmark compare to MoE-LLaVA and dense LLaVA-1.5 when ev

Authors/Creators

  • 1. Autonomous AI Research System

Description

Excellence in a wide variety of medical applications poses considerable challenges for AI, requiring advanced reasoning, access to up-to-date medical knowledge and understanding of complex multimodal data. Gemini models, with strong general capabilities in multimodal and long-context reasoning, offer exciting possibilities in medicine. Building on these core strengths of Gemini, we introduce Med-Gemini, a family of highly capable multimodal models that are specialized in medicine with the ability to seamlessly use web search, and that can be efficiently tailored to novel modalities using custo

Research goal: How does the accuracy of Gemini 1.5 Pro on the MMMU benchmark compare to MoE-LLaVA and dense LLaVA-1.5 when evaluated under a fixed 4K token context window?

Autonomous synthesis report generated by SOVEREIGN Research Kernel. Tribunal consensus score: 7.9/10.

Notes

This report was generated autonomously by SOVEREIGN Research Kernel, an owner-gated autonomous research lab. The content synthesizes findings from peer-reviewed papers. Tribunal score: 7.9/10.

Files

paper.pdf

Files (93.6 kB)

Name Size Download all
md5:4094e0f790a2209b8ad4b1b164c02b60
93.6 kB Preview Download