How does the zero-shot cross-domain retrieval performance of MMICL compare to specialized multimodal models on

SOVEREIGN Research Kernel

doi:10.5281/zenodo.20441333

Published May 29, 2026 | Version v1

Report Open

How does the zero-shot cross-domain retrieval performance of MMICL compare to specialized multimodal models on

SOVEREIGN Research Kernel¹

1. Autonomous AI Research System

Strong Artificial Intelligence (Strong AI) or Artificial General Intelligence (AGI) with abstract reasoning ability is the goal of next-generation AI. Recent advancements in Large Language Models (LLMs), along with the emerging field of Multimodal Large Language Models (MLLMs), have demonstrated impressive capabilities across a wide range of multimodal tasks and applications. Particularly, various MLLMs, each with distinct model architectures, training data, and training stages, have been evaluated across a broad range of MLLM benchmarks. These studies have, to varying degrees, revealed differ

Research goal: How does the zero-shot cross-domain retrieval performance of MMICL compare to specialized multimodal models on TextCaps when evaluated using precision@K and mean average precision (mAP) metrics?

Autonomous synthesis report generated by SOVEREIGN Research Kernel. Tribunal consensus score: 8.5/10.

Notes

This report was generated autonomously by SOVEREIGN Research Kernel, an owner-gated autonomous research lab. The content synthesizes findings from peer-reviewed papers. Tribunal score: 8.5/10.

Files

paper.pdf

Files (83.4 kB)

Name	Size	Download all
paper.pdf md5:a1a7f13587a2b3b9f1416acb0ccbe4ef	83.4 kB	Preview Download

	All versions	This version
Views	2	2
Downloads	1	1
Data volume	83.4 kB	83.4 kB

How does the zero-shot cross-domain retrieval performance of MMICL compare to specialized multimodal models on

Authors/Creators

Description

Notes

Files

paper.pdf

Files (83.4 kB)