MCP in the Wild: Cross-Domain Knowledge Discovery through Multi-Server Orchestration
Authors/Creators
Description
The first empirical study of multi-server Model Context Protocol (MCP) orchestration with a 7-model cross-domain synthesis benchmark. Seventeen real MCP tool calls across six servers (arXiv, PubMed, Firecrawl, Context7, Memory, Filesystem) produced nine cross-domain insights. Seven LLMs were benchmarked (GPT-5.4, DeepSeek R1, Mistral Large 3, Llama 4 Maverick, Gemini 2.5 Flash, Claude Sonnet 4.5, Claude Haiku 4.5) on identical data. All seven independently identified the mechanism-pattern gap: composition patterns for multi-server MCP are undocumented. Five patterns were proposed.
Files
paper-final.pdf
Files
(70.7 kB)
| Name | Size | Download all |
|---|---|---|
|
md5:5e8e3a77133935ddcb7e9e96055adafc
|
70.7 kB | Preview Download |
Additional details
Dates
- Created
-
2026-03-09
Software
- Repository URL
- https://github.com/doganarif/mcp-bench
- Programming language
- Python
- Development Status
- Active