Published December 11, 2025 | Version v2
Publication Open

First Documented Proof of Cross-Vendor AI Collaboration on a Benchmark: Multi-AI Consensus Achieves Best-Ever 50% on IMO 2025

  • 1. NameONE Studios inc

Contributors

Contact person:

Description

This is the first documented instance of multiple frontier AI systems from different vendors (Claude, GPT-4, Grok, Gemini, DeepSeek, Kimi) collaborating in real time to solve mathematical olympiad problems. The collaboration achieved 50% accuracy (3/6 problems correct) on IMO 2025, an 18.4 percentage point improvement over the Gemini baseline. Notably, Gemini alone solved 0/6 problems in our trials; all three correct answers emerged from cross-AI collaboration and consensus voting. This work demonstrates that multi-vendor AI collaboration can exceed individual model performance on the hardest mathematical reasoning benchmarks, and it introduces a novel "Family Game Night" protocol for fallback reasoning when primary models fail.
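The description mentions consensus voting and a few headline figures but does not spell out the mechanics. The sketch below is only an illustration under stated assumptions: it treats consensus as a simple plurality vote with a minimum agreement threshold, which may differ from the protocol actually used. The function names, the `min_votes` parameter, and the example answers are hypothetical. The arithmetic helpers only restate the numbers given above (3/6 = 50%, and a 50% score with an 18.4 point improvement).

```python
from collections import Counter
from typing import Optional


def consensus_answer(answers: dict[str, str], min_votes: int = 2) -> Optional[str]:
    """Return the plurality answer, or None when there is no clear consensus.

    Assumption: consensus means a unique most-common answer backed by at
    least `min_votes` models. The record does not specify the real rule.
    """
    if not answers:
        return None
    counts = Counter(answers.values())
    best, votes = counts.most_common(1)[0]
    # Collect every answer that reached the top vote count to detect ties.
    tied = [a for a, v in counts.items() if v == votes]
    if votes < min_votes or len(tied) > 1:
        return None
    return best


def accuracy_percent(correct: int, total: int) -> float:
    """Accuracy as a percentage, e.g. 3 of 6 problems."""
    return 100.0 * correct / total


def implied_baseline(score_percent: float, improvement_points: float) -> float:
    """Baseline implied by a score and a percentage-point improvement."""
    return score_percent - improvement_points


if __name__ == "__main__":
    # Hypothetical votes for a single problem; not data from the record.
    votes = {"model_a": "42", "model_b": "42", "model_c": "7"}
    print(consensus_answer(votes))
    print(accuracy_percent(3, 6))
    print(implied_baseline(50.0, 18.4))
```

A threshold plus a tie check is used here so that a split vote yields no answer rather than an arbitrary one; a fallback step could then handle the unresolved case.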

Files

HyperNet_IMO_2025_MultiAI_Collaboration.pdf (12.2 kB)
md5:71657bb5a6f1283b684b0881c4f758e3

Additional details

Dates

Issued
2025-11-29
HyperNet