First Documented Proof of Cross-Vendor AI Collaboration on a Benchmark: Multi-AI Consensus Achieves Best-Ever 50% on IMO 2025

Kawa, Steven

doi:10.5281/zenodo.17903905

Published December 11, 2025 | Version v2

Publication Open

Kawa, Steven (Contact person)¹

1. NameONE Studios inc

We report the first documented instance of multiple frontier AI systems from different vendors (Claude, GPT-4, Grok, Gemini, DeepSeek, Kimi) collaborating in real time to solve mathematical olympiad problems. The combined system achieved 50% accuracy (3/6 problems correct) on IMO 2025, an 18.4 percentage point improvement over the Gemini baseline. Notably, Gemini alone solved 0/6 problems in our trials; all three correct answers emerged from cross-AI collaboration and consensus voting. This work demonstrates that multi-vendor AI collaboration can exceed individual model performance on the hardest mathematical reasoning benchmarks, and it introduces a novel "Family Game Night" protocol for fallback reasoning when primary models fail.
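The description above does not spell out the voting rule or the internals of the "Family Game Night" protocol, so the following is only a minimal sketch of one plausible reading: if a primary model returns no answer, fall back to a majority vote over the answers of the other models. The function names (`consensus_vote`, `solve_with_fallback`), the `min_votes` threshold, the tie-handling rule, and the placeholder model labels are all assumptions made for illustration and are not taken from the paper.

```python
from collections import Counter
from typing import Mapping, Optional


def consensus_vote(answers: Mapping[str, Optional[str]], min_votes: int = 2) -> Optional[str]:
    """Return the most common answer if at least `min_votes` models agree on it.

    `answers` maps a model label to its final answer (None if the model gave none).
    Returns None when there is no answer, a tie for first place, or too few votes.
    """
    # Strip whitespace so "42" and "42 " count as the same answer (assumed normalization).
    cleaned = [a.strip() for a in answers.values() if a is not None and a.strip()]
    if not cleaned:
        return None
    ranked = Counter(cleaned).most_common()
    top_answer, top_count = ranked[0]
    # Treat a tie for first place as "no consensus" (an assumption, not the paper's rule).
    if len(ranked) > 1 and ranked[1][1] == top_count:
        return None
    return top_answer if top_count >= min_votes else None


def solve_with_fallback(
    primary_answer: Optional[str],
    peer_answers: Mapping[str, Optional[str]],
    min_votes: int = 2,
) -> Optional[str]:
    """Use the primary model's answer if it produced one, else fall back to peer consensus."""
    if primary_answer is not None and primary_answer.strip():
        return primary_answer.strip()
    return consensus_vote(peer_answers, min_votes)


if __name__ == "__main__":
    # Hypothetical answers from placeholder models; not data from the paper.
    peers = {"model_a": "42", "model_b": "42 ", "model_c": "7"}
    print(solve_with_fallback(None, peers))
```

A threshold on agreeing votes, rather than a plain plurality, keeps a single outlier answer from being accepted when the remaining models disagree with each other.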

Files

HyperNet_IMO_2025_MultiAI_Collaboration.pdf (12.2 kB) md5:71657bb5a6f1283b684b0881c4f758e3

Additional details

Issued: 2025-11-29

HyperNet

	All versions	This version
Views	74	43
Downloads	26	24
Data volume	514.2 kB	464.1 kB

Resource type: Publication
Publisher: Zenodo
Languages: English

License: Creative Commons Attribution 4.0 International

The Creative Commons Attribution license allows re-distribution and re-use of a licensed work on the condition that the creator is appropriately credited.
Copyright: NameONE Studios inc.

Technical metadata

Created: December 11, 2025
Modified: December 11, 2025
