First Documented Proof of Cross-Vendor AI Collaboration on a Benchmark: Multi-AI Consensus Achieves Best-Ever 50% on IMO 2025

Kawa, Steven

doi:10.5281/zenodo.17903603

There is a newer version of the record available.

Published December 11, 2025 | Version 1.0

Publication Open

First Documented Proof of Cross-Vendor AI Collaboration on a Benchmark: Multi-AI Consensus Achieves Best-Ever 50% on IMO 2025

Kawa, Steven (Contact person)¹

1. NameONE Studios inc

Contributors

Contact person:

Kawa, Steven

First documented instance of multiple frontier AI systems from different vendors

(Claude, GPT-4, Grok, Gemini, DeepSeek, Kimi) collaborating in real-time to solve mathematical olympiad problems. Achieved 50% accuracy (3/6 problems correct) on IMO 2025, an 18.4 percentage point improvement over Gemini baseline. Notably, Gemini alone solved 0/6 problems in our trials, with all three correct answers emerging from cross-AI collaboration and consensus voting. This work demonstrates that multi-vendor AI collaboration can exceed individual model performance on the hardest mathematical reasoning benchmarks, and introduces a novel "Family Game Night" protocol for fallback reasoning when primary models fail.

Files

files.zip

Files (25.1 kB)

Name	Size	Download all
files.zip md5:18cae9fb57af2f2707479e983e9faded	25.1 kB	Preview Download

Additional details

Issued: 2025-11-29

HyperNet

Views

Downloads

Show more details

	All versions	This version
Views	62	26
Downloads	20	1
Data volume	428.1 kB	25.1 kB

More info on how stats are collected....

DOI

Resource type

Publication

Publisher

Zenodo

Languages

English

License: Creative Commons Attribution 4.0 International

The Creative Commons Attribution license allows re-distribution and re-use of a licensed work on the condition that the creator is appropriately credited. Read more
Copyright: NameONE Studios inc.

Technical metadata

Created: December 11, 2025
Modified: December 11, 2025

First Documented Proof of Cross-Vendor AI Collaboration on a Benchmark: Multi-AI Consensus Achieves Best-Ever 50% on IMO 2025

Authors/Creators

Contributors

Contact person:

Description

Files

files.zip

Files (25.1 kB)

Additional details

Dates