Openai And Google Outdo The Mathletes, But Not Each Other

1 month ago

AI models from OpenAI and Google DeepMind achieved golden badge scores successful nan 2025 International Math Olympiad (IMO), 1 of nan world’s oldest and astir challenging precocious schoolhouse level mathematics competitions, nan companies independently announced successful caller days.

The consequence underscores conscionable really accelerated AI systems are advancing, and yet, really evenly matched Google and OpenAI look to beryllium successful nan AI race. AI companies are competing fiercely for nan nationalist cognition of down up successful nan AI race: an intangible conflict of “vibes” that tin person large implications for securing apical AI talent. A batch of AI researchers travel from backgrounds successful competitory math, truthful benchmarks for illustration IMO mean much than others.

Last year, Google scored a metallic badge astatine IMO utilizing a “formal” system, meaning it required humans to construe problems into a machine‑readable format. This year, some OpenAI and Google entered “informal” systems into nan competition, which were capable to ingest questions and make proof‑based answers successful earthy language. Both companies declare their AI models scored higher than astir precocious schoolhouse students and Google’s AI exemplary from past year, without requiring immoderate human-machine translation.

In interviews pinch TechCrunch, researchers down OpenAI and Google’s IMO efforts claimed that these golden badge performances correspond breakthroughs astir AI reasoning models successful non-verifiable domains. While AI reasoning models thin to do good connected questions pinch straightforward answers, specified arsenic mathematics aliases coding tasks, these systems struggle connected tasks pinch much ambiguous solutions, specified arsenic buying a awesome chair aliases helping pinch analyzable research.

However, Google is raising questions astir really OpenAI conducted and announced its golden badge IMO performance. After all, if you’re going to participate AI models into a mathematics title for precocious schoolers, you mightiness arsenic good reason for illustration teenagers.

Shortly aft OpenAI announced its feat connected Saturday morning, Google DeepMind’s CEO and researchers took to societal media to slam OpenAI for announcing its gold‑medal prematurely — soon aft IMO announced which precocious schoolers had won nan title connected Friday nighttime — and for not having their model’s trial officially evaluated by IMO.

Btw arsenic an aside, we didn’t denote connected Friday because we respected nan IMO Board's original petition that each AI labs stock their results only aft nan charismatic results had been verified by independent experts & nan students had rightly received nan acclamation they deserved

— Demis Hassabis (@demishassabis) July 21, 2025

Thang Luong, a Google DeepMind elder interrogator and lead for nan IMO project, told TechCrunch that Google waited to denote its IMO results to respect nan students participating successful nan competition.

Techcrunch event

San Francisco | October 27-29, 2025

Luong said that Google has been moving pinch IMO’s organizers since past twelvemonth successful mentation for nan trial and wanted to person nan IMO president’s blessing and charismatic grading earlier announcing its charismatic results, which it did connected Monday morning.

“The IMO organizers person their grading guideline,” Luong said. “So immoderate information that’s not based connected that line could not make immoderate declare astir gold-medal level [performance].”

Noam Brown, a elder OpenAI interrogator who worked connected nan IMO model, told TechCrunch that IMO reached retired to OpenAI a fewer months agone astir participating successful a general mathematics competition, but nan ChatGPT-maker declined because it was moving connected earthy connection systems that it thought were much worthy pursuing. Brown says OpenAI didn’t cognize IMO was conducting an informal trial pinch Google.

OpenAI says it hired third-party evaluators — 3 erstwhile IMO medalists who understood nan grading strategy — to people its AI model’s performance. After OpenAI learned of its golden badge score, Brown said nan institution reached retired to IMO, which past told nan institution to hold to denote until aft IMO’s Friday nighttime grant ceremony.

IMO did not respond to TechCrunch’s petition for comment.

Google isn’t needfully incorrect present — it did spell done a much official, rigorous process to execute its golden badge people — but nan statement whitethorn miss nan bigger picture: AI models from respective starring AI labs are improving quickly. Countries from astir nan world sent their brightest students to compete astatine IMO this year, and conscionable a fewer percent of them scored arsenic good arsenic OpenAI and Google’s AI models did.

While OpenAI utilized to person a important lead complete nan industry, it surely feels arsenic though nan title is much intimately matched than immoderate institution would for illustration to admit. OpenAI is expected to merchandise GPT-5 successful nan coming months, and nan institution surely hopes to springiness disconnected nan belief that it still leads nan AI industry.

Maxwell Zeff is simply a elder newsman astatine TechCrunch specializing successful AI. Previously pinch Gizmodo, Bloomberg, and MSNBC, Zeff has covered nan emergence of AI and nan Silicon Valley Bank crisis. He is based successful San Francisco. When not reporting, he tin beryllium recovered hiking, biking, and exploring nan Bay Area’s nutrient scene.