2025-11-07
I had missed that AlphaEvolve is able to find the construction needed for the 2025 IMO P6. At least, when given this delightful hint... [image]
@azwagner_
Researchers tested Google DeepMind's AlphaEvolve AI coding agent on 67 mathematical problems and found that it discovered improved solutions to ~20 of them
Really happy to share our new paper on using AlphaEvolve for mathematical exploration at scale, written with Javier Gómez-Serrano, Terence Tao, and @GoogleDeepMind's Bogdan Georgie...
2025-07-20
Pretty happy with how my predictions are holding up. 5/6 was the gold medal threshold this year. OAI's “experimental reasoning LLM” got that exactly, failing only to solve the one hard combinatorics problem, P6. My advice remains: look beyond the medal. Brief thread. 1/ [image]
@alexwei_
[Thread] An OpenAI researcher says the company's latest experimental reasoning LLM achieved gold medal-level performance on the 2025 International Math Olympiad
1/N I'm excited to share that our latest @OpenAI experimental reasoning LLM has achieved a longstanding grand challenge in AI: gold medal-level performance on the world's most pres...