greghburnham · TEXXR

2025-11-07

I had missed that AlphaEvolve is able to find the construction needed for the 2025 IMO P6. At least, when given this delightful hint... [image]

2025-11-07 View on X

@azwagner_

Researchers tested Google DeepMind's AlphaEvolve AI coding agent on 67 mathematical problems and found that it discovered improved solutions to ~20 of them

Really happy to share our new paper on using AlphaEvolve for mathematical exploration at scale, written with Javier Gómez-Serrano, Terence Tao, and @GoogleDeepMind's Bogdan Georgie...

View original

2025-07-20

Pretty happy with how my predictions are holding up. 5/6 was the gold medal threshold this year. OAI's “experimental reasoning LLM” got that exactly, failing only to solve the one hard combinatorics problem, P6. My advice remains: look beyond the medal. Brief thread. 1/ [image]

2025-07-20 View on X

@alexwei_

[Thread] An OpenAI researcher says the company's latest experimental reasoning LLM achieved gold medal-level performance on the 2025 International Math Olympiad

1/N I'm excited to share that our latest @OpenAI experimental reasoning LLM has achieved a longstanding grand challenge in AI: gold medal-level performance on the world's most pres...

View original