Adam Zsolt Wagner
@azwagner_
Followers: 326
Following: 111
Media: 7
Statuses: 42
Research Scientist @ Google DeepMind | Former Professor of Mathematics @ WPI
London, UK
Joined November 2024
Really happy to share our new paper on using AlphaEvolve for mathematical exploration at scale, written with Javier Gómez-Serrano, Terence Tao, and @GoogleDeepMind's Bogdan Georgiev. We tested it on 67 problems and documented all our successes and failures. 🧵
19
146
878
Very excited that our AlphaProof paper is finally out! It's the final thing I worked on at DeepMind, very satisfying to be able to share the full details now - very fun project and awesome team! https://t.co/OuWDemzAt4
17
101
1K
(1) Our team at @GoogleDeepMind has been collaborating with Terence Tao and Javier Gómez-Serrano to use our AI agents (AlphaEvolve, AlphaProof, & Gemini Deep Think) for advancing Maths research. They find that AlphaEvolve can help discover new results across a range of problems.
27
182
2K
Find out more in our joint paper: https://t.co/ND2v0AZL3q Terence Tao's blog post: https://t.co/Br9qQzHrHO
@UCLA @BrownUniversity
terrytao.wordpress.com
Bogdan Georgiev, Javier Gómez-Serrano, Adam Zsolt Wagner, and I have uploaded to the arXiv our paper “Mathematical exploration and discovery at scale”. This is a longer report on…
3
5
38
While AlphaEvolve didn't succeed on every problem - and relied on expert human guidance to find the right path in many cases - these findings highlight how mathematicians and AI systems can work together to test hypotheses and discover patterns that might otherwise be missed.
1
2
35
A personal favorite result: avoiding isosceles triangles on a grid. On a 64x64 grid, previous work with @f_charton, @JSEllenberg & Geordie Williamson suggested 112 points were possible, but we couldn't find a construction. AlphaEvolve finally found this elusive set!
2
3
46
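For readers who want to check a candidate point set against the "no isosceles triangle" condition mentioned above, here is a minimal sketch of a brute-force verifier. It is not code from the paper. The function names and the `count_degenerate` flag are hypothetical, and whether three collinear points with one at the midpoint count as a (degenerate) isosceles triangle is an assumption exposed as a parameter, since the tweet does not state the convention.

```python
from collections import defaultdict


def squared_distance(p, q):
    """Squared Euclidean distance between two integer grid points."""
    return (p[0] - q[0]) ** 2 + (p[1] - q[1]) ** 2


def has_isosceles_triple(points, count_degenerate=True):
    """Return True if some apex point is equidistant from two other points.

    count_degenerate (assumed convention): if True, three collinear points
    with the apex at the midpoint also count as an isosceles triangle.
    """
    pts = list(set(points))
    for apex in pts:
        # Group every other point by its squared distance to the apex.
        by_dist = defaultdict(list)
        for other in pts:
            if other != apex:
                by_dist[squared_distance(apex, other)].append(other)
        for group in by_dist.values():
            if len(group) >= 3:
                # Each point has at most one mirror image through the apex,
                # so three equidistant points always include a non-collinear pair.
                return True
            if len(group) == 2:
                b, c = group
                apex_is_midpoint = (
                    b[0] + c[0] == 2 * apex[0] and b[1] + c[1] == 2 * apex[1]
                )
                if count_degenerate or not apex_is_midpoint:
                    return True
    return False
```

A set passes the check when `has_isosceles_triple(candidate)` returns `False`. The loop is quadratic in the number of points, which is fast for a set of about a hundred points.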
In a few cases AlphaEvolve discovered general patterns that solve the problem not just in the cases it was asked to solve, but in all cases. Excitingly, a pipeline combining AlphaEvolve with @GoogleDeepMind's Gemini Deep Think and AlphaProof is able to prove such a generalisation
1
2
35
We tested AlphaEvolve on a wide-ranging portfolio of 67 mathematical problems in combinatorics, geometry and more. It found new constructions that improved upon the best-known results on about 20 of them, such as finding denser ways to pack 11 cubes into a larger cube.
1
1
43
Here’s a reminder of how AlphaEvolve works:
Our system uses: 🔵 LLMs: To synthesize information about problems as well as previous attempts to solve them - and to propose new versions of algorithms 🔵 Automated evaluation: To address the broad class of problems where progress can be clearly and systematically measured. 🔵
1
1
31
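The propose-and-evaluate idea described above can be illustrated with a toy loop. This is only a sketch under loud assumptions, not AlphaEvolve's implementation: the real system has LLMs propose changes to programs, whereas here a random perturbation of an integer vector stands in for the proposal step, and the objective, function names, and parameters are all hypothetical.

```python
import random


def evaluate(candidate):
    """Automated evaluator: deterministic score, higher is better (toy objective)."""
    return -sum((x - 3) ** 2 for x in candidate)


def propose(parent, rng):
    """Stand-in for the LLM proposal step: nudge one coordinate by +/-1."""
    child = list(parent)
    i = rng.randrange(len(child))
    child[i] += rng.choice([-1, 1])
    return child


def evolve(initial, generations=200, population_size=8, seed=0):
    """Keep a small population of scored candidates and refine it step by step."""
    rng = random.Random(seed)
    population = [(evaluate(initial), list(initial))]
    history = []
    for _ in range(generations):
        _, parent = rng.choice(population)        # pick a previous attempt
        child = propose(parent, rng)              # propose a new version
        population.append((evaluate(child), child))
        # Keep only the top-scoring candidates; the best one is never dropped.
        population.sort(key=lambda item: item[0], reverse=True)
        del population[population_size:]
        history.append(population[0][0])
    return population[0], history
```

Calling `evolve([0, 0, 0])` returns the best `(score, candidate)` pair and the best score after each generation. Because the evaluator is automatic and the best candidate is always retained, the best score never decreases, which is the property that makes this kind of search applicable to problems where progress can be measured systematically.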
Official results are in - Gemini achieved gold-medal level in the International Mathematical Olympiad! 🏆 An advanced version was able to solve 5 out of 6 problems. Incredible progress - huge congrats to @lmthang and the team!
deepmind.google
The International Mathematical Olympiad (“IMO”) is the world’s most prestigious competition for young mathematicians, and has been held annually since 1959. Each country taking part is represented by…
202
760
6K
Drastic progress on maths with Gemini 2.5! As a math undergrad, I am impressed 🤯 🥈 -> 🥇 ✅ Formal -> Informal ✅ Specialized model -> General model ✅ Available soon ✅ Huge thanks to IMO and congrats to all participants! Blog:
deepmind.google
The International Mathematical Olympiad (“IMO”) is the world’s most prestigious competition for young mathematicians, and has been held annually since 1959. Each country taking part is represented by…
16
80
762
Very excited to share that an advanced version of Gemini Deep Think is the first to have achieved gold-medal level in the International Mathematical Olympiad! 🏆, solving five out of six problems perfectly, as verified by the IMO organizers! It’s been a wild run to lead this
Super thrilled to share that our AI has now reached silver medalist level in Math at #imo2024 (1 point away from 🥇)! Since Jan, we now not only have a much stronger version of #AlphaGeometry, but also an entirely new system called #AlphaProof, capable of solving many more
79
231
2K
So I've been messing around with this and LLMs. The quality of Mathlib + AI is getting to the point where mathematicians with *minimal* knowledge of Lean should be able to (with AI assistance) *state* (and themselves verify that the statement has the correct mathematical
🔥 @GoogleDeepMind just dropped their "formal conjectures" project - formalizing statements of math's biggest unsolved mysteries in #LeanLang and #Mathlib! This Google-backed project is a HUGE step toward developing "a much richer dataset of formalized conjectures", valuable
16
37
232
there are infinitely many other methods i could try. each requires careful debugging/tuning but i'm limited by my bandwidth and motivation and even with access to every LLM i failed. this is why having infinite compute + an LLM evolve your code wins. it doesn't have to care.
1
1
30
Google released AlphaEvolve. I'm trying to get a sense of whether the problems are hard to solve numerically. Let's focus on problem B.1. i'm going to do this live.
lots of 'optimizing the constants in the proof' in this (very neat) google release. did you know Terry Tao is also building an application to try to verify estimates in analysis? i dreamed of similar things years ago, but now it feels really possible.
7
72
995