ElliotGlazer Profile Banner
Elliot Glazer Profile
Elliot Glazer

@ElliotGlazer

Followers
2K
Following
13K
Media
60
Statuses
1K

Lead mathematician at Epoch AI.

Joined November 2024
Don't wanna be here? Send us removal request.
@ElliotGlazer
Elliot Glazer
7 days
Some of you guys are cool. Be at Cambridge week of 9/15 🤫
Tweet media one
3
2
40
@ElliotGlazer
Elliot Glazer
2 days
RT @EpochAIResearch: Let’s take a look into GPT-5’s record-setting performance on FrontierMath. How did it perform on the holdout vs. non-h….
0
45
0
@ElliotGlazer
Elliot Glazer
2 days
If you’d like to experiment with models on Project Euler problems, please do so using so as not to interfere with the statistics of the main site!.
3
0
56
@ElliotGlazer
Elliot Glazer
2 days
GPT-5 twoshot it. I informed it after its first attempt that its code didn’t even answer the test cases correctly, and then it got it.
1
0
51
@ElliotGlazer
Elliot Glazer
2 days
Finally, my contact pointed out that 942 is a problem with the top difficulty rating which might be amenable to AI reasoning because it “is the sort of problem where doing free-association on an immense amount of background knowledge is likely to be a big leg up.”.
1
3
55
@ElliotGlazer
Elliot Glazer
2 days
“This one is quite interesting. It's on the machinery-heavy end compared to most human solutions, but it would be entirely believable that a human produced it, with one exception,” noting it effectively asserts that there are no Wall-Sun-Sun primes, which is an open question.
2
0
53
@ElliotGlazer
Elliot Glazer
2 days
I then tried 947 because my contact found this one to be the most surprising of GPT-5’s successes on the MathArena eval. Once again, GPT-5 oneshot it, and I sent the reasoning summary for commentary.
1
1
54
@ElliotGlazer
Elliot Glazer
2 days
He says “This is really remarkable. Very human-like solution, actually,” and was intrigued by the explicit observations GPT-5 makes in describing its solution.
1
1
67
@ElliotGlazer
Elliot Glazer
2 days
950 was the highest difficulty problem solved in MathArena’s evaluation. In my run, GPT-5 oneshot it. I then sent the reasoning summary to my Project Euler contact, who happens to be the problem contributor.
1
0
65
@ElliotGlazer
Elliot Glazer
2 days
I ran GPT-5 Pro on three Project Euler problems, 942 (difficulty: 100%), 947 (diff: 60%), and 950 (preliminary diff: 75%). Let’s go over them in reverse order.
11
29
456
@ElliotGlazer
Elliot Glazer
3 days
RT @littmath: @rishicomplex Nice article. FWIW I’m a bit surprised we haven’t already seen a model crack some mildly interesting (but atten….
0
3
0
@ElliotGlazer
Elliot Glazer
3 days
RT @SebastienBubeck: I copy pasted an unpublished manuscript of mine in ChatGPT and asked it to improve it. I expected that the method we'r….
0
47
0
@ElliotGlazer
Elliot Glazer
3 days
GPT-5 is the best public math model across the board, with top scores on all MathArena tests (including 3/4 successes on a Project Euler 75% !!), SotA on FrontierMath T1-T3, and perhaps even a couple never-before-solved T4's. 🤫.
@j_dekoninck
Jasper Dekoninck
4 days
Its score on Project Euler is the most impressive, outperforming o4-mini by 20% and solving a problem with a 75% difficulty rating 3/4 times.
Tweet media one
4
15
186
@ElliotGlazer
Elliot Glazer
4 days
RT @GregHBurnham: The 2025 AIME has fallen!. On MathArena, GPT-5 (high) has solved the one problem that no prior model had solved, 2/4 time….
0
8
0
@ElliotGlazer
Elliot Glazer
5 days
Google's AI falls hook, line, and sinker for obviously AI-generated math papers and announcements.
Tweet media one
2
1
50
@ElliotGlazer
Elliot Glazer
7 days
The X condition.
@mattyglesias
Matthew Yglesias
7 days
@ArmandDoma The content game is tough �� gotta be topical but also timeless.
0
0
3
@ElliotGlazer
Elliot Glazer
7 days
But hearing more reports these days of it also providing value via brainstorming and testing out rudimentary proof strategies
@nasqret
Bartosz Naskręcki
8 days
Here is a case study in the frontier math research. I know a general answer and have a work in progress preprint where I do more but we spent almost 2 years thinking about this topic. The model got to this level in 15 minutes.
Tweet media one
Tweet media two
Tweet media three
2
0
9
@ElliotGlazer
Elliot Glazer
7 days
A sentiment I’m hearing a lot from mathematicians! Already vast improvement beyond google for technical searches. This is the biggest utility it provides for research at the moment.
@littmath
Daniel Litt
7 days
One axis along which GPT-5 seems to me to be a significant improvement is search. A lot of the time I want to search the literature for mathematical object X with properties ABC. This is obviously very hard to google. Already had a success along these lines with GPT-5.
2
1
53
@ElliotGlazer
Elliot Glazer
8 days
RT @EpochAIResearch: GPT-5 sets a new record on FrontierMath! On our scaffold, GPT-5 with high reasoning effort scores 24.8% (±2.5%) and 8.….
0
37
0