
Noam Brown
@polynoamial
Followers
89K
Following
6K
Media
127
Statuses
1K
Researching reasoning @OpenAI | Co-created Libratus/Pluribus superhuman poker AIs, CICERO Diplomacy AI, and OpenAI o3 / o1 / š reasoning models
San Francisco, CA
Joined January 2017
Today, Iām excited to share with you all the fruit of our effort at @OpenAI to create AI models capable of truly general reasoning: OpenAI's new o1 model series! (aka š) Let me explain š§µ 1/
225
2K
11K
RT @sonyatweetybird: Our latest Training Data episode with the @OpenAI IMO Gold team is out!. @alexwei_ @polynoamial @SherylHsu02 joined toā¦.
0
16
0
It can be hard to āfeel the AGIā until you see an AI master a domain you care deeply about. Everyone will have their Lee Sedol moment at a different time.
the openai IMO news hit me pretty heavy this weekend. i'm still in the acute phase of the impact, i think. i consider myself a professional mathematician (a characterization some actual professional mathematicians might take issue with, but my party my rules) and i don't think i.
82
100
1K
RT @alexwei_: On IMO P6 (without going into too much detail about our setup), the model "knew" it didn't have a correct solution. The modelā¦.
0
164
0
We had each submitted proof graded by 3 external IMO medalists and there was unanimous consensus on correctness. We have also posted the proofs publicly so that anyone can verify correctness.
github.com
Contribute to aw31/openai-imo-2025-proofs development by creating an account on GitHub.
6/N In our evaluation, the model solved 5 of the 6 problems on the 2025 IMO. For each problem, three former IMO medalists independently graded the modelās submitted proof, with scores finalized after unanimous consensus. The model earned 35/42 points in total, enough for gold! š„.
3
7
266
RT @austinc3301: Iām giving a talk on the speed of progress on LLM capabilities in 3 hours, gotta update the slides šš.
0
33
0
Their bet allowed for formal math AI systems (like AlphaProof). In 2022, almost nobody thought an LLM could be IMO gold level by 2025.
We are seeing much faster AI progress than **Paul Christiano** and **Yudkowsky** predicted, who had gold in 2025 at 8% and 16% respectively, by methods that are more general than expected.
35
69
1K
It takes us a few months to turn the experimental research frontier into a product. But progress is so fast that a few months can mean a big difference in capabilities.
So, all the models underperform humans on the new International Mathematical Olympiad questions, and Grok-4 is especially bad on it, even with best-of-n selection? Unbelievable!
32
51
972
@OpenAI In case you stumbled upon this and don't know what I'm talking about:
1/N Iām excited to share that our latest @OpenAI experimental reasoning LLM has achieved a longstanding grand challenge in AI: gold medal-level performance on the worldās most prestigious math competitionāthe International Math Olympiad (IMO).
4
3
211
I think it's safe to say this @OpenAI IMO gold result came as a bit of a surprise to folks
81
155
3K
Sheryl (@sherylhsu02) was our first hire onto the multi-agent team. Within a few months of joining, she helped to make this possible. We're so lucky to have her on the team!.
Watching the model solve these IMO problems and achieve gold-level performance was magical. A few thoughts š§µ.
28
24
739