
Hassan Hayat 🔥
@TheSeaMouse
Followers
5K
Following
158K
Media
2K
Statuses
12K
Aspiring Engineer @ General Cognition https://t.co/D4gDyw97gu
Austin, TX
Joined October 2011
Imagine giving up on manufacturing semiconductors just as we are seeing the largest compute and infrastructure scale outs in history.
Intel, the home of Moore's Law, for the first time in history, is evaluating if it will continue at the leading edge. From its 10-Q. "However, if we are unable to secure a significant external customer and meet important customer milestones for Intel 14A, we face the prospect
1
1
2
1/N I’m excited to share that our latest @OpenAI experimental reasoning LLM has achieved a longstanding grand challenge in AI: gold medal-level performance on the world’s most prestigious math competition—the International Math Olympiad (IMO).
0
1
9
This may be the breakthrough of the year. The model simulating the tools internally (no environment) and getting a solid answer at the end after hours of thought. Flies in the face of LeCun's arguments.
The model solves these problems without tools like lean or coding, it just uses natural language, and also only has 4.5 hours. We see the model reason at a very high level - trying out different strategies, making observations from examples, and testing hypothesis.
0
0
2
Superintelligence is within view.
Today, we at @OpenAI achieved a milestone that many considered years away: gold medal-level performance on the 2025 IMO with a general reasoning LLM—under the same time limits as humans, without tools. As remarkable as that sounds, it’s even more significant than the headline 🧵.
0
0
2
We need a @FabrizioRomano of AI to keep track of all these transfers.
Scoop: Boris Cherny and Cat Wu are back at Anthropic, two weeks after joining Cursor. 🤯🤯🤯 .
0
0
2
RT @_jasonwei: New blog post about asymmetry of verification and "verifier's law": Asymmetry of verification–the i….
0
242
0
This but with agents.
Introducing SOAR 🚀, a self-improving framework for prog synth that alternates between search and learning (accepted to #ICML!). It brings LLMs from just a few percent on ARC-AGI-1 up to 52%. We’re releasing the finetuned LLMs, a dataset of 5M generated programs and the code. 🧵
1
1
2
New superintelligence benchmark just dropped.
PSA: there’s a guy named Soham Parekh (in India) who works at 3-4 startups at the same time. He’s been preying on YC companies and more. Beware. I fired this guy in his first week and told him to stop lying / scamming people. He hasn’t stopped a year later. No more excuses.
1
0
4