Siméon

@Simeon_Cps

Followers
9K
Following
25K
Media
501
Statuses
6K

Creating more common knowledge on AI risks, one tweet at a time. Founder in Paris. AI auditing, standardization & governance.

Joined May 2020
@Simeon_Cps
Siméon
7 days
The wave that first hit protein folding scientists in 2019 is now coming for mathematicians.
@_Dave__White_
Dave White
8 days
the openai IMO news hit me pretty heavy this weekend. i'm still in the acute phase of the impact, i think. i consider myself a professional mathematician (a characterization some actual professional mathematicians might take issue with, but my party my rules) and i don't think i.
1
0
5
@Simeon_Cps
Siméon
9 days
Google is crushing it. They got their gold in natural language with Gemini. It seems like they mostly caught up to OpenAI, in less than a year.
@demishassabis
Demis Hassabis
9 days
Official results are in - Gemini achieved gold-medal level in the International Mathematical Olympiad! 🏆 An advanced version was able to solve 5 out of 6 problems. Incredible progress - huge congrats to @lmthang and the team!
2
0
18
@Simeon_Cps
Siméon
10 days
You can literally solve it though. Why aren't you focusing your efforts on this?
@elonmusk
Elon Musk
10 days
At times, AI existential dread is overwhelming.
8
1
97
@Simeon_Cps
Siméon
11 days
i'd like to read some commentary by people with relevant domain knowledge about these proofs. Maybe @davidad, @an_interstice or @FabienDRoger? Do they feel like the kind of proofs that are relying on a ton of knowledge and leveraging how knowledgeable these models are or do.
@alexwei_
Alexander Wei
11 days
10/N If you want to take a look, here are the model’s solutions to the 2025 IMO problems! The model solved P1 through P5; it did not produce a solution for P6. (Apologies in advance for its … distinct style—it is very much an experimental model 😅).
1
0
7
@Simeon_Cps
Siméon
11 days
Here we are, crushing benchmarks that characterize the top of human fluid intelligence.
@alexwei_
Alexander Wei
11 days
1/N I’m excited to share that our latest @OpenAI experimental reasoning LLM has achieved a longstanding grand challenge in AI: gold medal-level performance on the world’s most prestigious math competition—the International Math Olympiad (IMO).
2
1
17
@Simeon_Cps
Siméon
13 days
Meta’s risk management framework better than Google DeepMind’s 👀. That’s what we found in our updated ratings focused on risk management frameworks of companies that signed the Seoul Frontier Safety Commitments. Findings: 1. Anthropic is still in the lead but Anthropic’s RSP
@TIME
TIME
13 days
Top AI companies have ‘unacceptable’ risk management, studies say
5
8
75
@Simeon_Cps
Siméon
14 days
I'm seeing contradictory reports on whether the H20 license is only for existing inventory (which is 100k-200k GPUs afaiu) or if it's longer term. Which one is it?
0
0
4
@Simeon_Cps
Siméon
14 days
So, uh, what's the Windsurf deal? OpenAI pays the $3B, Google acqui-hires the founders and Cognition acquires the leftovers?
2
0
11
@Simeon_Cps
Siméon
15 days
i feel like Claude Pro (whether personal or team plan) is limited to like 5 Opus interactions per day? it feels really overpriced for what it gives access to. I'm constantly rate limited despite using it pretty sparsely.
3
0
5
@Simeon_Cps
Siméon
16 days
pure alpha just dropped.
@jacob_feldgoise
Jacob Feldgoise
16 days
So excited to share that today @CSETGeorgetown and @emergingtechobs are launching an ✨updated✨ version of our chip supply chain explorer! We've got:
👉 New data
👉 New features
👉 New analysis
Links in thread
0
0
5
@Simeon_Cps
Siméon
16 days
1
0
1
@Simeon_Cps
Siméon
16 days
any reading recommendations for understanding GPUs at a deep technical level?
2
0
4
@Simeon_Cps
Siméon
17 days
Moonshot, the Margin Slayers.
@teortaxesTex
Teortaxes▶️ (DeepSeek 推特🐋铁粉 2023 – ∞)
18 days
For a wide range of tasks, K2 is probably the cheapest model by far right now, in terms of actual costs per task. It is just cheap, it has no long-CoT, and it does not yap. This is very refreshing. Like the best of Anthropic models, but cheaper and even more to the point.
0
0
2
@Simeon_Cps
Siméon
19 days
These costs 👀
@AndrewCurran_
Andrew Curran
19 days
Rumblings all morning it was going to arrive, and here it is. Open source, and comparable to the best models in the world.
0
0
17
@Simeon_Cps
Siméon
19 days
Happy to support if there's any knowledge/what-should-we-do bottleneck. You guys have Hendrycks though so I'm guessing you know what you have to do.
0
0
4
@Simeon_Cps
Siméon
19 days
This could be up to 10x the compute that went into o3 post-training btw.
@Simeon_Cps
Siméon
2 months
@teortaxesTex Do you know how much FLOP they spent on this post-training? I've heard that o3 was 1 OOM away from saturating the equivalent of compute OpenAI spends on pre-training. So that would still make it not too far from what DS can reach.
0
0
16
@Simeon_Cps
Siméon
19 days
@Simeon_Cps
Siméon
1 year
People used to think that AI was a software thing. As “pushing the AI frontier” looks increasingly like that, it’s gonna get clearer why @xAI has a surprisingly high chance of getting ahead: Elon is unparalleled in solving hard logistics problems.
0
0
2
@Simeon_Cps
Siméon
19 days
Not saying "I told you so" but... I told you so :)
@deedydas
Deedy
20 days
Insane that Elon Musk has pulled it off again, absolutely crushing the AI wars with Grok 4. Summarizing the core announcements:
- Post-training RL spend == pretraining spend
- $3/M input toks, $15/M output toks, 256k context, price 2x beyond 128k
- #1 on Humanity’s Last Exam
3
0
10
@Simeon_Cps
Siméon
19 days
Exciting! Not sure it's gonna work but definitely worth fighting this fight that most have dropped! Congrats Luke & Rudolf.
@WorkshopLabsPBC
Workshop Labs
21 days
Announcing Workshop Labs, a public benefit company.
0
0
6