
Peter Gostev
@petergostev
Followers
4K
Following
1K
Media
231
Statuses
598
London 🇬🇧 AI stuff https://t.co/bkfw1nxdmJ
Joined June 2025
Lead OpenAI researcher: "There are a lot of things that we have internally that we are doing, that allow the model to work for much longer. We still didn't figure out the right product to deploy them, but the models can think for 30 minutes, an hour, two hours on certain types of
How GPT-5 thinks, with @OpenAI VP of Research @MillionInt 00:00 - Intro 01:01 - What Reasoning Actually Means in AI 02:32 - Chain of Thought: Models Thinking in Words 05:25 - How Models Decide How Long to Think 07:24 - Evolution from o1 to o3 to GPT-5 11:00 - The Road
7
15
220
Lead OpenAI researcher: "GPT-5, in some way, can be considered o3.1 - iteration of the same concept" "What I'm after right now is something next, what would be a significant jump to how we interact with models, that are more capable, think for even longer and interact with even
How GPT-5 thinks, with @OpenAI VP of Research @MillionInt 00:00 - Intro 01:01 - What Reasoning Actually Means in AI 02:32 - Chain of Thought: Models Thinking in Words 05:25 - How Models Decide How Long to Think 07:24 - Evolution from o1 to o3 to GPT-5 11:00 - The Road
5
13
173
One is Microsoft AI, the other is PI ai - can you tell which one is which?
I'm thrilled to share our new visual identity for @MicrosoftAI. Grounded in warmth, trust + humanity, we envision a world where technology makes life meaningfully better. Thank you to the MAI design team for bringing this to life. You can explore it now at https://t.co/bZMLBW2bMy
3
1
13
Fruit Machine simulator - testing Haiku 4.5 and all other Claude models over the last 18 months. Remember how Sonnet 3.5 was everyone's favourite? (you embarrassed now?) Haiku 4.5 (last in the video) is a huge jump from Haiku 3.5 but quite a bit behind Sonnet 4.5 and Opus
Introducing Claude Haiku 4.5: our latest small model. Five months ago, Claude Sonnet 4 was state-of-the-art. Today, Haiku 4.5 matches its coding performance at one-third the cost and more than twice the speed.
0
0
5
Nvidia's DGX Spark goes on sale today and @lmsysorg have done a brilliant bit of benchmarking vs other systems. In short, it is a very usable system for smaller models and closer in performance to Apple's devices (e.g. Mac Mini M4 Pro), but it is priced at $4,000 vs $1,400 for
SGLang In-Depth Review of the NVIDIA DGX Spark is LIVE! Thanks to @NVIDIA's early access program, SGLang makes its first ever appearance in a consumer product, the brand-new DGX Spark. The DGX Spark's 128GB Unified Memory and Blackwell architecture set a new standard for
61
156
991
From getting ChatGPT Pulse notifications every morning, I learned that OpenAI engineers think that everyone wakes up at 8am
2
0
15
Nvidia's GB200 is a beast on all dimensions - raw performance, price vs performance or tokens per megawatt. The last one is particularly important in a world of constrained energy - if a data centre has only 1 GW of power, you would get 50-100% more from GB200 GPUs than any
Today we are launching InferenceMAX! We have support from Nvidia, AMD, OpenAI, Microsoft, Pytorch, SGLang, vLLM, Oracle, CoreWeave, TogetherAI, Nebius, Crusoe, HPE, SuperMicro, Dell It runs every day on the latest software (vLLM, SGLang, etc) across hundreds of GPUs, $10Ms of
0
0
18
OpenAI Infra team: would you like to go with Nvidia, AMD or a custom chip? @sama: yes
20
19
467
In case you want to listen to it: voiceover with @SunoMusic, video with Sora, subtitled with @veedstudio
1
0
3
My short essay on LLMs & AGI: Have we reached the 'AGI of flight'? Not even close. Our planes can carry hundreds of passengers across the globe in just a few hours, yet nothing we build matches a godwit flying non-stop for eleven days without rest or the agility of a bat
2
0
12
Something I haven't thought about too much, but inference prices for open source models shift quite a bit over time - e.g. I've taken 6 models for which I had snapshots of data in August and October 2025 and variance has been quite large: ~30% drops for OpenAI models and ~60%
1
3
7
Lots of people have never created any longer-form AI videos; you can absolutely maintain consistency, even with much crappier models than this one. The key feature that is missing is start & end frame; it is a pretty regular feature in many other models and I'm sure OpenAI can add it if
0
0
12
It normally costs ~$60m to make a Hollywood movie; with Sora 2 Pro HD it would cost you $2,700 for a 90-minute feature (or $540 for Sora 2 standard). Even if you assume a 1% clip success rate, the cost would still only be $270k (or $54k). Sora-2 feels like it is the closest to
47
36
385
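The arithmetic in the Sora 2 cost post above can be reproduced in a few lines. This is a rough sketch that uses only the figures quoted in the post ($2,700 and $540 for a 90-minute feature, a 1% clip success rate). The per-second rates are back-calculated from those totals rather than taken from any price list, and the function names are illustrative.

```python
FEATURE_MINUTES = 90  # length of the feature, from the post

# Total generation cost quoted in the post, in USD.
QUOTED_COST_USD = {"Sora 2 Pro HD": 2700, "Sora 2 standard": 540}


def implied_rate_per_second(total_usd: int, minutes: int) -> float:
    """Per-second price implied by a quoted total (derived, not an official rate)."""
    return total_usd / (minutes * 60)


def cost_with_retries(total_usd: int, attempts_per_usable_clip: int) -> int:
    """Total spend if every usable clip takes this many attempts on average.

    A 1% success rate corresponds to 100 attempts per usable clip.
    """
    if attempts_per_usable_clip < 1:
        raise ValueError("attempts_per_usable_clip must be at least 1")
    return total_usd * attempts_per_usable_clip


if __name__ == "__main__":
    for model, total in QUOTED_COST_USD.items():
        rate = implied_rate_per_second(total, FEATURE_MINUTES)
        at_one_percent = cost_with_retries(total, 100)
        print(f"{model}: ${rate:.2f}/s implied, ${at_one_percent:,} at 1% success")
```

At 100 attempts per usable clip the totals come to $270,000 and $54,000, which matches the $270k and $54k in the post.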
It was cool to see one of my slides featured in the 'Open Models' talk by @dkundel at the OpenAI DevDay. Btw, really worth checking out this and other talks they have uploaded
0
0
5
ChatGPT Agent: "Go to the Agent Builder and create a meme generator agent". It was magnificent, 20 minutes of work, checking documentation, created 9 notes, custom prompts, MCPs... None of it worked or was even close to working but I appreciated the effort
56
36
598