Peter Gostev @petergostev X Profile

Peter Gostev

@petergostev

Followers

4K

Following

1K

Media

231

Statuses

598

London 🇬🇧 AI stuff https://t.co/bkfw1nxdmJ

Joined June 2025

Don't wanna be here? Send us removal request.

Peter Gostev

@petergostev

18 hours

Lead OpenAI researcher: "There a lot of things that we have internally that we are doing, that allow the model to work for much longer. We still didn't figure out the right product to deploy them, but the models can think for 30 minutes, hour, two hours on certain types of

Matt Turck

@mattturck

2 days

How GPT-5 thinks, with @OpenAI VP of Research @MillionInt 00:00 - Intro 01:01 - What Reasoning Actually Means in AI 02:32 - Chain of Thought: Models Thinking in Words 05:25 - How Models Decide How Long to Think 07:24 - Evolution from o1 to o3 to GPT-5 11:00 - The Road

7

15

220

Peter Gostev

@petergostev

1 day

Claude swooping in

1

2

54

Peter Gostev

@petergostev

2 days

Lead OpenAI researcher: "GPT-5, in some way, can be considered o3.1 - iteration of the same concept" "What I'm after right now is something next, what would be a significant jump to how we interact with models, that are more capable, think for even longer and interact with even

Matt Turck

@mattturck

2 days

How GPT-5 thinks, with @OpenAI VP of Research @MillionInt 00:00 - Intro 01:01 - What Reasoning Actually Means in AI 02:32 - Chain of Thought: Models Thinking in Words 05:25 - How Models Decide How Long to Think 07:24 - Evolution from o1 to o3 to GPT-5 11:00 - The Road

5

13

173

Peter Gostev

@petergostev

2 days

Claude Haiku progress: 3 > 3.5 > 4.5

1

0

7

Peter Gostev

@petergostev

2 days

Don't overthink it? No, GPT-5, please do

3

0

5

Peter Gostev

@petergostev

2 days

One is Microsoft AI another is PI ai - can you tell which one is which?

Mustafa Suleyman

@mustafasuleyman

3 days

I’m thrilled to share our new visual identity for @MicrosoftAI. Grounded in warmth, trust + humanity, we envision a world where technology makes life meaningfully better. Thank you to the MAI design team for bringing this to life. You can explore it now at https://t.co/bZMLBW2bMy

3

1

13

Peter Gostev

@petergostev

3 days

Fruit Machine simulator - testing Haiku 4.5 and all other Claude models over the last 18 months. Remember how Sonnet 3.5 was everyone's favourite? (you embarrassed now? 😉) Haiku 4.5 (last in the video) is a huge jump from Haiku 3.5 but quite a bit behind Sonnet 4.5 and Opus

Claude

@claudeai

3 days

Introducing Claude Haiku 4.5: our latest small model. Five months ago, Claude Sonnet 4 was state-of-the-art. Today, Haiku 4.5 matches its coding performance at one-third the cost and more than twice the speed.

0

5

Peter Gostev

@petergostev

4 days

Nvidia's DGX Spark goes on sale today and @lmsysorg have done a brilliant bit of benchmarking vs other systems. In short, it is a very usable system for smaller models and closer in performance to Apple's devices (e.g. Mac Mini M4 Pro), but it is priced at $4,000 vs $1,400 for

LMSYS Org

@lmsysorg

5 days

🚀 SGLang In-Depth Review of the NVIDIA DGX Spark is LIVE! Thanks to @NVIDIA’s early access program, SGLang makes its first ever appearance in a consumer product, the brand-new DGX Spark. The DGX Spark’s 128GB Unified Memory and Blackwell architecture set a new standard for

61

156

991

Peter Gostev

@petergostev

4 days

From getting ChatGPT Pulse notifications every morning, I learned that OpenAI engineers think that everyone wakes up at 8am

2

0

15

Peter Gostev

@petergostev

5 days

Nvidia's GB200 is a beast on all dimensions - raw performance, price vs performance or tokens per megawatt. The last one is particularly important in a world of constrained energy - if a data centre has only 1 GW of power, you would be 50-100% more from GB200 GPUs than any

Dylan Patel

@dylan522p

9 days

Today we are launching InferenceMAX! We have support from Nvidia, AMD, OpenAI, Microsoft, Pytorch, SGLang, vLLM, Oracle, CoreWeave, TogetherAI, Nebius, Crusoe, HPE, SuperMicro, Dell It runs every day on the latest software (vLLM, SGLang, etc) across hundreds of GPUs, $10Ms of

0

18

Peter Gostev

@petergostev

5 days

OpenAI Infra team: would you like to go with Nvidia, AMD or a custom chip? @sama: yes

20

19

467

Peter Gostev

@petergostev

6 days

Can't tell if this is good or bad for my productivity

0

5

Peter Gostev

@petergostev

7 days

In case you want to listen to it 😀 voiceover with @SunoMusic, video with Sora, subtitled with @veedstudio

1

0

3

Peter Gostev

@petergostev

7 days

My short essay on LLMs & AGI: Have we reached the 'AGI of flight'? Not even close. Our planes can carry hundreds of passengers across the globe in just a few hours, yet nothing we build matches a godwit flying non-stop for eleven days without rest or the agility of a bat

2

0

12

Peter Gostev

@petergostev

7 days

Imagine one day you are working for a cool fast growing tech company, then @billpeeb pops up with Sora 2 and calls your social media behemoth a 'legacy platform' 🫠

TBPN

@tbpn

8 days

OpenAI's Head of Sora @billpeeb says a stunning 70% of Sora's nearly 2 million weekly active users are creating content.

0

4

Peter Gostev

@petergostev

8 days

Something I haven't thought about too much, but inference prices for open source model shift quite a bit over time - e.g. I've taken 6 models for which I had snapshots of data in August and October 2025 and variance has been quite large: ~30% drops for OpenAI models and ~60%

1

3

7

Peter Gostev

@petergostev

8 days

Lots of people never created any longer form AI videos, you can absolutely maintain consistency and with much crappier models that this one. The key feature that is missing is start & end frame, it is a pretty regular feature in many other models and I'm sure OpenAI can add it if

0

12

Peter Gostev

@petergostev

9 days

It normally costs ~$60m to make a Hollywood movie, with Sora 2 Pro HD it would cost you $2,700 for a 90-minute feature (or $540 for Sora 2 standard) Even if you assume 1% clip success rate, the cost would still only be $270k (or $54k). Sora-2 feels like is the closest to

47

36

385

Peter Gostev

@petergostev

9 days

It was cool to see one of my slides featured on the 'Open Models' talk by @dkundel at the OpenAI DevDay Btw, really worth checking out this and other talks they have uploaded

OpenAI

@OpenAI

10 days

That's a wrap on DevDay [2025], all sessions are now live to replay. https://t.co/vE6sEjx25V

0

5

Peter Gostev

@petergostev

9 days

ChatGPT Agent: "Go to the Agent Builder and create a meme generator agent". It was magnificent, 20 minutes of work, checking documentation, created 9 notes, custom prompts, MCPs... None of it worked or was even close to working but I appreciated the effort

56

36

598