Ted Sanders @sandersted X Profile

Ted Sanders

@sandersted

Followers

9K

Following

1K

Media

14

Statuses

747

Research at OpenAI. Be kind to others, and yourself.

https://t.co/e5B14lvMQK

San Francisco, CA

Joined September 2009

Don't wanna be here? Send us removal request.

Ted Sanders

@sandersted

2 months

LLMs are still far from being able to do most human work, but the pace of progress impresses me: in ~64 weeks, they've gone from 12% to 48% on GDPval. (also impressed that OpenAI keeps publishing useful papers & data, against its profit incentives) https://t.co/9Pzn26div1

openai.com

We’re introducing GDPval, a new evaluation that measures model performance on economically valuable, real-world tasks across 44 occupations.

0

14

32

Mostafa Rohaninejad

@MostafaRohani

2 months

1/n I’m really excited to share that our @OpenAI reasoning system got a perfect score of 12/12 during the 2025 ICPC World Finals, the premier collegiate programming competition where top university teams from around the world solve complex algorithmic problems. This would have

140

450

3K

Ted Sanders

@sandersted

3 months

"super curious to see how quickly it goes from a curiosity to something useful." one of my favorite things about writing is how it helps you travel through time, by reminding yourself what you thought in the past. I don't remember being impressed by GPT-3, but apparently I was!

0

2

28

Ted Sanders

@sandersted

3 months

"obviously it's dumb and makes mistakes and apparently requires a fair bit of tuning to get good results, but when it does get those good results, it seems magical. I'd really like to understand this magic better. (3/4)

1

15

Ted Sanders

@sandersted

3 months

"with the first few examples, I brushed it off with my usual pessimism as probably an overfit model regurgitating text from its training data, but having seen so many now, it now seems like this model has really captured some of the essence of language. (2/4)

1

13

Ted Sanders

@sandersted

3 months

found an email I wrote to friends in 2020 about GPT-3: "the GPT-3 examples are wild! they have substantially lifted my feelings of what machine learning is capable of in the next decade. more impressive than everything else OpenAI has done put together. (1/4)

3

6

85

Lisan al Gaib

@scaling01

3 months

As per tradition: A thread with all positive results I created or shared about GPT-5 It's not my fault negative results take-off more than positive ones I can't make a thread with more than 25 posts, but here you go:

9

162

Sam Altman

@sama

3 months

today we are significantly increasing rate limits for reasoning for chatgpt plus users, and all model-class limits will shortly be higher than they were before gpt-5. we will also shortly make a UI change to indicate which model is working.

1K

636

11K

Lech Mazur

@LechMazur

3 months

GPT-5 (medium reasoning) is the new leader on the Short Story Creative Writing benchmark! GPT-5 mini (medium reasoning) is much better than o4-mini (medium reasoning). Claude Opus 4.1 shows gains over Opus 4.

38

34

208

Daniel J

@djarosai

3 months

Full results: GPT-5 maintains strong performance. GPT-5-mini notably competitive with o3 and gemini-2.5-pro. Absolute accuracy numbers depend on instruction and task complexity and will vary across settings—key takeaway is relative model rankings and degradation patterns

1

3

11

Cognition

@cognition

3 months

GPT-5 represents a huge step up over previous OpenAI models, such as GPT-4.1. We believe GPT-5 is at the frontier of agentic ability and shines on tasks that require complex code understanding. On our junior SWE evals, GPT-5 is particularly strong at code exploration and

2

9

63

eric zakariasson

@ericzakariasson

3 months

gpt-5 is now free in @cursor_ai, go try it out! (reload cursor if you don't see it yet) we've worked closely with @OpenAI team to make this happen, and together we also put out a prompting guide for gpt-5. here are some examples prompts we've seen working well

Cursor

@cursor_ai

3 months

GPT-5 is now available in Cursor. It’s the most intelligent coding model our team has tested. We're launching it for free for the time being. Enjoy!

37

38

695

Ted Sanders

@sandersted

3 months

^reminds me of the saying that a film is born three times: first in writing, then filming, and lastly in editing. AI product experiences are also born three times: first in model training, then in product, and lastly in prompting. the right prompt can cure a lot of flaws!

1

16

Lech Mazur

@LechMazur

3 months

GPT-5 (medium reasoning) sets a new record on the Confabulations/Hallucinations on Provided Texts benchmark!

18

17

134

Ted Sanders

@sandersted

3 months

a cool thing you get to see building AI products: you can plug a new model into a product and see it kinda suck. but then you do a week of prompt & product optimization, and the same model shines. the user experience is not just the model; it's the model + product + prompt.

2

49

Ted Sanders

@sandersted

3 months

GPT-5 is here! it's way better at coding - not just in pointless evals, but real usage. using it in @cursor_ai, I think my favorite leap is code Q&A - it's been truly useful at helping me figure out our complex RL codebase. very tenacious digger! https://t.co/TRwscI3Ffd

openai.com

The best model for coding and agentic tasks.

5

1

9

Ted Sanders

@sandersted

3 months

GPT-5 is out! it's by no means perfect, but it's better than what's come before. if you have complaints about its coding, hit me up and I'll see if we can make future models even better for you

lmarena.ai

@arena

3 months

GPT-5 is here - and it’s #1 across the board. 🥇#1 in Text, WebDev, and Vision Arena 🥇#1 in Hard Prompts, Coding, Math, Creativity, Long Queries, and more Tested under the codename “summit”, GPT-5 now holds the highest Arena score to date. Huge congrats to @OpenAI on this

7

6

91

lmarena.ai

@arena

3 months

GPT-5 is here - and it’s #1 across the board. 🥇#1 in Text, WebDev, and Vision Arena 🥇#1 in Hard Prompts, Coding, Math, Creativity, Long Queries, and more Tested under the codename “summit”, GPT-5 now holds the highest Arena score to date. Huge congrats to @OpenAI on this

OpenAI

@OpenAI

3 months

GPT-5 is here. Rolling out to everyone starting today. https://t.co/rOcZ8J2btI

117

403

3K

Ted Sanders

@sandersted

1 year

AGI is hard to define. my preferred definition of AGI is a computer system that can can accomplish a task impossible for 100 human geniuses working together, such as publishing a blog post with a single canonical spelling of GPT-4o / gpt-4o / gpt4o

25

10

277

Ted Sanders

@sandersted

1 year

When these electrical connections are connected to a load, then electrons can get through and accompany the lithium ions flowing through the electrolyte/separator. And these electrons do work, powering your lightbulb or computer or gameboy or whatever. 💡

0

4