Jerry Tworek @MillionInt X Profile

Jerry Tworek

@MillionInt

Followers

24K

Following

2K

Media

130

Statuses

2K

Berry farmer @ OpenAI | o3, o1, GPT4, ChatGPT, Codex, Solved Rubik’s cube with robotic hand | cautious AI optimist

https://t.co/te9BfYvMr4

San Francisco, CA

Joined January 2013

Don't wanna be here? Send us removal request.

Jerry Tworek

@MillionInt

1 year

We trained a model and it is good in some things

OpenAI

@OpenAI

1 year

We're releasing a preview of OpenAI o1—a new series of AI models designed to spend more time thinking before they respond. These models can reason through complex tasks and solve harder problems than previous models in science, coding, and math.

28

55

1K

Jerry Tworek

@MillionInt

1 day

Haskell is amazing because you can name functions liftM2 and foldr1 and everyone is chill about it

2

36

AIOZ Network

@AIOZNetwork

2 days

Why Adaptive Bitrate Streaming (ABR) supercharges AIOZ Stream? Our player adjusts quality in real-time to your device & network speed, delivering maximum QoE (Quality of Experience). Key Features: → Global DePIN devices for minimized latency → Increased redundancy during

11

25

115

Jerry Tworek

@MillionInt

1 day

That reminds me of some tweet….

Sam Altman

@sama

2 days

A thing often in common among great startup investors, founders, and researchers: Trading making a lot of small mistakes in exchange for getting a few giant wins. (Surprisingly many people seem to prefer a few big mistakes in exchange for a lot of small wins.)

2

1

114

Jerry Tworek

@MillionInt

6 days

Ablations are for the weak

yi

@agihippo

6 days

ablations are for the weak. just yolo your runs. (ok, do some small amount of ablations, but don't over do it). instinct is everything in ML and AI.

2

93

Jerry Tworek

@MillionInt

7 days

yes

roon

@tszzl

7 days

when i've observed the greats operating at their absolute best they resemble cracked out starcraft players to me. at the top of their game they embody action alone and forget causes, consequence, circumstance. this looks psychopathic from the outside: how does he not crack under

1

0

43

George Jeffreys

@GeJeffreys

9 days

@karenvaites Honestly, my takeaway from this isn't that tracking is bad, it's that teachers can't be trusted to do it right. Which, functionally, might be the same thing.

0

14

Jerry Tworek

@MillionInt

7 days

If a computer makes a bad management decision, it will be met with heavy gradient update Its more than can be guaranteed about many humans

Emmett Shear

@eshear

7 days

Computers are beginning to make management decisions. Therefore we must begin to hold computers accountable for their decisions.

14

20

351

Jerry Tworek

@MillionInt

7 days

It doesn't matter how efficient you are at optimizing the wrong objective

Prithviraj (Raj) Ammanabrolu

@rajammanabrolu

1 year

ML Systems people need to be stopped. Half of these kernel fusions are not numerically stable 😭 Yes it makes GPU go brr but it also breaks policy gradient theorem and makes me question my life decisions every day

3

1

55

Jerry Tworek

@MillionInt

7 days

🤣🤣🤣

tokenbender

@tokenbender

7 days

researchers when asked to switch from bf16 to fp16 and do loss scaling because it is way better for RL

0

1

45

Jerry Tworek

@MillionInt

7 days

Advantage of founder-led companies is the same as of original authors of the codebase They know all the context on why certain decisions were made Whoever inherits complex systems always has this dilemma "if I change this weirdly looking thing will all everything collapse"

8

3

105

Kanu Gulati, Partner @Khosla Ventures

@KanuGulati

2 days

So, I just got back from Hangzhou, China; Attended @IROS2025. Takeaway: China’s robotics ecosystem is moving faster and more coordinated than the US

14

8

48

Jerry Tworek

@MillionInt

7 days

Codex is far from perfect but the ceiling is actually very high what it can do with good handling. First real agentic product in the world

kache

@yacineMTB

7 days

codex, make it faster >okay done okay codex benchmark dot py says that its faster. can you make it more faster? >okay done okay cool. its faster again. can you add more timing logs so we can make it even faster >okay

10

8

158

Jerry Tworek

@MillionInt

9 days

I don’t know what kink this is but it have a hard time saying "Grace-Blackwell superchip” without a shiver of excitement going down my spine

4

2

66

Jerry Tworek

@MillionInt

10 days

Humans have been sycophantic and reward hacking since forever

2

3

168

Jerry Tworek

@MillionInt

10 days

In the world of online learning every eval is instantly contaminated Train test split is an anachronism of old days, theres only past and future

6

3

148

Jerry Tworek

@MillionInt

11 days

I’ve always been a great fan of vertical integration - I’ve found inefficiencies around company boundaries jarring TSMC proves me wrong, that in some cases it is a vastly superior strategy to go for a thin slice of a wide market amortising capex through all of the economy

9

7

175

Jerry Tworek

@MillionInt

12 days

Real tragedy is most of those are not even coming from good models

Beff – e/acc

@beffjezos

12 days

This is the greatest flippening of all time. Artificial language models have surpassed biological language models in terms of content generated.

13

6

189

Jerry Tworek

@MillionInt

12 days

The human mind craves stories and makes them from everything around it

9

5

94

Jerry Tworek

@MillionInt

14 days

It’s a good agent

Haider.

@slow_developer

14 days

GPT-5 Codex is the best launch of Q4 2025 it follows instructions, sticks to the guidelines, keeps things simple, and produces optimized code if you're into vibe coding, it might not be for you. but if you know what you want, it beats claude code in every way

5

4

164

Jerry Tworek

@MillionInt

14 days

This is the way

Mitchell Hashimoto

@mitchellh

14 days

I’ve got agents doing some Ghostty work while I’m in an Uber. I’ve got some rote refactoring I’ve been wanting to do and now’s the perfect time. I just review the work every so often.

4

2

64

Jerry Tworek

@MillionInt

14 days

RL really feels like a technical revolution within revolution. It spun up a completely new wave of startups, products and thought leaders on top of a huge wave we were already riding on

16

7

230

Jerry Tworek

@MillionInt

14 days

Codex is giving me factorio dopamine hits times a healthy multiplier

roon

@tszzl

14 days

managing fleets of agents should be more fun than playing factorio with the UI/UX to boot

12

18

406

Jerry Tworek

@MillionInt

15 days

Technical decisions matter kids

Aurko Roy

@aurko79

15 days

Who would have thought that a multi trillion dollar cap company could have been thrown into such chaos (layoffs) by a single technical decision they made a year ago - using expert choice MoEs for their frontier model.

3

8

187