MillionInt Profile Banner
Jerry Tworek Profile
Jerry Tworek

@MillionInt

Followers
24K
Following
2K
Media
130
Statuses
2K

Berry farmer @ OpenAI | o3, o1, GPT4, ChatGPT, Codex, Solved Rubik’s cube with robotic hand | cautious AI optimist

San Francisco, CA
Joined January 2013
Don't wanna be here? Send us removal request.
@MillionInt
Jerry Tworek
1 year
We trained a model and it is good in some things
@OpenAI
OpenAI
1 year
We're releasing a preview of OpenAI o1—a new series of AI models designed to spend more time thinking before they respond. These models can reason through complex tasks and solve harder problems than previous models in science, coding, and math.
28
54
1K
@MillionInt
Jerry Tworek
3 days
Ablations are for the weak
@agihippo
yi
3 days
ablations are for the weak. just yolo your runs. (ok, do some small amount of ablations, but don't over do it). instinct is everything in ML and AI.
2
2
92
@Polymarket
Polymarket
7 days
BREAKING: Mamdani's odds collapse in NYC Mayoral Election. If he continues falling at the rate he has the past 24h, Cuomo would be the projected winner.
2K
2K
18K
@MillionInt
Jerry Tworek
4 days
yes
@tszzl
roon
4 days
when i've observed the greats operating at their absolute best they resemble cracked out starcraft players to me. at the top of their game they embody action alone and forget causes, consequence, circumstance. this looks psychopathic from the outside: how does he not crack under
1
0
43
@MillionInt
Jerry Tworek
4 days
If a computer makes a bad management decision, it will be met with heavy gradient update Its more than can be guaranteed about many humans
@eshear
Emmett Shear
4 days
Computers are beginning to make management decisions. Therefore we must begin to hold computers accountable for their decisions.
14
20
351
@MillionInt
Jerry Tworek
4 days
It doesn't matter how efficient you are at optimizing the wrong objective
@rajammanabrolu
Prithviraj (Raj) Ammanabrolu
1 year
ML Systems people need to be stopped. Half of these kernel fusions are not numerically stable 😭 Yes it makes GPU go brr but it also breaks policy gradient theorem and makes me question my life decisions every day
3
1
55
@MillionInt
Jerry Tworek
4 days
🤣🤣🤣
@tokenbender
tokenbender
4 days
researchers when asked to switch from bf16 to fp16 and do loss scaling because it is way better for RL
0
1
45
@MillionInt
Jerry Tworek
4 days
Advantage of founder-led companies is the same as of original authors of the codebase They know all the context on why certain decisions were made Whoever inherits complex systems always has this dilemma "if I change this weirdly looking thing will all everything collapse"
8
3
105
@MillionInt
Jerry Tworek
4 days
Codex is far from perfect but the ceiling is actually very high what it can do with good handling. First real agentic product in the world
@yacineMTB
kache
4 days
codex, make it faster >okay done okay codex benchmark dot py says that its faster. can you make it more faster? >okay done okay cool. its faster again. can you add more timing logs so we can make it even faster >okay
10
8
159
@MillionInt
Jerry Tworek
6 days
I don’t know what kink this is but it have a hard time saying "Grace-Blackwell superchip” without a shiver of excitement going down my spine
4
2
66
@MillionInt
Jerry Tworek
7 days
Humans have been sycophantic and reward hacking since forever
@Miles_Brundage
Miles Brundage
7 days
Zohran talks like April ChatGPT-4o
2
3
168
@MillionInt
Jerry Tworek
7 days
In the world of online learning every eval is instantly contaminated Train test split is an anachronism of old days, theres only past and future
6
3
148
@MillionInt
Jerry Tworek
8 days
I’ve always been a great fan of vertical integration - I’ve found inefficiencies around company boundaries jarring TSMC proves me wrong, that in some cases it is a vastly superior strategy to go for a thin slice of a wide market amortising capex through all of the economy
9
7
174
@MillionInt
Jerry Tworek
9 days
Real tragedy is most of those are not even coming from good models
@beffjezos
Beff – e/acc
9 days
This is the greatest flippening of all time. Artificial language models have surpassed biological language models in terms of content generated.
13
6
190
@MillionInt
Jerry Tworek
9 days
The human mind craves stories and makes them from everything around it
9
5
94
@MillionInt
Jerry Tworek
10 days
It’s a good agent
@slow_developer
Haider.
11 days
GPT-5 Codex is the best launch of Q4 2025 it follows instructions, sticks to the guidelines, keeps things simple, and produces optimized code if you're into vibe coding, it might not be for you. but if you know what you want, it beats claude code in every way
5
4
164
@MillionInt
Jerry Tworek
10 days
This is the way
@mitchellh
Mitchell Hashimoto
11 days
I’ve got agents doing some Ghostty work while I’m in an Uber. I’ve got some rote refactoring I’ve been wanting to do and now’s the perfect time. I just review the work every so often.
4
2
64
@MillionInt
Jerry Tworek
11 days
RL really feels like a technical revolution within revolution. It spun up a completely new wave of startups, products and thought leaders on top of a huge wave we were already riding on
16
7
231
@MillionInt
Jerry Tworek
11 days
Codex is giving me factorio dopamine hits times a healthy multiplier
@tszzl
roon
11 days
managing fleets of agents should be more fun than playing factorio with the UI/UX to boot
12
18
405
@MillionInt
Jerry Tworek
12 days
Technical decisions matter kids
@aurko79
Aurko Roy
12 days
Who would have thought that a multi trillion dollar cap company could have been thrown into such chaos (layoffs) by a single technical decision they made a year ago - using expert choice MoEs for their frontier model.
3
8
187
@itsclivetime
Clive Chan
12 days
🧵 Announcing a $30B collaboration with Pringles for datacenter chips (1/N)👇
4
4
98
@MillionInt
Jerry Tworek
12 days
🎯
@samsja19
samsja
12 days
Working on llm RL is one of the most intellectually satisfying things I ever done, both from a system and ml perspective
1
0
96