Yue Wu Profile
Yue Wu

@FrankYueWu1

Followers
2K
Following
335
Media
9
Statuses
61

Scaling RL @xAI | Prev. Postdoc @Princeton, CS PhD @UCLA. BSc @PKU1898.

Joined October 2019
Don't wanna be here? Send us removal request.
@tokenbender
tokenbender
6 days
@rosinality
Rosinality
6 days
FP16 can have a smaller training-inference gap compared to BFloat16, thus fits better for RL. Even the difference between RL algorithms vanishes once FP16 is adopted. Surprising!
11
69
2K
@lightetal
Jonathan @SF
20 days
🎉 Thrilled to share that our paper “DISC: Dynamic Decomposition Improves LLM Inference Scaling” has been accepted to #neurips2025 ! 🚀 Here’s how we push reasoning and inference scaling to new heights 🧵 1/n
4
6
18
@MillionInt
Jerry Tworek
20 days
This is mostly how I imagine postagi life
@khoomeik
Rohan Pandey
20 days
nothin better than kicking off a couple codex jobs and going back to watching a physics lecture i don't understand (from Prof. Coskun Kocabas talk on topological effects in graphene at @periodiclabs)
12
38
715
@FrankYueWu1
Yue Wu
1 month
If RL has 1 bit per rollout (0/1 reward) what is the amount of information of one rollout in SFT/pretraining?
@jxmnop
dr. jack morris
1 month
best paper or blog i've read in a while, highly recommend! John is brilliant and his research sets an example for the rest of us. recently i too have been thinking deeply about how many bits might be learned via one step of RL or SFT.. if you're thinking about this too, lmk!
5
0
28
@EdwardSun0909
Zhiqing Sun
1 month
I don’t often tweet on technical topics but I may have an opposite opinion here…
10
8
384
@Yuhu_ai_
Yuhuai (Tony) Wu
1 month
Hiring for a new team building computer control agents. Join us to build Grok5 / macrohard later this year. DM me! Will send out a job post soon too.
528
1K
7K
@Yuhu_ai_
Yuhuai (Tony) Wu
2 months
Grok4 Fast maximizing intelligence density.
@xai
xAI
2 months
Introducing Grok 4 Fast, a multimodal reasoning model with a 2M context window that sets a new standard for cost-efficient intelligence. Available for free on https://t.co/AnXpIEOhOD, https://t.co/53pltypvkw, iOS and Android apps, and OpenRouter. https://t.co/3YZ1yVwueV
38
63
752
@FrankYueWu1
Yue Wu
2 months
Join us to build world's largest RL system.
@s_tworkowski
Szymon Tworkowski
2 months
Come join us to work on 🔢🧮 for 🔄! https://t.co/f3elwtEHxM
6
12
192
@FrankYueWu1
Yue Wu
2 months
Our vending machine at @xai run by @grok just randomly decided to order some AGI pills for us. Ingredient unknown yet.
360
160
5K
@FrankYueWu1
Yue Wu
2 months
Remarkable that TML achieves 0-KL RL.
@thinkymachines
Thinking Machines
2 months
Today Thinking Machines Lab is launching our research blog, Connectionism. Our first blog post is “Defeating Nondeterminism in LLM Inference” We believe that science is better when shared. Connectionism will cover topics as varied as our research is: from kernel numerics to
0
0
18
@liujiashuo77
Jiashuo Liu
3 months
We built FutureX, the world’s first live benchmark for real future prediction — politics, economy, culture, sports, etc. Among 23 AI agents, #Grok4 ranked #1 🏆 Elon didn’t lie. @elonmusk your model sees further 🚀🍀 LeaderBoard: https://t.co/fwck0NROHZ
229
206
1K
@tianle_cai
Tianle Cai
3 months
Is X data the key to mastering AI predictions? FutureX transforms prediction markets into a dynamic, real-time benchmark for agents—and Grok4 tops the leaderboard, #1 among 25 models! 🏆Good job @grok ! Check it out: https://t.co/xisDgWdSjw
@liujiashuo77
Jiashuo Liu
3 months
We built FutureX, the world’s first live benchmark for real future prediction — politics, economy, culture, sports, etc. Among 23 AI agents, #Grok4 ranked #1 🏆 Elon didn’t lie. @elonmusk your model sees further 🚀🍀 LeaderBoard: https://t.co/fwck0NROHZ
2
3
25
@Yuhu_ai_
Yuhuai (Tony) Wu
3 months
You asked for it! Grok4 for free.
@xai
xAI
3 months
Grok 4 is now free for all users worldwide! Simply use Auto mode, and Grok will route complex queries to Grok 4. Prefer control? Choose "Expert" anytime to always use Grok 4. For a limited time, we are rolling out generous usage limits so you can explore Grok 4’s full
60
115
1K
@Yuhu_ai_
Yuhuai (Tony) Wu
3 months
Very proud of us @xai after seeing the GPT5 release. With a much smaller team, we are ahead in many. Grok4 world’s first unified model, and crushing GPT5 in benchmarks like ARC-AGI. @OpenAI is a very respectful competitor and still the leader in many, but we’re fast and
334
293
5K
@nrehiew_
wh
3 months
The problem with all these agent companies/products is that since you don’t have access to the underlying weights, the bet you’re making is that your scaffolds are better than the labs. This is hard because: 1) The labs can bake the scaffolds into the model (Claude Code) 2)
36
20
487
@Saaaang94
SangBin Cho
3 months
We are hiring! Interested in optimizing/scaling RL framework for pretrain scale RL? DM me or apply here:
Tweet card summary image
job-boards.greenhouse.io
Palo Alto, CA
7
85
491
@2prime_PKU
Yiping Lu
3 months
Anyone knows adam?
267
447
5K
@mckaywrigley
Mckay Wrigley
4 months
My thoughts on Grok 4 Heavy after 12hrs: Crazy good! “Create an animation of a crowd of people walking to form “Hello world, I am Grok” as camera changes to birds-eye.” And it 1-shotted the *entire* thing. No other model comes close. Watch the full clip.
527
1K
10K
@Yuhu_ai_
Yuhuai (Tony) Wu
4 months
Looking forward to what Grok can do next - pushing frontiers of science and engineering, making new discoveries!
@xai
xAI
4 months
Introducing Grok 4, the world's most powerful AI model. Watch the livestream now:
29
51
518
@arcprize
ARC Prize
4 months
Grok 4 (Thinking) achieves new SOTA on ARC-AGI-2 with 15.9% This nearly doubles the previous commercial SOTA and tops the current Kaggle competition SOTA
243
725
5K