
Junhao Chen
@Cumquaaa
Followers: 14 · Following: 32 · Media: 4 · Statuses: 12
Senior at @Tsinghua_Uni; previously interned at @tsvetshop. My interests lie in NLP, CV, and RL.
Joined August 2023
🎯 Takeaway: Tune your AR⇄Diffusion balance to match your compute budget: more AR for speed, more diffusion for quality. 🙏 Huge thanks to @XiaochuangHan and Yulia @tsvetshop for the collaboration! Keen to hear your thoughts! (4/🧵)
@tsvetshop @XiaochuangHan 🔍 MADFormer explores the Mixed AR + Diffusion design space in image generation Transformers.
- Token axis: split an image into blocks of patches (e.g. 4/16/64 blocks), with AR conditioning across the blocks and diffusion within the block.
- Layer axis: allocate the first (N–D)
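The token-axis idea in the MADFormer post above can be sketched as a generation schedule: each block of patches is conditioned on all earlier blocks (the autoregressive part) and would be produced by a diffusion step within the block. This is only an illustration, not the authors' implementation. The function name `block_schedule` is hypothetical, blocks are assumed to be contiguous runs of patch indices in raster order (the post does not say how blocks are formed), and the within-block denoising itself is left out.

```python
def block_schedule(num_patches, num_blocks):
    """Return (context, target) patch-index pairs, one pair per block.

    Assumption: blocks are equal-sized, contiguous runs of patch indices.
    """
    if num_patches % num_blocks != 0:
        raise ValueError("num_patches must be divisible by num_blocks")
    size = num_patches // num_blocks
    blocks = [list(range(b * size, (b + 1) * size)) for b in range(num_blocks)]

    schedule = []
    for b, target in enumerate(blocks):
        # AR conditioning: block b sees every patch from blocks 0..b-1.
        context = [i for earlier in blocks[:b] for i in earlier]
        # A diffusion model would denoise `target` jointly, given `context`.
        schedule.append((context, target))
    return schedule


# 64 patches split into 4 blocks of 16 patches each.
schedule = block_schedule(num_patches=64, num_blocks=4)
for context, target in schedule:
    print(f"condition on {len(context):2d} patches, generate {len(target)} patches")
```

With more blocks the schedule has more sequential AR steps over smaller groups of patches; with fewer blocks, each diffusion step covers a larger share of the image.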
RT @stingning: Our framework supports various online RL algorithms. In our experiments, we use GRPO with the following optimizations: 1️⃣ P…
RT @stingning: SOTA: SimpleVLA-RL achieves 98.4% on LIBERO. 🎯 With only 1 trajectory/task for SFT: 🚀 LIBERO-Avg: 48.9%→94.1%. 🚀 LIBERO-Long…
RT @stingning: Moonlighting a bit: we implement Online RL for VLA models with @verl 🤖, and find simple outcome rewards can work surprisingly…
RT @ZhiyuanZeng_: 🛠️ Build your own LLM (or benchmark) analysis/debugging tool 🚀 Try our demo.
RT @ZhiyuanZeng_: Is a single accuracy number all we can get from model evals? 🤔 🚨 Does NOT tell where the model fails. 🚨 Does NOT tell how to…