Rui Yang

@RuiYang70669025

Followers: 354
Following: 1K
Media: 19
Statuses: 122

PhD student @ UIUC

Illinois, USA
Joined November 2022
@RuiYang70669025
Rui Yang
6 months
🤖 Can MLLM agents reason about spatial relationships and plan atomic actions for navigation & manipulation? 🔥 Meet EmbodiedBench 🏆—the first fine-grained benchmark for MLLM-based embodied agents! 📄 Paper: 🌐 Website & code:
4
36
170
@RuiYang70669025
Rui Yang
19 days
RT @Yong18850571: 🔥 Our Goedel-Prover-V2-32B topped the PutnamBench Leaderboard by solving 86 problems —nearly 2× more than the previous SO…
0
15
0
@RuiYang70669025
Rui Yang
19 days
RT @HolarisSun: 🚀 RL is powering breakthroughs in LLM alignment, reasoning, and agentic apps. Are you ready to dive into the RL x LLM front…
huggingface.co
0
12
0
@RuiYang70669025
Rui Yang
22 days
RT @hc81Jeremy: Grateful for the chance to present EmbodiedBench at ICML as an Oral. A rewarding experience full of learning. Thanks for @…
0
3
0
@RuiYang70669025
Rui Yang
25 days
RT @RickyRDWang: 🚀 Introducing MA-LoT Theorem Framework: An open-source multi-agent framework utilizing the Long Chain-of-Thought to boost…
0
9
0
@RuiYang70669025
Rui Yang
26 days
RT @Yong18850571: (1/4) 🚨 Introducing Goedel-Prover V2 🚨 🔥🔥🔥 The strongest open-source theorem prover to date. 🥇 #1 on PutnamBench: Solves 6…
0
85
0
@RuiYang70669025
Rui Yang
26 days
Forgot to change my time zone in my last post 🤣
0
0
0
@RuiYang70669025
Rui Yang
26 days
My coauthor @hc81Jeremy will present EmbodiedBench at ICML 2025! 🤖
Oral Session 6A
📍 West Hall C 🕧 July 17, 3:30-3:45 pm PDT
📌 Poster Session
📍 East Hall A-B #E-2411 🕜 July 17, 4:30-7 pm PDT
Come say hi and let’s talk about VLM agent training, evaluation, and benchmarking! 😀
3
3
10
@RuiYang70669025
Rui Yang
29 days
RT @zhenhailongW: Learning to perceive while learning to reason! We introduce PAPO: Perception-Aware Policy Optimization, a direct upgrade…
0
14
0
@RuiYang70669025
Rui Yang
29 days
RT @tinner_he: 🤩 Mind-blowing discovery: Random policies can be surprisingly powerful for decision-making! Our ICML 2025 paper reveals how s…
0
3
0
@RuiYang70669025
Rui Yang
30 days
RT @5000hui: @LiJunnan0409 Awesome work! 🥂 I feel like the design of our GUI-Actor — which can propose multiple candidate regions in one fo…
0
1
0
@RuiYang70669025
Rui Yang
2 months
Insightful post on the scalability of off-policy RL.
@seohong_park
Seohong Park
2 months
Q-learning is not yet scalable. I wrote a blog post about my thoughts on scalable RL algorithms. To be clear, I'm still highly optimistic about off-policy RL and Q-learning! I just think we haven't found the right solution yet (the post discusses why).
1
0
7
@RuiYang70669025
Rui Yang
2 months
RT @jackbai_jkb: 🧵 1/7 Should AI agents "think more" or "do more"? 🤔 The current trend is to scale test-time compute, making agents genera…
0
17
0
@RuiYang70669025
Rui Yang
2 months
RT @FengLuo895614: 🚀 Can LLMs stop overthinking when detailed reasoning isn't needed? Excited to share our latest work on LLM reasoning: Au…
0
4
0
@RuiYang70669025
Rui Yang
2 months
Excited to share that EmbodiedBench was selected for an Oral at ICML 2025! We recently added results for new models (InternVL3, Gemma3, Ovis2) and released a large agent trajectory dataset on 🤗: Try training and evaluating your MLLM for embodied agents!
@RuiYang70669025
Rui Yang
6 months
🤖 Can MLLM agents reason about spatial relationships and plan atomic actions for navigation & manipulation? 🔥 Meet EmbodiedBench 🏆—the first fine-grained benchmark for MLLM-based embodied agents! 📄 Paper: 🌐 Website & code:
2
21
93
@RuiYang70669025
Rui Yang
2 months
RT @5000hui: 🚀 Excited to share GUI-Actor—a new approach for GUI grounding! Big thanks to @_akhaliq for featuring our work! 🌐 Project page:…
0
29
0
@RuiYang70669025
Rui Yang
2 months
Thanks for sharing our work! GUI-Actor is a new GUI grounding method that combines an attention-based action head with a grounding verifier, unlike previous methods that predict coordinates as text.
@_akhaliq
AK
2 months
Microsoft just dropped GUI-Actor on Hugging Face. Coordinate-Free Visual Grounding for GUI Agents
0
3
15
@RuiYang70669025
Rui Yang
2 months
RT @shizhediao: Does RL truly expand a model’s reasoning 🧠 capabilities? Contrary to recent claims, the answer is yes—if you push RL training…
0
66
0
@RuiYang70669025
Rui Yang
2 months
RT @qiancheng1231: 📢 New Paper Drop: From Solving to Modeling! LLMs can solve math problems — but can they model the real world? 🌍 📄 arXiv…
0
30
0
@RuiYang70669025
Rui Yang
2 months
RT @taiwei_shi: Is there anything that Qwen cannot do at this point? 😂
0
75
0
@RuiYang70669025
Rui Yang
2 months
RT @hendrydong: 🚀 A unified strategy for parallel decoding: Fractured CoT Reasoning. We explore three dims of sampling:
- Reasoning trajecto…
0
23
0