Shiyi Cao
@shiyi_c98
Followers
2K
Following
503
Media
7
Statuses
85
Building Self-evolving Coding Agent | PhD student @UCBerkeley @BerkeleySky, MSc @ETH, B.S @sjtu1896, llm and system | Prev. Intern @nvidia
Berkeley, CA
Joined February 2019
1/n ๐ Introducing SkyRL-Agent, a framework for efficient RL agent training. โก 1.55ร faster async rollout dispatch ๐ Lightweight tool + task integration ๐ Backend-agnostic (SkyRL-train / VeRL / Tinker) ๐ Used to train SA-SWE-32B, improving Qwen3-32B from 24.4% โ 39.4%
5
60
274
TurboDiffusion: 100โ205ร faster video generation on a single RTX 5090 ๐ Only takes 1.8s to generate a high-quality 5-second video. The key to both high speed and high quality? ๐SageAttention + Sparse-Linear Attention (SLA) + rCM Github: https://t.co/ybbNBjgHFP Technical
28
154
833
๐ค I am in San Diego for #NeurIPS2025 this week! Excited to chat about SkyRL(-Agent), Coding LLM/Agent, Self-evolving Agent, RL, and Inference/Training Infrastructure.
5
3
60
@BerkeleySky @anyscalecompute @awscloud @LambdaAPI @thinkymachines @DachengLi177 @fangz_zzu @sumanthrh @connorzchen @charlie_ruan @tyler_griggs_ @shulynnliu @erictang000 @CyrusHakha @richliaw @pcmoritz @matei_zaharia @profjoeyg @istoica 9/n โ One more thank-you tweet because yโall earned it. Thanks to @jyangballin @StringChaos @jiayi_pirate @xingyaow_ for valuable and helpful feedback and discussions๐ฅ
3
0
10
Check out our efficient infra for agentic RL training! More applications coming soon!๐ฅ
1/n ๐ Introducing SkyRL-Agent, a framework for efficient RL agent training. โก 1.55ร faster async rollout dispatch ๐ Lightweight tool + task integration ๐ Backend-agnostic (SkyRL-train / VeRL / Tinker) ๐ Used to train SA-SWE-32B, improving Qwen3-32B from 24.4% โ 39.4%
0
3
46
Finally out! ๐
1/n ๐ Introducing SkyRL-Agent, a framework for efficient RL agent training. โก 1.55ร faster async rollout dispatch ๐ Lightweight tool + task integration ๐ Backend-agnostic (SkyRL-train / VeRL / Tinker) ๐ Used to train SA-SWE-32B, improving Qwen3-32B from 24.4% โ 39.4%
1
6
81
@BerkeleySky @anyscalecompute @awscloud @LambdaAPI @thinkymachines 8/n โ Continued thanks. We also want to recognize the incredible team behind this work: @shiyi_c98 @DachengLi177 @fangz_zzu Shuo Yuan @sumanthrh @connorzchen @charlie_ruan
@tyler_griggs_ @shulynnliu @erictang000 @CyrusHakha @richliaw @pcmoritz
@matei_zaharia @profjoeyg @istoica
1
0
14
Proud to have contributed to SkyRL-Agent as an undergrad! Huge thanks to @shiyi_c98 and @DachengLi177 for all the guidance, learned a lot from this project. More details in the thread
1/n ๐ Introducing SkyRL-Agent, a framework for efficient RL agent training. โก 1.55ร faster async rollout dispatch ๐ Lightweight tool + task integration ๐ Backend-agnostic (SkyRL-train / VeRL / Tinker) ๐ Used to train SA-SWE-32B, improving Qwen3-32B from 24.4% โ 39.4%
0
2
4
7/n โ Acknowledgements This work is developed in @BerkeleySky. In addition to the authors, we would like to thank all related open-source projects, and generous compute support from @anyscalecompute @awscloud @LambdaAPI @thinkymachines.
1
0
11
6/n โ Join the Efforts & Roadmap SkyRL-Agent is a framework for efficient agent training. Looking forward, we are building: ๐ multi-agent training ๐ multi-domain training ๐ self-improving agents with runtime evolution Itโs still an early stage, please join us to build
1
1
13
5/n โ Other Case Studies SkyRL-Agent is not just for SWE. We also provide training examples for: ๐ง Deep Research Agent (document reasoning & evidence retrieval) ๐ฅ Computer Use Agent (OS operations) ๐ Memory Agent (memory management for long-context tasks) More recipes are
1
1
14
4/n Using SkyRL-Agent, we trained SA-SWE-32B purely with RL from Qwen3-32B. Training recipe highlights: ๐น Trained with an AST-based search tool for better code navigation โ โก Higher Pass@K & sample efficiency due to improved tooling ๐น Trained on 4.5K R2E-Gym instances,
1
1
12
3/n For SWE agent training, we use the Async Pipeline Dispatching method, which improves rollout throughput by 1.55ร over naive async batching. Instead of leaving the GPU idle during CPU-bound stages (init, reward compute, etc.), pipeline execution better overlaps CPU + GPU
1
1
9
2/n SkyRL-Agent is built around three key components: ๐งฉ Tool-centric task interface โSupports dynamic registration of stateless tools, environment-modifying actions, and agent-state-modifying operations under a unified abstraction. โก Efficient rollout scheduling โFine-grained
1
1
13
Amazing project! Want to see if we can integrate the environment into SkyRL.๐คฏ๐คฏ๐คฏ
๐ Holiday read! From Software Engineer to AI Environment Architect ๐ Tldr of our blog: We see an exciting future where engineers ๐ฉโ๐ป wonโt stop coding โ but the highest leverage shifts to designing the environments ๐ where AI can think, build, and evolve. ๐ฌ Demo: Inspired
0
3
24
A start toward real multimodality: an agent that can perceive, reason, and act in real time within open-world environments for hours. ๐ฌProject page: https://t.co/S8tyXNbRTv ๐Paper: https://t.co/Xu1Ysbw8Le More details: https://t.co/ZeaMFTwIrk Kudos to the team : )
๐Introducing Lumine, a generalist AI agent trained within Genshin Impact that can perceive, reason, and act in real time, completing hours-long missions and following diverse instructions within complex 3D open-world environments.๐ฎ Website: https://t.co/UxSwNKGZml 1/6
1
3
8
Meet Slingshots // One. This inaugural batch includes leading-edge researchers advancing the science and practice of AI - with benchmarks, frameworks, and agents that ship real impact into the world. We're honored to support research from: @alexgshaw @Mike_A_Merrill
2
18
65
We are happy to release SkyRL tx 0.1 https://t.co/PSOuZciiGw, an open source unified training and inference engine that supports the Tinker API. This release has many performance enhancements and also new features but most importantly RL training is now working end-to-end. If you
4
12
78
SkyRL just crossed 1000 Github stars! Thank you to all the wonderful contributors and users building this project together ๐ฅณ Check it out: https://t.co/CWlKue79JH
0
6
38
Honored and grateful to receive the Amazon AI Fellowship! Huge thanks to @AmazonScience for the supportโexcited for the journey ahead ๐๐
Amazing! 10 @BerkeleyEECS @SkyCompLab grad students are Amazon AI PhD Fellows! Congrats! Learn more about our fellows here: https://t.co/zuCGKlmSNe
#AmazonAIFellowship
@BerkeleySky
2
1
45