Shiyi Cao

@shiyi_c98

Followers: 2K
Following: 503
Media: 7
Statuses: 85

Building Self-evolving Coding Agent | PhD student @UCBerkeley @BerkeleySky, MSc @ETH, B.S @sjtu1896, llm and system | Prev. Intern @nvidia

Berkeley, CA
Joined February 2019
@shiyi_c98
Shiyi Cao
27 days
1/n 🚀 Introducing SkyRL-Agent, a framework for efficient RL agent training. ⚡ 1.55× faster async rollout dispatch 🛠 Lightweight tool + task integration 🔄 Backend-agnostic (SkyRL-train / VeRL / Tinker) 🏆 Used to train SA-SWE-32B, improving Qwen3-32B from 24.4% → 39.4%
5
60
274
@Jintao_Zhang_
Jintao Zhang
8 days
TurboDiffusion: 100–205× faster video generation on a single RTX 5090 🚀 Only takes 1.8s to generate a high-quality 5-second video. The key to both high speed and high quality? SageAttention + Sparse-Linear Attention (SLA) + rCM GitHub: https://t.co/ybbNBjgHFP Technical
28
154
833
@shiyi_c98
Shiyi Cao
22 days
🤖 I am in San Diego for #NeurIPS2025 this week! Excited to chat about SkyRL(-Agent), Coding LLM/Agent, Self-evolving Agent, RL, and Inference/Training Infrastructure.
5
3
60
@shiyi_c98
Shiyi Cao
26 days
3
0
10
@DachengLi177
Dacheng Li
27 days
Check out our efficient infra for agentic RL training! More applications coming soon! 🔥
@shiyi_c98
Shiyi Cao
27 days
1/n 🚀 Introducing SkyRL-Agent, a framework for efficient RL agent training. ⚡ 1.55× faster async rollout dispatch 🛠 Lightweight tool + task integration 🔄 Backend-agnostic (SkyRL-train / VeRL / Tinker) 🏆 Used to train SA-SWE-32B, improving Qwen3-32B from 24.4% → 39.4%
0
3
46
@BeidiChen
Beidi Chen
27 days
Finally out!
@shiyi_c98
Shiyi Cao
27 days
1/n 🚀 Introducing SkyRL-Agent, a framework for efficient RL agent training. ⚡ 1.55× faster async rollout dispatch 🛠 Lightweight tool + task integration 🔄 Backend-agnostic (SkyRL-train / VeRL / Tinker) 🏆 Used to train SA-SWE-32B, improving Qwen3-32B from 24.4% → 39.4%
1
6
81
@fangz_zzu
FangZhou Zhao
27 days
Proud to have contributed to SkyRL-Agent as an undergrad! Huge thanks to @shiyi_c98 and @DachengLi177 for all the guidance, learned a lot from this project. More details in the thread
@shiyi_c98
Shiyi Cao
27 days
1/n 🚀 Introducing SkyRL-Agent, a framework for efficient RL agent training. ⚡ 1.55× faster async rollout dispatch 🛠 Lightweight tool + task integration 🔄 Backend-agnostic (SkyRL-train / VeRL / Tinker) 🏆 Used to train SA-SWE-32B, improving Qwen3-32B from 24.4% → 39.4%
0
2
4
@shiyi_c98
Shiyi Cao
27 days
7/n - Acknowledgements This work is developed in @BerkeleySky. In addition to the authors, we would like to thank all related open-source projects, and generous compute support from @anyscalecompute @awscloud @LambdaAPI @thinkymachines.
1
0
11
@shiyi_c98
Shiyi Cao
27 days
6/n - Join the Efforts & Roadmap SkyRL-Agent is a framework for efficient agent training. Looking forward, we are building: 📌 multi-agent training 📌 multi-domain training 📌 self-improving agents with runtime evolution It's still an early stage, please join us to build
1
1
13
@shiyi_c98
Shiyi Cao
27 days
5/n - Other Case Studies SkyRL-Agent is not just for SWE. We also provide training examples for: 🧠 Deep Research Agent (document reasoning & evidence retrieval) 🖥 Computer Use Agent (OS operations) Memory Agent (memory management for long-context tasks) More recipes are
1
1
14
@shiyi_c98
Shiyi Cao
27 days
4/n Using SkyRL-Agent, we trained SA-SWE-32B purely with RL from Qwen3-32B. Training recipe highlights: 🔹 Trained with an AST-based search tool for better code navigation → ⚡ Higher Pass@K & sample efficiency due to improved tooling 🔹 Trained on 4.5K R2E-Gym instances,
1
1
12
@shiyi_c98
Shiyi Cao
27 days
3/n For SWE agent training, we use the Async Pipeline Dispatching method, which improves rollout throughput by 1.55× over naive async batching. Instead of leaving the GPU idle during CPU-bound stages (init, reward compute, etc.), pipeline execution better overlaps CPU + GPU
1
1
9
@shiyi_c98
Shiyi Cao
27 days
2/n SkyRL-Agent is built around three key components: 🧩 Tool-centric task interface: Supports dynamic registration of stateless tools, environment-modifying actions, and agent-state-modifying operations under a unified abstraction. ⚡ Efficient rollout scheduling: Fine-grained
1
1
13
@shiyi_c98
Shiyi Cao
27 days
Amazing project! Want to see if we can integrate the environment into SkyRL. 🤯🤯🤯
@BeidiChen
Beidi Chen
27 days
📘 Holiday read! From Software Engineer to AI Environment Architect 🚀 TL;DR of our blog: We see an exciting future where engineers 👩‍💻 won't stop coding, but the highest leverage shifts to designing the environments where AI can think, build, and evolve. 🎬 Demo: Inspired
0
3
24
@FangYunhao_X
Yunhao Fang
1 month
A start toward real multimodality: an agent that can perceive, reason, and act in real time within open-world environments for hours. 🎬 Project page: https://t.co/S8tyXNbRTv 📄 Paper: https://t.co/Xu1Ysbw8Le More details: https://t.co/ZeaMFTwIrk Kudos to the team : )
@WeihaoTan64
Weihao Tan
1 month
🚀 Introducing Lumine, a generalist AI agent trained within Genshin Impact that can perceive, reason, and act in real time, completing hours-long missions and following diverse instructions within complex 3D open-world environments. 🎮 Website: https://t.co/UxSwNKGZml 1/6
1
3
8
@LaudeInstitute
Laude Institute
2 months
Meet Slingshots // One. This inaugural batch includes leading-edge researchers advancing the science and practice of AI - with benchmarks, frameworks, and agents that ship real impact into the world. We're honored to support research from: @alexgshaw @Mike_A_Merrill
2
18
65
@pcmoritz
Philipp Moritz
2 months
We are happy to release SkyRL tx 0.1 https://t.co/PSOuZciiGw, an open source unified training and inference engine that supports the Tinker API. This release has many performance enhancements and also new features but most importantly RL training is now working end-to-end. If you
4
12
78
@NovaSkyAI
NovaSky
2 months
SkyRL just crossed 1000 GitHub stars! Thank you to all the wonderful contributors and users building this project together 🥳 Check it out: https://t.co/CWlKue79JH
0
6
38
@shiyi_c98
Shiyi Cao
2 months
Honored and grateful to receive the Amazon AI Fellowship! Huge thanks to @AmazonScience for the support. Excited for the journey ahead 💙💛
@Berkeley_EECS
UC Berkeley EECS
2 months
Amazing! 10 @BerkeleyEECS @SkyCompLab grad students are Amazon AI PhD Fellows! Congrats! Learn more about our fellows here: https://t.co/zuCGKlmSNe #AmazonAIFellowship @BerkeleySky
2
1
45