Shiyi Cao @shiyi_c98 X Profile

Shiyi Cao

@shiyi_c98

Followers: 2K | Following: 503 | Media: 7 | Statuses: 85

Building Self-evolving Coding Agent | PhD student @UCBerkeley @BerkeleySky, MSc @ETH, B.S @sjtu1896, llm and system | Prev. Intern @nvidia

https://t.co/qnW3a8Basr

Berkeley, CA

Joined February 2019

Shiyi Cao

@shiyi_c98

27 days

1/n 🚀 Introducing SkyRL-Agent, a framework for efficient RL agent training. ⚡ 1.55× faster async rollout dispatch 🛠 Lightweight tool + task integration 🔄 Backend-agnostic (SkyRL-train / VeRL / Tinker) 🏆 Used to train SA-SWE-32B, improving Qwen3-32B from 24.4% → 39.4%

5

60

274

Jintao Zhang

@Jintao_Zhang_

8 days

TurboDiffusion: 100–205× faster video generation on a single RTX 5090 🚀 Only takes 1.8s to generate a high-quality 5-second video. The key to both high speed and high quality? 😍SageAttention + Sparse-Linear Attention (SLA) + rCM Github: https://t.co/ybbNBjgHFP Technical

28

154

833

Shiyi Cao

@shiyi_c98

22 days

🤖 I am in San Diego for #NeurIPS2025 this week! Excited to chat about SkyRL(-Agent), Coding LLM/Agent, Self-evolving Agent, RL, and Inference/Training Infrastructure.

5

3

60

Shiyi Cao

@shiyi_c98

26 days

@BerkeleySky @anyscalecompute @awscloud @LambdaAPI @thinkymachines @DachengLi177 @fangz_zzu @sumanthrh @connorzchen @charlie_ruan @tyler_griggs_ @shulynnliu @erictang000 @CyrusHakha @richliaw @pcmoritz @matei_zaharia @profjoeyg @istoica 9/n — One more thank-you tweet because y’all earned it. Thanks to @jyangballin @StringChaos @jiayi_pirate @xingyaow_ for valuable and helpful feedback and discussions🔥

3

0

10

Dacheng Li

@DachengLi177

27 days

Check out our efficient infra for agentic RL training! More applications coming soon!🔥

Quoted post: Shiyi Cao @shiyi_c98, 27 days: 1/n 🚀 Introducing SkyRL-Agent, a framework for efficient RL agent training. […]

0

3

46

Beidi Chen

@BeidiChen

27 days

Finally out! 😁

Quoted post: Shiyi Cao @shiyi_c98, 27 days: 1/n 🚀 Introducing SkyRL-Agent, a framework for efficient RL agent training. […]

1

6

81

Shiyi Cao

@shiyi_c98

27 days

@BerkeleySky @anyscalecompute @awscloud @LambdaAPI @thinkymachines 8/n — Continued thanks. We also want to recognize the incredible team behind this work: @shiyi_c98 @DachengLi177 @fangz_zzu Shuo Yuan @sumanthrh @connorzchen @charlie_ruan @tyler_griggs_ @shulynnliu @erictang000 @CyrusHakha @richliaw @pcmoritz @matei_zaharia @profjoeyg @istoica

1

0

14

FangZhou Zhao

@fangz_zzu

27 days

Proud to have contributed to SkyRL-Agent as an undergrad! Huge thanks to @shiyi_c98 and @DachengLi177 for all the guidance, learned a lot from this project. More details in the thread

Quoted post: Shiyi Cao @shiyi_c98, 27 days: 1/n 🚀 Introducing SkyRL-Agent, a framework for efficient RL agent training. […]

0

2

4

Shiyi Cao

@shiyi_c98

27 days

7/n — Acknowledgements This work is developed in @BerkeleySky. In addition to the authors, we would like to thank all related open-source projects, and generous compute support from @anyscalecompute @awscloud @LambdaAPI @thinkymachines.

1

0

11

Shiyi Cao

@shiyi_c98

27 days

6/n — Join the Efforts & Roadmap SkyRL-Agent is a framework for efficient agent training. Looking forward, we are building: 📌 multi-agent training 📌 multi-domain training 📌 self-improving agents with runtime evolution It’s still an early stage, please join us to build

1

13

Shiyi Cao

@shiyi_c98

27 days

5/n — Other Case Studies SkyRL-Agent is not just for SWE. We also provide training examples for: 🧠 Deep Research Agent (document reasoning & evidence retrieval) 🖥 Computer Use Agent (OS operations) 📝 Memory Agent (memory management for long-context tasks) More recipes are

1

14

Shiyi Cao

@shiyi_c98

27 days

4/n Using SkyRL-Agent, we trained SA-SWE-32B purely with RL from Qwen3-32B. Training recipe highlights: 🔹 Trained with an AST-based search tool for better code navigation → ⚡ Higher Pass@K & sample efficiency due to improved tooling 🔹 Trained on 4.5K R2E-Gym instances,

1

12

Shiyi Cao

@shiyi_c98

27 days

3/n For SWE agent training, we use the Async Pipeline Dispatching method, which improves rollout throughput by 1.55× over naive async batching. Instead of leaving the GPU idle during CPU-bound stages (init, reward compute, etc.), pipeline execution better overlaps CPU + GPU

1

9
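The overlap idea in 3/n can be illustrated with a toy asyncio sketch. This is not SkyRL-Agent's implementation: the stage names, the slot counts (`CPU_SLOTS`, `GPU_SLOTS`), the sleep-based stage durations, and the reading of "naive async batching" as a barrier between stages are all assumptions made for illustration.

```python
import asyncio
import time

CPU_SLOTS = 2      # hypothetical number of concurrent CPU-bound workers
GPU_SLOTS = 2      # hypothetical number of concurrent generation slots
STAGE_SECONDS = 0.03  # every simulated stage takes the same time


async def stage(sem, seconds=STAGE_SECONDS):
    # Hold one slot of the given resource for the duration of the stage.
    async with sem:
        await asyncio.sleep(seconds)


async def batched(n, cpu, gpu):
    # Assumed "naive" baseline: the whole batch finishes one stage
    # before any rollout may enter the next, so the GPU sits idle
    # during init and reward computation.
    await asyncio.gather(*(stage(cpu) for _ in range(n)))  # init
    await asyncio.gather(*(stage(gpu) for _ in range(n)))  # generate
    await asyncio.gather(*(stage(cpu) for _ in range(n)))  # reward
    return list(range(n))


async def rollout(i, cpu, gpu):
    # One rollout moves through its stages independently of the others.
    await stage(cpu)  # init
    await stage(gpu)  # generate
    await stage(cpu)  # reward
    return i


async def pipelined(n, cpu, gpu):
    # No barriers: while some rollouts generate on the GPU,
    # others run their CPU-bound stages.
    return await asyncio.gather(*(rollout(i, cpu, gpu) for i in range(n)))


async def timed(dispatch, n):
    # Semaphores are created inside the running event loop.
    cpu = asyncio.Semaphore(CPU_SLOTS)
    gpu = asyncio.Semaphore(GPU_SLOTS)
    start = time.perf_counter()
    results = await dispatch(n, cpu, gpu)
    return results, time.perf_counter() - start


def compare(n=8):
    res_b, t_b = asyncio.run(timed(batched, n))
    res_p, t_p = asyncio.run(timed(pipelined, n))
    return res_b, t_b, res_p, t_p


if __name__ == "__main__":
    _, t_b, _, t_p = compare()
    print(f"batched: {t_b:.3f}s  pipelined: {t_p:.3f}s")
```

In this toy setup the pipelined variant finishes sooner because GPU work is hidden behind CPU work. The speedup it prints depends entirely on the made-up durations and slot counts, and says nothing about the 1.55× figure reported in the thread.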

Shiyi Cao

@shiyi_c98

27 days

2/n SkyRL-Agent is built around three key components: 🧩 Tool-centric task interface Supports dynamic registration of stateless tools, environment-modifying actions, and agent-state-modifying operations under a unified abstraction. ⚡ Efficient rollout scheduling Fine-grained

1

13
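A minimal sketch of what "a unified abstraction" over the three tool kinds in 2/n could look like. Every name here (`Context`, `REGISTRY`, `register`, `call`, and the three example tools) is hypothetical and is not taken from SkyRL-Agent's actual API; the only assumption is that all tools share one call signature and differ in what part of the context they touch.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List


@dataclass
class Context:
    # Hypothetical container for everything a tool may read or modify.
    env: Dict[str, str] = field(default_factory=dict)   # e.g. file path -> contents
    history: List[str] = field(default_factory=list)    # agent-side message history


REGISTRY: Dict[str, Callable[..., str]] = {}


def register(name: str):
    # Tools can be added at any time, which is what makes registration dynamic.
    def decorator(fn):
        REGISTRY[name] = fn
        return fn
    return decorator


@register("add")
def add(ctx: Context, a: int, b: int) -> str:
    # Stateless tool: ignores the context entirely.
    return str(a + b)


@register("write_file")
def write_file(ctx: Context, path: str, text: str) -> str:
    # Environment-modifying action: changes the simulated file system.
    ctx.env[path] = text
    return f"wrote {len(text)} chars to {path}"


@register("truncate_history")
def truncate_history(ctx: Context, keep: int) -> str:
    # Agent-state-modifying operation: keeps only the last `keep` messages.
    dropped = max(len(ctx.history) - keep, 0)
    ctx.history[:] = ctx.history[-keep:] if keep > 0 else []
    return f"dropped {dropped} old messages"


def call(ctx: Context, name: str, **kwargs) -> str:
    # Single entry point: every tool returns an observation string.
    return REGISTRY[name](ctx, **kwargs)
```

With this shape, the agent loop only needs `call(ctx, name, **args)`; whether a tool is stateless, touches the environment, or rewrites the agent's own state is a detail of the tool, not of the caller.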

Shiyi Cao

@shiyi_c98

27 days

Amazing project! Want to see if we can integrate the environment into SkyRL.🤯🤯🤯

Beidi Chen

@BeidiChen

27 days

📘 Holiday read! From Software Engineer to AI Environment Architect 🚀 Tldr of our blog: We see an exciting future where engineers 👩‍💻 won’t stop coding — but the highest leverage shifts to designing the environments 🛝 where AI can think, build, and evolve. 🎬 Demo: Inspired

0

3

24

Yunhao Fang

@FangYunhao_X

1 month

A start toward real multimodality: an agent that can perceive, reason, and act in real time within open-world environments for hours. 🎬Project page: https://t.co/S8tyXNbRTv 📄Paper: https://t.co/Xu1Ysbw8Le More details: https://t.co/ZeaMFTwIrk Kudos to the team : )

Weihao Tan

@WeihaoTan64

1 month

🚀Introducing Lumine, a generalist AI agent trained within Genshin Impact that can perceive, reason, and act in real time, completing hours-long missions and following diverse instructions within complex 3D open-world environments.🎮 Website: https://t.co/UxSwNKGZml 1/6

1

3

8

Laude Institute

@LaudeInstitute

2 months

Meet Slingshots // One. This inaugural batch includes leading-edge researchers advancing the science and practice of AI - with benchmarks, frameworks, and agents that ship real impact into the world. We're honored to support research from: @alexgshaw @Mike_A_Merrill

2

18

65

Philipp Moritz

@pcmoritz

2 months

We are happy to release SkyRL tx 0.1 https://t.co/PSOuZciiGw, an open source unified training and inference engine that supports the Tinker API. This release has many performance enhancements and also new features but most importantly RL training is now working end-to-end. If you

4

12

78

NovaSky

@NovaSkyAI

2 months

SkyRL just crossed 1000 Github stars! Thank you to all the wonderful contributors and users building this project together 🥳 Check it out: https://t.co/CWlKue79JH

0

6

38

Shiyi Cao

@shiyi_c98

2 months

Honored and grateful to receive the Amazon AI Fellowship! Huge thanks to @AmazonScience for the support—excited for the journey ahead 💙💛

UC Berkeley EECS

@Berkeley_EECS

2 months

Amazing! 10 @BerkeleyEECS @SkyCompLab grad students are Amazon AI PhD Fellows! Congrats! Learn more about our fellows here: https://t.co/zuCGKlmSNe #AmazonAIFellowship @BerkeleySky

2

1

45