Haibin @eric_haibin_lin X Profile

Haibin

@eric_haibin_lin

Followers

2K

Following

357

Media

27

Statuses

336

Bytedance Seed LLM Systems, @verl_project, author of Megascale. Prev. Amazon AI, ex-committer for BytePS, Gluon-NLP, @ApacheMXNet

Joined November 2012

Don't wanna be here? Send us removal request.

Haibin

@eric_haibin_lin

10 hours

RT @verl_project: The 1st verl meetup will be held at ICML Vancouver on July 16th! Please join us if you will be there! .

0

2

0

Haibin

@eric_haibin_lin

4 days

RT @verl_project: If you're in Singapore on 7/11, do not miss this meetup! Talks from the verl community: .- LLMs to optimize code performa….

0

4

0

Haibin

@eric_haibin_lin

7 days

RT @Agentica_: 🚀 Introducing DeepSWE 🤖: our fully open-sourced, SOTA software engineering agent trained purely with RL on top of Qwen3-32B.….

0

65

0

Haibin

@eric_haibin_lin

1 month

🚀🚀🚀.

verl project

@verl_project

1 month

DeepSeek 671b and Qwen3 236b support with Megatron backend is now available as preview in verl v0.4.0 🔥🔥🔥.We will continue optimizing MoE model performance down the road. DeepSeek 671b: .verl v0.4:

0

21

Haibin

@eric_haibin_lin

1 month

RT @InfiniAILab: 🥳 Happy to share our new work – Kinetics: Rethinking Test-Time Scaling Laws. 🤔How to effectively build a powerful reasoni….

0

66

0

Haibin

@eric_haibin_lin

1 month

RT @DongfuJiang: Introducing VerlTool - a unified and easy-to-extend tool agent training framework based on verl. Recently, there's been a….

0

62

0

Haibin

@eric_haibin_lin

1 month

RT @verl_project: Multi-GPU LoRA RL is now available in verl! It enables 70B+ model RL with 8 GPUs in bf16. Getting started: https://t.co….

0

9

0

Haibin

@eric_haibin_lin

1 month

RT @langfengq: "verl-agent" is also the official repo for paper "Group-in-Group Policy Optimization for LLM Agent Training"..

0

1

0

Haibin

@eric_haibin_lin

2 months

RT @verl_project: As a community we continue to provide popular open source post-training recipes. Recently we worked together with the com….

0

4

0

Haibin

@eric_haibin_lin

2 months

SkyRL is a great work extending @verl_project with environments for agent tasks. It leverages the sglang multi-turn/tool calling feature recently added to verl:

NovaSky

@NovaSkyAI

2 months

1/N Introducing SkyRL-v0, our RL training pipeline enabling efficient RL training for long-horizon, real-environment tasks like SWE-Bench. We also open-source a series of our early trained models to showcase the potential of end-to-end online RL training on long-horizon (20-50

1

28

179

Haibin

@eric_haibin_lin

2 months

Very efficient yet powerful 8B coding model trained with @verl_project. Check it out! .

Yuyu Zhang

@_yuyuzhang

2 months

🚀 Thrilled to introduce Seed-Coder, our new open-source family of 🔥powerful, 🔍transparent, ⚡parameter-efficient code models at 8B scale!. 💥 Small model, big results!.🥇 BigCodeBench, FullStack Bench, MHPP: impressive results among lightweight open-source models, including

1

10

74

Haibin

@eric_haibin_lin

2 months

RT @verl_project: verl is embracing @PyTorch fsdp2! Better throughput, memory usage, and composability with torch.compile!. Please try it o….

0

6

0

Haibin

@eric_haibin_lin

2 months

RT @verl_project: qwen3 with both Megatron and FSDP reinforcement learning support is now available in verl!. https….

0

11

0

Haibin

@eric_haibin_lin

2 months

RT @verl_project: verl provides day 1 RL support for qwen3 models! Both sequence parallelism and remove padding acceleration are available.….

0

13

0

Haibin

@eric_haibin_lin

2 months

RT @tongyx361: Welcome to enjoy ICLR Expo Talk Panel "verl: Flexible and Efficient Infrastructures for Post-training LLMs" from ByteDance S….

0

11

0

Haibin

@eric_haibin_lin

3 months

RT @verl_project: Deploy verl on AMD GPUs for fast, scalable RLHF training with ROCm optimization, docker scripts, and impressive throughpu….

0

4

0

Haibin

@eric_haibin_lin

3 months

RT @qiying_yu: #ICLR2025.I am going to present VAPO & DAPO twice at ICLR, two SOTA LLM RL algorithms. 1. The 1-2 pm verl Expo Talk, Apr 2….

0

3

0

Haibin

@eric_haibin_lin

3 months

RT @verl_project: We will present latest updates of verl at #ICLR2025:.- recent RL recipes (DAPO, etc).- RL with tool calling & multi-turn….

0

15

0

Haibin

@eric_haibin_lin

3 months

Nice work training tool-augmented RL built on top of @verl_project and the DAPO algorithm. And The infrastructure for tool use / multi-turn RL is under review. Feedback is welcome! .

AK

@_akhaliq

3 months

ReTool. Reinforcement Learning for Strategic Tool Use in LLMs

1

25

172

Haibin

@eric_haibin_lin

3 months

RT @lmsysorg: Hello, everyone! Get ready for an electrifying session this Saturday at 6 PM Pacific Time (9 PM Eastern Time) or Sunday at 9….

0

23

0