Haibin Profile
Haibin

@eric_haibin_lin

Followers
2K
Following
357
Media
27
Statuses
336

Bytedance Seed LLM Systems, @verl_project, author of Megascale. Prev. Amazon AI, ex-committer for BytePS, Gluon-NLP, @ApacheMXNet

Joined November 2012
Don't wanna be here? Send us removal request.
@eric_haibin_lin
Haibin
10 hours
RT @verl_project: The 1st verl meetup will be held at ICML Vancouver on July 16th! Please join us if you will be there! .
0
2
0
@eric_haibin_lin
Haibin
4 days
RT @verl_project: If you're in Singapore on 7/11, do not miss this meetup! Talks from the verl community: .- LLMs to optimize code performa….
0
4
0
@eric_haibin_lin
Haibin
7 days
RT @Agentica_: 🚀 Introducing DeepSWE 🤖: our fully open-sourced, SOTA software engineering agent trained purely with RL on top of Qwen3-32B.….
0
65
0
@eric_haibin_lin
Haibin
1 month
🚀🚀🚀.
@verl_project
verl project
1 month
DeepSeek 671b and Qwen3 236b support with Megatron backend is now available as preview in verl v0.4.0 🔥🔥🔥.We will continue optimizing MoE model performance down the road. DeepSeek 671b: .verl v0.4:
Tweet media one
0
0
21
@eric_haibin_lin
Haibin
1 month
RT @InfiniAILab: 🥳 Happy to share our new work –  Kinetics: Rethinking Test-Time Scaling Laws. 🤔How to effectively build a powerful reasoni….
0
66
0
@eric_haibin_lin
Haibin
1 month
RT @DongfuJiang: Introducing VerlTool - a unified and easy-to-extend tool agent training framework based on verl. Recently, there's been a….
0
62
0
@eric_haibin_lin
Haibin
1 month
RT @verl_project: Multi-GPU LoRA RL is now available in verl! It enables 70B+ model RL with 8 GPUs in bf16. Getting started: https://t.co….
0
9
0
@eric_haibin_lin
Haibin
1 month
RT @langfengq: "verl-agent" is also the official repo for paper "Group-in-Group Policy Optimization for LLM Agent Training"..
0
1
0
@eric_haibin_lin
Haibin
2 months
RT @verl_project: As a community we continue to provide popular open source post-training recipes. Recently we worked together with the com….
0
4
0
@eric_haibin_lin
Haibin
2 months
SkyRL is a great work extending @verl_project with environments for agent tasks. It leverages the sglang multi-turn/tool calling feature recently added to verl:
@NovaSkyAI
NovaSky
2 months
1/N Introducing SkyRL-v0, our RL training pipeline enabling efficient RL training for long-horizon, real-environment tasks like SWE-Bench. We also open-source a series of our early trained models to showcase the potential of end-to-end online RL training on long-horizon (20-50
Tweet media one
1
28
179
@eric_haibin_lin
Haibin
2 months
Very efficient yet powerful 8B coding model trained with @verl_project. Check it out! .
@_yuyuzhang
Yuyu Zhang
2 months
🚀 Thrilled to introduce Seed-Coder, our new open-source family of 🔥powerful, 🔍transparent, ⚡parameter-efficient code models at 8B scale!. 💥 Small model, big results!.🥇 BigCodeBench, FullStack Bench, MHPP: impressive results among lightweight open-source models, including
Tweet media one
1
10
74
@eric_haibin_lin
Haibin
2 months
RT @verl_project: verl is embracing @PyTorch fsdp2! Better throughput, memory usage, and composability with torch.compile!. Please try it o….
0
6
0
@eric_haibin_lin
Haibin
2 months
RT @verl_project: qwen3 with both Megatron and FSDP reinforcement learning support is now available in verl!. https….
0
11
0
@eric_haibin_lin
Haibin
2 months
RT @verl_project: verl provides day 1 RL support for qwen3 models! Both sequence parallelism and remove padding acceleration are available.….
0
13
0
@eric_haibin_lin
Haibin
2 months
RT @tongyx361: Welcome to enjoy ICLR Expo Talk Panel "verl: Flexible and Efficient Infrastructures for Post-training LLMs" from ByteDance S….
0
11
0
@eric_haibin_lin
Haibin
3 months
RT @verl_project: Deploy verl on AMD GPUs for fast, scalable RLHF training with ROCm optimization, docker scripts, and impressive throughpu….
0
4
0
@eric_haibin_lin
Haibin
3 months
RT @qiying_yu: #ICLR2025.I am going to present VAPO & DAPO twice at ICLR, two SOTA LLM RL algorithms. 1. The 1-2 pm verl Expo Talk, Apr 2….
0
3
0
@eric_haibin_lin
Haibin
3 months
RT @verl_project: We will present latest updates of verl at #ICLR2025:.- recent RL recipes (DAPO, etc).- RL with tool calling & multi-turn….
0
15
0
@eric_haibin_lin
Haibin
3 months
Nice work training tool-augmented RL built on top of @verl_project and the DAPO algorithm. And The infrastructure for tool use / multi-turn RL is under review. Feedback is welcome! .
@_akhaliq
AK
3 months
ReTool. Reinforcement Learning for Strategic Tool Use in LLMs
Tweet media one
1
25
172
@eric_haibin_lin
Haibin
3 months
RT @lmsysorg: Hello, everyone! Get ready for an electrifying session this Saturday at 6 PM Pacific Time (9 PM Eastern Time) or Sunday at 9….
0
23
0