
Jiaheng Liu
@liujiaheng2
Followers: 35
Following: 12
Media: 3
Statuses: 42
Assistant Professor @ Nanjing University | Core Member @ M-A-P | AGI/LLM Researcher
Joined March 2018
RT @zhongyuan_peng: 🚀 CriticLean Announcement. [1/6] Breaking News: ByteDance Seed & Nanjing University introduce #CriticLean, a groundbreak…
0
4
0
RT @omarsar0: A Survey of Latent Reasoning. Nice overview on the emerging field of latent reasoning. Great read for AI devs. (bookmark it…
0
152
0
RT @RidgerZhu: [1/7] Excited to share our new survey on Latent Reasoning! The field is buzzing with methods: looping, recurrence, continuou…
0
50
0
RT @iScienceLuvr: Alibaba's RL LLM training library: ROLL. "We introduce ROLL, an efficient, scalable, and user-friendly library designed…
0
68
0
RT @GeZhang86038849: [1/n] 🚨 Game On for LLM Reasoning: Meet KORGym! 🎮✨ Ever wondered how to truly assess an LLM’s reasoning ability beyond…
0
10
0
RT @Synced_Global: 🔥 Flow-GRPO: Training Flow Matching Models via Online RL 🎨 This work introduces Flow-GRPO, the first method to embed on…
0
2
0
RT @iScienceLuvr: Flow-GRPO: Training Flow Matching Models via Online RL. "We propose Flow-GRPO, the first method integrating online reinfo…
0
82
0
RT @WenhuChen: 🔥 How do you build a state-of-the-art Vision-Language Model with direct RL? We’re excited to introduce VL-Rethinker, a new…
0
61
0
RT @dwzhu128: [1/n] Super excited to introduce our comprehensive survey on Long Context Language Models (LCLM), a collaborative effort betw…
0
81
0
RT @GeZhang86038849: [1/n] 🌟 Introducing #DeltaBench: The first benchmark focused on evaluating the critique abilities to detect errors in…
0
1
0
RT @zhngchn95319950: 💥 CodeCriticBench: The Ultimate LLM Code Critique Test! 🚀 Tests code gen & QA (CodeForces, MBPP, StackOverflow). ✔️ 10…
0
5
0
RT @sivil_taram: 🎉 Announcing the first Open Science for Foundation Models (SCI-FM) Workshop at #ICLR2025! Join us in advancing transparenc…
0
45
0
RT @siweiwu7: [1/n] Excited to announce the release of our new paper "A Comparative Study on Reasoning Patterns of OpenAI's o1 Model" https:…
0
63
0
RT @ZenMoore1: Introducing MTU-Bench: A Multi-Granularity Tool-Use Benchmark for Large Language Models. arxiv: dai…
0
2
0
RT @jie_liu1: 🚀 Excited to share our Storm-7B 🌪️. This model achieves a 50.5% length-controlled win rate against GPT-4 Preview, making it the…
0
18
0
RT @GeZhang86038849: [1/n] New Benchmark Alert! LongIns ( is a little "brother" of LongICLBench (.
0
4
0