Hao Sun - RL @HolarisSun X Profile

Hao Sun - RL

@HolarisSun

Followers

890

Following

537

Media

36

Statuses

166

4th-year PhD student at @Cambridge_Uni. IRL x LLMs. Superhuman intelligence needs RL, and LLMs help humans learn from machine intelligence.

Cambridge, UK

Joined October 2022

Hao Sun - RL

@HolarisSun

10 months

I have been working on Reward Modeling (and Inverse RL) for LLMs for the past 1.5 years. We built reward models (RMs) for prompting, dense RMs to improve credit assignment, and RMs from the SFT data. However, many questions remained unclear to me until this paper was finished.🧵

2

30

204

Hao Sun - RL

@HolarisSun

1 month

🚀 RL is powering breakthroughs in LLM alignment, reasoning, and agentic apps. Are you ready to dive into the RL x LLM frontier? Join us at the @aclmeeting ACL’25 tutorial, Inverse RL Meets LLM Alignment, this Sunday in Vienna🇦🇹 (Jul 27th, 9am). 📄 Preprint at

huggingface.co

0

12

67

Hao Sun - RL

@HolarisSun

2 months

This is SCIENCE🚀!!!

Alan Jeffares @ ICML 🇨🇦

@Jeffaresalan

2 months

Our new ICML 2025 oral paper proposes a new unified theory of both Double Descent and Grokking, revealing that both of these deep learning phenomena can be understood as being caused by prime numbers in the network parameters 🤯🤯. 🧵[1/8]

1

0

8

Hao Sun - RL

@HolarisSun

2 months

If you're interested in RLHF and reward modeling, check it out, and feel free to chat with Jef at #ICML2025! 📄 🔗 🤝 Joint work with @ShenRaphael and @jeanfrancois287.

github.com

Active reward modeling with last layer Fisher Information (ICML'25) - YunyiShen/ARM-FI

1

4

10

Hao Sun - RL

@HolarisSun

2 months

We revive classical tools from the stats & experimental design literature. It turns out that many modern challenges already have elegant, sample-efficient solutions hidden there. All experiments were run efficiently on CPU-only machines, using our RM infra (open-sourced)!

1

0

1

Hao Sun - RL

@HolarisSun

2 months

Our paper asks: How can we most effectively collect preference data for RM training? We find that reward modeling is like drawing the contour of a mountain using only pairwise comparisons. To trace it well, you need both local geometry and global structure 🏔️.

1

0

Hao Sun - RL

@HolarisSun

2 months

Unfortunately won't be able to attend #ICML2025 due to a long pending Canadian visa application — submitted in Oct 2023, still pending after 625 days 🙂‍↔️. That said, I'm excited to share our paper on Active Preference Learning & Understanding Reward Models 🧵👇

1

2

49

Hao Sun - RL

@HolarisSun

3 months

Now with Qwen’s RL-fine-tuning results, are we witnessing a quiet return of prompt optimization/engineering? Now we have a 2-player game: users become “lazy prompters”, but the system prompts (e.g. thinking patterns) need to be highly optimized. Next: Bi-level optimization?

0

3

Hao Sun - RL

@HolarisSun

3 months

"Knowledge belongs to humanity, and is the torch which illuminates the world." - Louis Pasteur. This is especially true of knowledge contributed by the community.

0

7

Hao Sun - RL

@HolarisSun

4 months

If AI cannot feel time, how can it really understand humans?

0

2

Hao Sun - RL

@HolarisSun

4 months

RT @jeanfrancois287: 📢New Paper on Process Reward Modelling 📢 Ever wondered about the pathologies of existing PRMs and how they could be r…

0

74

0

Hao Sun - RL

@HolarisSun

4 months

RT @jeanfrancois287: Happy to share that our paper on "Active Reward Modeling" has been accepted to ICML 2025! #ICML2025. The part I like…

0

3

0

Hao Sun - RL

@HolarisSun

4 months

OpenReview Justice!

Yunyi Shen/申云逸 🐺

@ShenRaphael

4 months

I'm honestly a bit surprised but whatever! Worth celebrating. Here is our arXived paper, with @HolarisSun and @jeanfrancois287.

0

5

Hao Sun - RL

@HolarisSun

4 months

RT @ShenRaphael: Glad to be there with @HolarisSun presenting our work

0

7

0

Hao Sun - RL

@HolarisSun

4 months

ICLR wrapped! Eggie and Toastie said it was the BEST🥰

2

50

Hao Sun - RL

@HolarisSun

4 months

The oral sessions and poster sessions are happening at the same time, so it actually feels like the oral speakers are just talking to each other🤣

Yunyi Shen/申云逸 🐺

@ShenRaphael

4 months

@HolarisSun is famous now!

0

6

Hao Sun - RL

@HolarisSun

4 months

3. Going beyond games, agentic interactions with the virtual world, across diverse tasks, can further enhance the capabilities of LLM-based AI systems. Those interactions are all EXPERIENCE from the agents themselves, rather than (bounded) knowledge from humans.

0

1

Hao Sun - RL

@HolarisSun

4 months

2. Game. However, human-centered AI systems in their current paradigm can never outperform humans (ref: “Welcome to the Era of Experience”, Silver & Sutton). Games, or rule-based tasks, combined with self-play, have great potential for the next breakthrough.

1

0

Hao Sun - RL

@HolarisSun

4 months

1. Inverse RL. We are now at the stage of “human-centered AI”, and human feedback or preferences are essential in providing REWARD SIGNALs for RL. From experience, we know that WHENEVER there is a good reward, RL is able to optimize it to its limit.

1

0

Hao Sun - RL

@HolarisSun

4 months

Recently finished an article about 𝗧𝗵𝗲 𝗙𝗼𝘂𝗿-𝗦𝘁𝗲𝗽 𝗟𝗮𝗱𝗱𝗲𝗿 𝗳𝗿𝗼𝗺 𝗥𝗟 𝘁𝗼 𝗔𝗚𝗜. imo, those steps are: 1. Inverse RL; 2. Game Experience; 3. Virtual Exp; 4. Physical Exp. Still working on polishing it, but keen to discuss with old and new friends during ICLR🇸🇬! 🧵

1

6

Hao Sun - RL

@HolarisSun

5 months

Heading to 🇸🇬ICLR next week! Can’t wait to catch up with old friends and meet new ones. Let’s chat about RL, reward models, alignment, reasoning, and agents! Also, fun fact🤓: Yunyi won’t be there physically, but his digital twin will be attending instead. Stay tuned!

0

2

18