ZhiyuanCS Profile Banner
Zhiyuan Profile
Zhiyuan

@ZhiyuanCS

Followers
375
Following
18
Media
19
Statuses
48

PhD student in @NUSingapore Visiting Researcher in @MIT

Singapore
Joined April 2018
Don't wanna be here? Send us removal request.
@ZhiyuanCS
Zhiyuan
18 days
RT @VictorKaiWang1: Customizing Your LLMs in seconds using prompts🥳!.Excited to share our latest work with @HPCAILab, @VITAGroupUT, @k_schu….
0
72
0
@ZhiyuanCS
Zhiyuan
28 days
🚨🚨Reviewed around 20 papers for @ACMMM—but our own reviews were hidden & forced on us without expertise match. Time to rethink AI community peer review. 🤔. Our author team were assigned nearly 20 papers with no regard for our areas of expertise, received only a single round of.
0
0
6
@ZhiyuanCS
Zhiyuan
2 months
I can’t believe this jaw‑dropping comic was generated by GPT just by feeding it our paper directly🤯! . It perfectly illustrates how meta‑ability training makes LRMs think better.
Tweet media one
0
0
6
@ZhiyuanCS
Zhiyuan
2 months
🚀 Beyond “aha”: toward Meta‑Abilities Alignment!.Zero human annotation enables LRMs masters strong reasoning abilities rather than aha emerging and generalize across math ⚙️, code 💻, science 🔬. Meta‑ability alignment lifts the ceiling of further domain‑RL—7B → 32B
Tweet media one
Tweet media two
2
18
98
@ZhiyuanCS
Zhiyuan
2 months
🚀 Beyond 'aha': toward Meta‑Abilities Alignment!.By self‑synthesizes training tasks & self‑verifies rewards with zero human labels, LLM systematically masters core reasoning abilities rather than aha emerging and generalize across math ⚙️, code 💻, science 🔬. Meta‑ability
Tweet media one
Tweet media two
1
2
17
@ZhiyuanCS
Zhiyuan
2 months
Although the ICLR main conference is coming to an end, we are excited to invite you to the Reasoning and Planning for LLMs Workshop, which will be held all day on Monday, April 28. We are honored to host an outstanding lineup of keynote speakers and panelists from Meta, OpenAI,
Tweet media one
1
8
23
@ZhiyuanCS
Zhiyuan
3 months
RT @NuoJohnChen: Welcome to use JudgeLRM! Compare any Hugging Face language models by asking your own questions, and explore JudgeLRM’s rea….
0
2
0
@ZhiyuanCS
Zhiyuan
5 months
🚀 Exciting news! The ICLR 2025 LLM Reasoning & Planning Workshop is offering several Student Registration Grants to support early-career researchers.💡 Free ICLR registration for in-person full-time students! . Apply by March 2, 2025. More info: Submit.
0
7
34
@ZhiyuanCS
Zhiyuan
5 months
🚀 Call for Reviewers! 🚀. Our Workshop on Reasoning and Planning for LLMs at ICLR 2025 @iclr_conf has received an overwhelming number of submissions! We are looking for reviewers to help ensure a high-quality selection process. 🔹 Max 2 papers per reviewer.🔹 Review deadline:.
0
7
17
@ZhiyuanCS
Zhiyuan
5 months
We are excited to announce that our workshop will be held on April 28 in Singapore. Due to numerous requests for extensions, we have decided to extend the submission deadline by 4 days to February 6 (AoE). We look forward to receiving your submissions and can't wait to see you at
Tweet media one
0
5
14
@ZhiyuanCS
Zhiyuan
7 months
RT @Mengyue_Yang_: 🚀 Excited to announce our World Models: Understanding, Modelling and Scaling Workshop at #ICLR2025! 🎉. Keynote speakers,….
0
14
0
@ZhiyuanCS
Zhiyuan
7 months
Our poster presentation at #NeurIPS2024 will take place today from 11:00 AM to 2:00 PM in West Ballroom A-D, Poster #7004. We warmly welcome you to stop by and engage with us!.
@ZhiyuanCS
Zhiyuan
1 year
How do LLMs conduct reasoning and planning given partial information with uncertainty? Whether they can proactively ask questions to improve decision-making?. In joint work with UW, NTU, Yale and UCL, we introduce the UoT method, which boosts the information-seeking and
Tweet media one
Tweet media two
Tweet media three
0
2
10
@ZhiyuanCS
Zhiyuan
7 months
The tentative schedule has been updated and it will be: . - Paper Deadline: February 2, 2025 (AOE) .- Notification: March 3, 2025, (AOE) .- Camera-ready: April 3, 2025.
0
0
1
@ZhiyuanCS
Zhiyuan
7 months
🚀 Exciting News! Workshop on Reasoning and Planning for Large Language Models @ ICLR 2025 is coming 🌟. Please visit our official website: 👉 With the release of o1 Pro and the growing interest in research on reasoning and planning capabilities of LLMs,
Tweet media one
8
16
104
@ZhiyuanCS
Zhiyuan
8 months
Looking forward to more work unlocking the potential of NLRL.
@Xidong_Feng
Xidong Feng
8 months
Happy to share our new exploration "Natural Language Reinforcement Learning" (NLRL), the last dance of my PhD 🛎️(1/n):. Paper: Code: (released soon). NLRL reframes core RL concepts—policy, value function, Bellman equation, MC, TD,
Tweet media one
0
0
2