Huan Sun (Hiring Ph.D. students for Fall26) @hhsun1 X Profile

Huan Sun (Hiring Ph.D. students for Fall26)

@hhsun1

Followers

6K

Following

3K

Media

54

Statuses

833

Prof. @OhioState, endowed CoE Innovation Scholar, advancing the capability and safety/security of LLM-based agents, understanding transformers' limitations

https://t.co/nCB4TM0UgT

The Ohio State University

Joined March 2012

Don't wanna be here? Send us removal request.

Sam Stevens @ NeurIPS 2025

@iamsamstevens

4 days

Jianyang is a fantastic collaborator. He led BioCLIP 2 to a huge success, and is also a great mentor to younger students and larger teams (FinerCAM, SST). 10/10 recommend working with/hiring Jianyang!

1

2

Yu Su (Hiring @Neurips)

@ysu_nlp

4 days

I’m also hiring researchers and engineers at #NeurIPS for NeoCognition. If you find true passion in agent plasticity and reliability or an alternative path to general intelligence, happy to chat! hiring@neocognition.io

OSU NLP Group

@osunlp

4 days

@osunlp reunion at #NeurIPS! Many are on the faculty/industry job market @luke_ch_song @iamsamstevens @BoshiWang2 @vimar_gu @zhenwang9102. They are top in their game. Go talk to them!

1

9

61

Huan Sun (Hiring Ph.D. students for Fall26)

@hhsun1

4 days

So nice to see everyone! Wish whoever on the job market or taking new adventures BEST of LUCK!

OSU NLP Group

@osunlp

4 days

@osunlp reunion at #NeurIPS! Many are on the faculty/industry job market @luke_ch_song @iamsamstevens @BoshiWang2 @vimar_gu @zhenwang9102. They are top in their game. Go talk to them!

1

0

27

Kai Zhang

@KaiZhang_CS

7 days

@ysu_nlp is one of the most visionary people IMO. He was thinking about computer-use agents, world models, and continual learning 2-3 years ago, long before they became mainstream. I have always trusted his vision, and this time is no exception. Working at NeoCognition is one of

Yu Su (Hiring @Neurips)

@ysu_nlp

13 days

Life update: I moved to silicon valley to tackle agents' biggest challenges: plasticity and reliability. Today's agents are smart but brittle. They lack plasticity (continual learning and adaptation) and reliability (stable, predictable behavior with bounded failures). These two

2

8

44

Neel Nanda

@NeelNanda5

7 days

The GDM mechanistic interpretability team has pivoted to a new approach: pragmatic interpretability Our post details how we now do research, why now is the time to pivot, why we expect this way to have more impact and why we think other interp researchers should follow suit

30

91

655

Ziyu Yao (Hiring Fall'26 PhDs)

@ZiyuYao

8 days

@DakingRai is one of my best. Highly recommend for any team recruiting for LLM Interp/reasoning/controllability/etc. He plans to graduate in a year or so. Go meet him at the poster session or msg him for a chat directly! Exhibit Hall C,D,E #3815 Thu 4 Dec 4:30 p.m. PST — 7:30

Daking Rai

@DakingRai

8 days

I’m actively looking for Summer 2026 internships focused on language model interpretability and methods to improve model reasoning and controllability. I’m also attending @NeurIPSConf —would love to connect! Resume & details:

1

4

19

Egor Zverev

@egor_zverev_ai

10 days

Paper rejected from #NeurIPS on space constraints with 3/4 accepts. Resubmit to #ICLR: 2 rejects, 1 accept. Work hard, get miracle: responsive reviewers who all raise their scores after discussion. Then all scores revert due to a bug. What a ride @iclr_conf!"

2

8

199

Mircea Petrache

@MirceaSci

10 days

I think that the ICLR response to leaking of identites is a bit nonsense.. as reviewer I was in the middle of a discussion with the authors on a couple of papers, and I really wanted to ask some final questions to finish understanding how to change the score

7

3

87

Christopher Morris

@chrsmrrs

10 days

I respect that @iclr_conf had to respond to the OR leak, but I disagree with resetting scores. Many students worked hard on rebuttals and improved their papers in good faith. I hope the organizers reconsider and revert the reset. If you agree, feel free to retweet.

6

49

152

Mengdi Wang

@MengdiWang10

10 days

I was an ICLR PC two years ago. Woke up every morning to hundreds of unread conference-related emails. It’s simply not sustainable. And let’s not blame the committee—they’re already working nonstop putting out fires. We need structural fixes: smarter tooling, sane reviewer

Saining Xie

@sainingxie

11 days

it may seem like an ordinary day, but it could become the strangest moment in peer review and open science please please please treat our community with care. it’s already so fragile. don’t let it die.

7

19

246

Jaylen Jones

@Jaylen_JonesNLP

12 days

Pros: Claude 4.5 Opus sets a new SoTA on OSWorld for computer-use capabilities. Cons: It also achieves the highest ASR (83%) on our new RedTeamCUA evaluation. Capability is rising fast, but trustworthiness depends on security keeping pace.

Huan Sun (Hiring Ph.D. students for Fall26)

@hhsun1

12 days

Computer-use agents are getting more capable, but are they getting more secure? No! We are actually observing the opposite trend. @AnthropicAI @claudeai Opus 4.5 released two days ago tops the OSWorld leaderboard, but is showing the highest Attack Success Rate (ASR) on our

0

2

3

Zeyi Liao

@LiaoZeyi

12 days

It's time to shift from LLM alignment to CUA alignment! I will be attending NeurIPS next week. Happy to chat more on LLM/Agent safety/security and build truly deployable and reliable agents together! I am also exploring better on-policy distillation. If you're doing similar

Huan Sun (Hiring Ph.D. students for Fall26)

@hhsun1

12 days

Computer-use agents are getting more capable, but are they getting more secure? No! We are actually observing the opposite trend. @AnthropicAI @claudeai Opus 4.5 released two days ago tops the OSWorld leaderboard, but is showing the highest Attack Success Rate (ASR) on our

3

2

15

Huan Sun (Hiring Ph.D. students for Fall26)

@hhsun1

12 days

RedTeamCUA: https://t.co/Kb5DvRZgco we will update our draft with the new results soon! Demos to see what a successful attack looks like:

osu-nlp-group.github.io

Realistic adversarial testing of computer-use agents using a hybrid web-OS environment and large-scale benchmark for indirect prompt injection based adversarial testing.

0

5

Huan Sun (Hiring Ph.D. students for Fall26)

@hhsun1

12 days

Computer-use agents are getting more capable, but are they getting more secure? No! We are actually observing the opposite trend. @AnthropicAI @claudeai Opus 4.5 released two days ago tops the OSWorld leaderboard, but is showing the highest Attack Success Rate (ASR) on our

Claude

@claudeai

14 days

Introducing Claude Opus 4.5: the best model in the world for coding, agents, and computer use. Opus 4.5 is a step forward in what AI systems can do, and a preview of larger changes to how work gets done.

1

11

33

Yu Su (Hiring @Neurips)

@ysu_nlp

12 days

@yoavgo Thanks Yoav! I’m not leaving academia and will certainly continue to publish. Meanwhile, agents have gotten to a point where learning from deployment is necessary. Besides business, I hope this will also be a new kind of research platform that shares back with the community.

2

1

15

Yu Su (Hiring @Neurips)

@ysu_nlp

13 days

Life update: I moved to silicon valley to tackle agents' biggest challenges: plasticity and reliability. Today's agents are smart but brittle. They lack plasticity (continual learning and adaptation) and reliability (stable, predictable behavior with bounded failures). These two

40

43

421

Arnab Nandi ⭐️

@arnabdotorg

18 days

Come help Computer Science pay out the tech debt vibe coding is creating! @ohiostate CSE is hiring THREE faculty positions in Programming Languages and Software Engineering, including an endowed chair. https://t.co/sCYoRQMoNA

linkedin.com

Posted 9:41:46 PM. Timashev Professor, Tenure-Track, Open Rank The Ohio State UniversityColumbus…See this and similar jobs on LinkedIn.

1

Yu Su (Hiring @Neurips)

@ysu_nlp

18 days

We are hiring @osunlp! - frontier agent/LLM research - hundreds of H100s/A100s + clouds - uncapped LLM APIs Join us if you have bold ideas about the future of AI

Huan Sun (Hiring Ph.D. students for Fall26)

@hhsun1

18 days

🔥🔥Ph.D. positions available at the OSU NLP group @osunlp (led by Yu Su @ysu_nlp, Sachin Kumar, @shocheen and myself). Collectively, we're interested in agents (everything to advance their capability and safety/security), multilingual models, continual learning, personalization,

8

21

150

Huan Sun (Hiring Ph.D. students for Fall26)

@hhsun1

18 days

🔥🔥Ph.D. positions available at the OSU NLP group @osunlp (led by Yu Su @ysu_nlp, Sachin Kumar, @shocheen and myself). Collectively, we're interested in agents (everything to advance their capability and safety/security), multilingual models, continual learning, personalization,

Ming Jin (hiring Fall'26 PhDs)

@MingJin80233626

19 days

Huge thanks to Dr. Huan Sun (@hhsun1) for her fantastic talk last week at the AI Agent Frontier Seminar! She gave a deep dive into "Advancing the Capability and Safety of Computer-Use Agents Together." In case you missed it, the full recording is now available:

3

33

163

Huan Sun (Hiring Ph.D. students for Fall26)

@hhsun1

19 days

Big congratulations to the Yutori team!!!Thanks for choosing our Online-Mind2Web for evaluation.

Dhruv Batra

@DhruvBatra_

19 days

Introducing Yutori Navigator 31 years ago, the modern web era began with Netscape Navigator. Today, we’re introducing Yutori Navigator — a web agent that autonomously navigates websites on its own cloud browser to complete tasks for you. Navigator achieves pareto-domination

0

4

24