Huan Sun (Hiring Ph.D. students for Fall26) Profile
Huan Sun (Hiring Ph.D. students for Fall26)

@hhsun1

Followers: 6K
Following: 3K
Media: 54
Statuses: 833

Prof. @OhioState, endowed CoE Innovation Scholar, advancing the capability and safety/security of LLM-based agents, understanding transformers' limitations

The Ohio State University
Joined March 2012
@iamsamstevens
Sam Stevens @ NeurIPS 2025
4 days
Jianyang is a fantastic collaborator. He led BioCLIP 2 to a huge success, and is also a great mentor to younger students and larger teams (FinerCAM, SST). 10/10 recommend working with/hiring Jianyang!
1
1
2
@ysu_nlp
Yu Su (Hiring @Neurips)
4 days
I’m also hiring researchers and engineers at #NeurIPS for NeoCognition. If you find true passion in agent plasticity and reliability or an alternative path to general intelligence, happy to chat! hiring@neocognition.io
@osunlp
OSU NLP Group
4 days
@osunlp reunion at #NeurIPS! Many are on the faculty/industry job market @luke_ch_song @iamsamstevens @BoshiWang2 @vimar_gu @zhenwang9102. They are top in their game. Go talk to them!
1
9
61
@hhsun1
Huan Sun (Hiring Ph.D. students for Fall26)
4 days
So nice to see everyone! Wishing everyone on the job market or starting new adventures the BEST of LUCK!
@osunlp
OSU NLP Group
4 days
@osunlp reunion at #NeurIPS! Many are on the faculty/industry job market @luke_ch_song @iamsamstevens @BoshiWang2 @vimar_gu @zhenwang9102. They are top in their game. Go talk to them!
1
0
27
@KaiZhang_CS
Kai Zhang
7 days
@ysu_nlp is one of the most visionary people IMO. He was thinking about computer-use agents, world models, and continual learning 2-3 years ago, long before they became mainstream. I have always trusted his vision, and this time is no exception. Working at NeoCognition is one of
@ysu_nlp
Yu Su (Hiring @Neurips)
13 days
Life update: I moved to Silicon Valley to tackle agents' biggest challenges: plasticity and reliability. Today's agents are smart but brittle. They lack plasticity (continual learning and adaptation) and reliability (stable, predictable behavior with bounded failures). These two
2
8
44
@NeelNanda5
Neel Nanda
7 days
The GDM mechanistic interpretability team has pivoted to a new approach: pragmatic interpretability. Our post details how we now do research, why now is the time to pivot, why we expect this way to have more impact and why we think other interp researchers should follow suit
30
91
655
@ZiyuYao
Ziyu Yao (Hiring Fall'26 PhDs)
8 days
@DakingRai is one of my best. Highly recommend for any team recruiting for LLM Interp/reasoning/controllability/etc. He plans to graduate in a year or so. Go meet him at the poster session or msg him for a chat directly! Exhibit Hall C,D,E #3815 Thu 4 Dec 4:30 p.m. PST — 7:30
@DakingRai
Daking Rai
8 days
I’m actively looking for Summer 2026 internships focused on language model interpretability and methods to improve model reasoning and controllability. I’m also attending @NeurIPSConf —would love to connect! Resume & details:
1
4
19
@egor_zverev_ai
Egor Zverev
10 days
Paper rejected from #NeurIPS on space constraints with 3/4 accepts. Resubmit to #ICLR: 2 rejects, 1 accept. Work hard, get miracle: responsive reviewers who all raise their scores after discussion. Then all scores revert due to a bug. What a ride @iclr_conf!
2
8
199
@MirceaSci
Mircea Petrache
10 days
I think that the ICLR response to leaking of identities is a bit nonsense. As a reviewer, I was in the middle of a discussion with the authors on a couple of papers, and I really wanted to ask some final questions to finish understanding how to change the score
7
3
87
@chrsmrrs
Christopher Morris
10 days
I respect that @iclr_conf had to respond to the OR leak, but I disagree with resetting scores. Many students worked hard on rebuttals and improved their papers in good faith. I hope the organizers reconsider and revert the reset. If you agree, feel free to retweet.
6
49
152
@MengdiWang10
Mengdi Wang
10 days
I was an ICLR PC two years ago. Woke up every morning to hundreds of unread conference-related emails. It’s simply not sustainable. And let’s not blame the committee—they’re already working nonstop putting out fires. We need structural fixes: smarter tooling, sane reviewer
@sainingxie
Saining Xie
11 days
it may seem like an ordinary day, but it could become the strangest moment in peer review and open science. please please please treat our community with care. it’s already so fragile. don’t let it die.
7
19
246
@Jaylen_JonesNLP
Jaylen Jones
12 days
Pros: Claude 4.5 Opus sets a new SoTA on OSWorld for computer-use capabilities. Cons: It also achieves the highest ASR (83%) on our new RedTeamCUA evaluation. Capability is rising fast, but trustworthiness depends on security keeping pace.
@hhsun1
Huan Sun (Hiring Ph.D. students for Fall26)
12 days
Computer-use agents are getting more capable, but are they getting more secure? No! We are actually observing the opposite trend. @AnthropicAI @claudeai Opus 4.5 released two days ago tops the OSWorld leaderboard, but is showing the highest Attack Success Rate (ASR) on our
0
2
3
@LiaoZeyi
Zeyi Liao
12 days
It's time to shift from LLM alignment to CUA alignment! I will be attending NeurIPS next week. Happy to chat more on LLM/Agent safety/security and build truly deployable and reliable agents together! I am also exploring better on-policy distillation. If you're doing similar
@hhsun1
Huan Sun (Hiring Ph.D. students for Fall26)
12 days
Computer-use agents are getting more capable, but are they getting more secure? No! We are actually observing the opposite trend. @AnthropicAI @claudeai Opus 4.5 released two days ago tops the OSWorld leaderboard, but is showing the highest Attack Success Rate (ASR) on our
3
2
15
@hhsun1
Huan Sun (Hiring Ph.D. students for Fall26)
12 days
RedTeamCUA: https://t.co/Kb5DvRZgco we will update our draft with the new results soon! Demos to see what a successful attack looks like:
osu-nlp-group.github.io
Realistic adversarial testing of computer-use agents using a hybrid web-OS environment and large-scale benchmark for indirect prompt injection based adversarial testing.
0
0
5
@hhsun1
Huan Sun (Hiring Ph.D. students for Fall26)
12 days
Computer-use agents are getting more capable, but are they getting more secure? No! We are actually observing the opposite trend. @AnthropicAI @claudeai Opus 4.5 released two days ago tops the OSWorld leaderboard, but is showing the highest Attack Success Rate (ASR) on our
@claudeai
Claude
14 days
Introducing Claude Opus 4.5: the best model in the world for coding, agents, and computer use. Opus 4.5 is a step forward in what AI systems can do, and a preview of larger changes to how work gets done.
1
11
33
@ysu_nlp
Yu Su (Hiring @Neurips)
12 days
@yoavgo Thanks Yoav! I’m not leaving academia and will certainly continue to publish. Meanwhile, agents have gotten to a point where learning from deployment is necessary. Besides business, I hope this will also be a new kind of research platform that shares back with the community.
2
1
15
@ysu_nlp
Yu Su (Hiring @Neurips)
13 days
Life update: I moved to Silicon Valley to tackle agents' biggest challenges: plasticity and reliability. Today's agents are smart but brittle. They lack plasticity (continual learning and adaptation) and reliability (stable, predictable behavior with bounded failures). These two
40
43
421
@arnabdotorg
Arnab Nandi ⭐️
18 days
Come help Computer Science pay out the tech debt vibe coding is creating! @ohiostate CSE is hiring THREE faculty positions in Programming Languages and Software Engineering, including an endowed chair. https://t.co/sCYoRQMoNA
linkedin.com
Posted 9:41:46 PM. Timashev Professor, Tenure-Track, Open Rank. The Ohio State University, Columbus… See this and similar jobs on LinkedIn.
1
1
1
@ysu_nlp
Yu Su (Hiring @Neurips)
18 days
We are hiring @osunlp! - frontier agent/LLM research - hundreds of H100s/A100s + clouds - uncapped LLM APIs Join us if you have bold ideas about the future of AI
@hhsun1
Huan Sun (Hiring Ph.D. students for Fall26)
18 days
🔥🔥Ph.D. positions available at the OSU NLP group @osunlp (led by Yu Su @ysu_nlp, Sachin Kumar, @shocheen and myself). Collectively, we're interested in agents (everything to advance their capability and safety/security), multilingual models, continual learning, personalization,
8
21
150
@hhsun1
Huan Sun (Hiring Ph.D. students for Fall26)
18 days
🔥🔥Ph.D. positions available at the OSU NLP group @osunlp (led by Yu Su @ysu_nlp, Sachin Kumar, @shocheen and myself). Collectively, we're interested in agents (everything to advance their capability and safety/security), multilingual models, continual learning, personalization,
@MingJin80233626
Ming Jin (hiring Fall'26 PhDs)
19 days
Huge thanks to Dr. Huan Sun (@hhsun1) for her fantastic talk last week at the AI Agent Frontier Seminar! She gave a deep dive into "Advancing the Capability and Safety of Computer-Use Agents Together." In case you missed it, the full recording is now available:
3
33
163
@hhsun1
Huan Sun (Hiring Ph.D. students for Fall26)
19 days
Big congratulations to the Yutori team!!! Thanks for choosing our Online-Mind2Web for evaluation.
@DhruvBatra_
Dhruv Batra
19 days
Introducing Yutori Navigator. 31 years ago, the modern web era began with Netscape Navigator. Today, we’re introducing Yutori Navigator, a web agent that autonomously navigates websites on its own cloud browser to complete tasks for you. Navigator achieves pareto-domination
0
4
24