Huan Sun (Hiring Ph.D. students for Fall26)
@hhsun1
Followers
6K
Following
3K
Media
54
Statuses
833
Prof. @OhioState, endowed CoE Innovation Scholar, advancing the capability and safety/security of LLM-based agents, understanding transformers' limitations
The Ohio State University
Joined March 2012
Jianyang is a fantastic collaborator. He led BioCLIP 2 to a huge success, and is also a great mentor to younger students and larger teams (FinerCAM, SST). 10/10 recommend working with/hiring Jianyang!
1
1
2
I’m also hiring researchers and engineers at #NeurIPS for NeoCognition. If you find true passion in agent plasticity and reliability or an alternative path to general intelligence, happy to chat! hiring@neocognition.io
@osunlp reunion at #NeurIPS! Many are on the faculty/industry job market @luke_ch_song @iamsamstevens @BoshiWang2 @vimar_gu @zhenwang9102. They are top in their game. Go talk to them!
1
9
61
So nice to see everyone! Wish whoever on the job market or taking new adventures BEST of LUCK!
@osunlp reunion at #NeurIPS! Many are on the faculty/industry job market @luke_ch_song @iamsamstevens @BoshiWang2 @vimar_gu @zhenwang9102. They are top in their game. Go talk to them!
1
0
27
@ysu_nlp is one of the most visionary people IMO. He was thinking about computer-use agents, world models, and continual learning 2-3 years ago, long before they became mainstream. I have always trusted his vision, and this time is no exception. Working at NeoCognition is one of
Life update: I moved to silicon valley to tackle agents' biggest challenges: plasticity and reliability. Today's agents are smart but brittle. They lack plasticity (continual learning and adaptation) and reliability (stable, predictable behavior with bounded failures). These two
2
8
44
The GDM mechanistic interpretability team has pivoted to a new approach: pragmatic interpretability Our post details how we now do research, why now is the time to pivot, why we expect this way to have more impact and why we think other interp researchers should follow suit
30
91
655
@DakingRai is one of my best. Highly recommend for any team recruiting for LLM Interp/reasoning/controllability/etc. He plans to graduate in a year or so. Go meet him at the poster session or msg him for a chat directly! Exhibit Hall C,D,E #3815 Thu 4 Dec 4:30 p.m. PST — 7:30
I’m actively looking for Summer 2026 internships focused on language model interpretability and methods to improve model reasoning and controllability. I’m also attending @NeurIPSConf —would love to connect! Resume & details:
1
4
19
Paper rejected from #NeurIPS on space constraints with 3/4 accepts. Resubmit to #ICLR: 2 rejects, 1 accept. Work hard, get miracle: responsive reviewers who all raise their scores after discussion. Then all scores revert due to a bug. What a ride @iclr_conf!"
2
8
199
I think that the ICLR response to leaking of identites is a bit nonsense.. as reviewer I was in the middle of a discussion with the authors on a couple of papers, and I really wanted to ask some final questions to finish understanding how to change the score
7
3
87
I respect that @iclr_conf had to respond to the OR leak, but I disagree with resetting scores. Many students worked hard on rebuttals and improved their papers in good faith. I hope the organizers reconsider and revert the reset. If you agree, feel free to retweet.
6
49
152
I was an ICLR PC two years ago. Woke up every morning to hundreds of unread conference-related emails. It’s simply not sustainable. And let’s not blame the committee—they’re already working nonstop putting out fires. We need structural fixes: smarter tooling, sane reviewer
it may seem like an ordinary day, but it could become the strangest moment in peer review and open science please please please treat our community with care. it’s already so fragile. don’t let it die.
7
19
246
Pros: Claude 4.5 Opus sets a new SoTA on OSWorld for computer-use capabilities. Cons: It also achieves the highest ASR (83%) on our new RedTeamCUA evaluation. Capability is rising fast, but trustworthiness depends on security keeping pace.
Computer-use agents are getting more capable, but are they getting more secure? No! We are actually observing the opposite trend. @AnthropicAI @claudeai Opus 4.5 released two days ago tops the OSWorld leaderboard, but is showing the highest Attack Success Rate (ASR) on our
0
2
3
It's time to shift from LLM alignment to CUA alignment! I will be attending NeurIPS next week. Happy to chat more on LLM/Agent safety/security and build truly deployable and reliable agents together! I am also exploring better on-policy distillation. If you're doing similar
Computer-use agents are getting more capable, but are they getting more secure? No! We are actually observing the opposite trend. @AnthropicAI @claudeai Opus 4.5 released two days ago tops the OSWorld leaderboard, but is showing the highest Attack Success Rate (ASR) on our
3
2
15
RedTeamCUA: https://t.co/Kb5DvRZgco we will update our draft with the new results soon! Demos to see what a successful attack looks like:
osu-nlp-group.github.io
Realistic adversarial testing of computer-use agents using a hybrid web-OS environment and large-scale benchmark for indirect prompt injection based adversarial testing.
0
0
5
Computer-use agents are getting more capable, but are they getting more secure? No! We are actually observing the opposite trend. @AnthropicAI @claudeai Opus 4.5 released two days ago tops the OSWorld leaderboard, but is showing the highest Attack Success Rate (ASR) on our
Introducing Claude Opus 4.5: the best model in the world for coding, agents, and computer use. Opus 4.5 is a step forward in what AI systems can do, and a preview of larger changes to how work gets done.
1
11
33
@yoavgo Thanks Yoav! I’m not leaving academia and will certainly continue to publish. Meanwhile, agents have gotten to a point where learning from deployment is necessary. Besides business, I hope this will also be a new kind of research platform that shares back with the community.
2
1
15
Life update: I moved to silicon valley to tackle agents' biggest challenges: plasticity and reliability. Today's agents are smart but brittle. They lack plasticity (continual learning and adaptation) and reliability (stable, predictable behavior with bounded failures). These two
40
43
421
Come help Computer Science pay out the tech debt vibe coding is creating! @ohiostate CSE is hiring THREE faculty positions in Programming Languages and Software Engineering, including an endowed chair. https://t.co/sCYoRQMoNA
linkedin.com
Posted 9:41:46 PM. Timashev Professor, Tenure-Track, Open Rank The Ohio State UniversityColumbus…See this and similar jobs on LinkedIn.
1
1
1
We are hiring @osunlp! - frontier agent/LLM research - hundreds of H100s/A100s + clouds - uncapped LLM APIs Join us if you have bold ideas about the future of AI
8
21
150
🔥🔥Ph.D. positions available at the OSU NLP group @osunlp (led by Yu Su @ysu_nlp, Sachin Kumar, @shocheen and myself). Collectively, we're interested in agents (everything to advance their capability and safety/security), multilingual models, continual learning, personalization,
Huge thanks to Dr. Huan Sun (@hhsun1) for her fantastic talk last week at the AI Agent Frontier Seminar! She gave a deep dive into "Advancing the Capability and Safety of Computer-Use Agents Together." In case you missed it, the full recording is now available:
3
33
163
Big congratulations to the Yutori team!!!Thanks for choosing our Online-Mind2Web for evaluation.
Introducing Yutori Navigator 31 years ago, the modern web era began with Netscape Navigator. Today, we’re introducing Yutori Navigator — a web agent that autonomously navigates websites on its own cloud browser to complete tasks for you. Navigator achieves pareto-domination
0
4
24