Kevin Klyman @kevin_klyman X Profile

Kevin Klyman

@kevin_klyman

Followers

3K

Following

21K

Media

31

Statuses

3K

AI policy @StanfordHAI. Personal account, views do not represent those of my employer. Tweets auto-delete periodically

https://t.co/7y4fYGjRzd

Joined April 2016

Don't wanna be here? Send us removal request.

Kevin Klyman

@kevin_klyman

1 year

I'll be at the AI Ethics and Society conference in San Jose this coming week, presenting on my work on acceptable use policies for large language models! If you're in town come see my talk

4

55

Kevin Klyman

@kevin_klyman

4 months

I'm at #Facct2025 this week in Athens - if you're in town let's meet up! My papers at the conference cover why language models cannot replace therapists, redress in the AI supply chain, and taxonomizing AI regulation across 5 countries

3

1

18

Yi Zeng 曾祎 @ICCV

@EasonZeng623

6 months

AIR-Bench is a Spotlight @iclr_conf 2025! Catch our poster on Fri, Apr 26, 10 a.m.–12:30 p.m. SGT (Poster Session 5). Sadly, I won’t be there in person (visa woes, again), but the insights—and our incredible team—will be with you in Singapore. Go say hi 👋

Yi Zeng 曾祎 @ICCV

@EasonZeng623

1 year

🧵[1/5] Introducing AIR 2024: Unifying AI risk categorizations with a shared language to improve AI safety. W/ @kevin_klyman @andyz245 @YUYANG_UCLA @MinzhouP & guidance from @ruoxijia @dawnsongtweets @percyliang @uiuc_aisecure for kicking off my AI policy research journey 🏦.

0

3

21

Jared Moore

@jaredlcm

6 months

🔎We came up with these experiments by conducting a mapping review of what constitutes good therapy, and identify **practical** reasons that LLM-powered therapy chatbots fail (e.g. they express stigma and respond inappropriately.

1

2

Jared Moore

@jaredlcm

6 months

🧵I'm thrilled to announce that I'll be going to @FAccTConference this June to present timely work on why current LLMs cannot safely **replace** therapists. We find...⤵️

1

23

22

David Dayen

@ddayen

11 months

Barack Obama's end-of-presidency legacy essay ran in The Economist; Biden chose @TheProspect. https://t.co/ZAr6IdDPT7

27

85

294

Sayash Kapoor

@sayashk

11 months

More than 60 countries held elections this year. Many researchers and journalists claimed AI misinformation would destabilize democracies. What impact did AI really have? We analyzed every instance of political AI use this year collected by WIRED. New essay w/@random_walker: 🧵

6

62

166

Kevin Klyman

@kevin_klyman

11 months

I'll be at NeurIPS next week - with papers at the main conference, the workshop on Evaluating Evaluations, and the RegulatableML workshop! Please do reach out if you want to grab coffee - these days I'm working on evaluations of leading edge models and technical governance

0

1

22

Kyle Lo

@kylelostat

11 months

how do researchers use LMs in their work & why? we surveyed 800 researchers across fields of study, race, gender, seniority asking their opinions on: 🐟 which research activities (eg coding, writing) 🐠 benefits vs risks 🦈 willingness to disclose findings in RTd thread 🧵

Simona Liao

@SimonaLiao

11 months

Hi everyone, I am excited to share our large-scale survey study with 800+ researchers, which reveals researchers’ usage and perceptions of LLMs as research tools, and how the usage and perceptions differ based on researcher demographics. See results & links below👇🏼

1

10

28

Arvind Narayanan

@random_walker

11 months

📢 New short paper on the limits of one type of inference scaling, by @benediktstroebl, @sayashk and me. The first page contains the main findings and message. ↓ (The title is a play on Inference Scaling Laws.) More work on the limits of inference scaling coming soon. 🧵

6

44

181

Ani Iyengar

@aniiyengar

11 months

Typescript: "women deserve to make more than men" Python: "women deserve to make less than men" Rust: "women should be hourly contractors" Golang: "$1000 a year. best offer"

52

875

13K

Percy Liang

@percyliang

11 months

This year, I have 4 exceptional students on the academic job market, and they couldn’t be more diffferent, with research spanning AI policy, robotics, NLP, and HCI. Here’s a brief summary of their research, along with one representative work each:

7

46

693

Kevin Klyman

@kevin_klyman

11 months

The US AI Safety Institute is hiring! Looking for experts in designing/implementing evaluations for the capabilities/safety/security of advanced AI systems + research engineers with experience in cyber, bio, or adversarial ML. The app closes tonight https://t.co/WlplExeGEY

0

4

22

Percy Liang

@percyliang

1 year

How close can LM agents simulate people? We interview person P for 2 hours and prompt an LM with the transcript, yielding an agent P'. We find that P and P' behave similarly on a number of surveys and experiments. Very excited about the applications; this also forces us to think

Joon Sung Park

@joon_s_pk

1 year

Simulating human behavior with AI agents promises a testbed for policy and the social sciences. We interviewed 1,000 people for two hours each to create generative agents of them. These agents replicate their source individuals’ attitudes and behaviors. 🧵 https://t.co/FOVcOQduXO

7

36

200

Joon Sung Park

@joon_s_pk

1 year

Simulating human behavior with AI agents promises a testbed for policy and the social sciences. We interviewed 1,000 people for two hours each to create generative agents of them. These agents replicate their source individuals’ attitudes and behaviors. 🧵 https://t.co/FOVcOQduXO

27

258

973

Kevin Klyman

@kevin_klyman

1 year

Final panel happening now! Come see @HarleyGeiger, Ilona Cohen, and @AmitElazari talk about legal and policy considerations for AI evaluation

0

1

0

Shayne Longpre

@ShayneRedford

1 year

Panel now on the Design of Third-Party AI Eval & Disclosure! https://t.co/GeYmfouyDQ ➡️Deb Raji (Mozilla Fellow, UC Berkeley) @rajiinio ➡️Casey Ellis (BugCrowd Founder) ➡️Lauren McIlvenny (Director, CERT) ➡️Jono Spring (Deputy Chief AI Officer, CISA)

1

10

Shayne Longpre

@ShayneRedford

1 year

📢 Webinar on 🌟The Future of Third-Party AI Evaluation🌟 starting soon! At 8 am PT / 11 am ET join the zoom link here: https://t.co/uCYhqvty8R Co-organized w/ @kevin_klyman, @sayashk, @RishiBommasani, Michelle Sahar, @ruchowdh, @random_walker, and @percyliang

0

8

21

Kevin Klyman

@kevin_klyman

1 year

Starting in half an hour - check out our workshop on the future of AI evaluation! Co-organized with @ShayneRedford, @sayashk, @RishiBommasani, Michelle Sahar, @ruchowdh, @random_walker, and @percyliang

1

9

23

Kevin Klyman

@kevin_klyman

1 year

Come to our workshop on the future of third party AI evaluations on Monday! We have some of the top folks in the field on the docket

2

6

30

Percy Liang

@percyliang

1 year

We need 3rd party evals/audits of AI systems. How can we do this technically? What are best practices for disclosure? How can AI researchers be legally protected? If you're interested in these questions, join join our Oct 28 workshop. RSVP: https://t.co/ySj2HhlGMd Details:

6

21

123