saffronhuang Profile Banner
Saffron Huang Profile
Saffron Huang

@saffronhuang

Followers
6K
Following
4K
Media
91
Statuses
1K

how shall we live together? societal impacts researcher @AnthropicAI • @collect_intel co-founder • ex @AISecurityInst @GoogleDeepMind⋅ views mine

sf
Joined April 2013
Don't wanna be here? Send us removal request.
@saffronhuang
Saffron Huang
4 months
Really proud and excited to release this work on empirically measuring AI values “in the wild” — understanding, analyzing and taxonomizing what values guide model outputs in real interactions with real users. There is a lot of work on training models to follow particular.
@AnthropicAI
Anthropic
4 months
New Anthropic research: AI values in the wild. We want AI models to have well-aligned values. But how do we know what values they’re expressing in real-life conversations?. We studied hundreds of thousands of anonymized conversations to find out.
Tweet media one
7
22
251
@saffronhuang
Saffron Huang
2 days
RT @claudeai: Today we're launching new ways to learn in Claude Code and the Claude app. First up: Claude Code now lets you customize comm���.
0
578
0
@grok
Grok
5 days
Turn old photos into videos and see friends and family come to life. Try Grok Imagine, free for a limited time.
709
1K
5K
@saffronhuang
Saffron Huang
2 days
RT @collect_intel: @divyasiddarth and @audreyt joined @reidhoffman and @AriaIrene, hosts of the Possible Podcast, to talk about one of the….
0
10
0
@saffronhuang
Saffron Huang
26 days
RT @hannahrosekirk: My team at @AISecurityInst is hiring! This is an awesome opportunity to get involved with cutting-edge scientific resea….
0
24
0
@saffronhuang
Saffron Huang
2 months
RT @AlexTamkin: Highly recommend applying if you have ideas for understanding AI's economic impacts or policy proposals! .
0
9
0
@saffronhuang
Saffron Huang
2 months
RT @AnthropicAI: Announcing the Anthropic Economic Futures Program—our latest commitment to understanding AI's impacts on work and the econ….
0
193
0
@saffronhuang
Saffron Huang
2 months
it’s really important to know both Claude’s “EQ” not just “IQ” (for lack of better terminology) — take a look at the findings below.
@AnthropicAI
Anthropic
2 months
New Anthropic Research: How people use Claude for emotional support. From millions of anonymized conversations, we studied how adults use AI for emotional and personal needs—from navigating loneliness and relationships to asking existential questions.
Tweet media one
0
6
72
@saffronhuang
Saffron Huang
2 months
Bonus: When are warnings actually effective?. I used Claude to create a deep research report on "When do warnings about the future become a self fulfilling prophecy vs. actually preventing the situation?". Based on research & past case studies, the key difference is not in the.
1
0
5
@saffronhuang
Saffron Huang
2 months
What if we mapped causal chains and leverage points instead, to show people what can actually be done, instead of making a super-smooth, neat narrative?. (Apparently the AI 2027 folks are working on this, which is great)
Tweet media one
1
0
3
@saffronhuang
Saffron Huang
2 months
I see narratives that underrate coordination and consideration of institutions as particularly dangerous, because it underrates human agency
Tweet media one
1
0
2
@saffronhuang
Saffron Huang
2 months
If your uncertainty is so high, why anchor us to this specific line?
Tweet media one
2
0
2
@saffronhuang
Saffron Huang
2 months
Are predictions useful without an explanation that drives them?
Tweet media one
2
0
2
@saffronhuang
Saffron Huang
2 months
The mixed signals problem of "we're serious researchers!" + "but don't take this too seriously!"
Tweet media one
1
0
2
@saffronhuang
Saffron Huang
2 months
Newest @reboot_hq 🎙️post: @jessicadai_ and I discuss forecasting, and how people present unhelpful narratives about the future (mostly by picking on AI 2027, sorry guys). Why we should view the future as constructed, not predicted
Tweet media one
4
11
56
@saffronhuang
Saffron Huang
2 months
RT @IasonGabriel: Check out this work by @saffronhuang – one of the best researchers thinking about the ethical & societal impacts of AGI.
0
2
0
@saffronhuang
Saffron Huang
2 months
cc @IasonGabriel thanks for the nudge to make things more legible :~).
0
0
4
@saffronhuang
Saffron Huang
2 months
I updated my personal website! I felt like it was pretty hard to explore before, and I wanted to actually properly highlight the work/ideas that I want people to read and that I stand behind. Will keep tweaking, but have a look. :).
Tweet media one
10
4
238
@saffronhuang
Saffron Huang
3 months
glad to have shipped the values work to the system card for claude 4 :)
Tweet media one
0
1
96
@saffronhuang
Saffron Huang
3 months
RT @collect_intel: Our Research Director @zarinahagnew just sent out the fourth Global Dialogues round!. Some things we're asking people fr….
0
3
0
@saffronhuang
Saffron Huang
3 months
RT @jackclarkSF: Want to study the economic impact of AI and influence the policy choices a frontier lab makes? I'm building a team to adva….
Tweet card summary image
job-boards.greenhouse.io
0
62
0