
Saffron Huang
@saffronhuang
Followers
6K
Following
4K
Media
91
Statuses
1K
how shall we live together? societal impacts researcher @AnthropicAI • @collect_intel co-founder • ex @AISecurityInst @GoogleDeepMind • views mine
sf
Joined April 2013
Really proud and excited to release this work on empirically measuring AI values “in the wild”: understanding, analyzing and taxonomizing what values guide model outputs in real interactions with real users. There is a lot of work on training models to follow particular…
New Anthropic research: AI values in the wild. We want AI models to have well-aligned values. But how do we know what values they’re expressing in real-life conversations? We studied hundreds of thousands of anonymized conversations to find out.
7
22
251
RT @claudeai: Today we're launching new ways to learn in Claude Code and the Claude app. First up: Claude Code now lets you customize comm…
0
578
0
RT @collect_intel: @divyasiddarth and @audreyt joined @reidhoffman and @AriaIrene, hosts of the Possible Podcast, to talk about one of the…
0
10
0
RT @hannahrosekirk: My team at @AISecurityInst is hiring! This is an awesome opportunity to get involved with cutting-edge scientific resea…
0
24
0
RT @AlexTamkin: Highly recommend applying if you have ideas for understanding AI's economic impacts or policy proposals!
0
9
0
RT @AnthropicAI: Announcing the Anthropic Economic Futures Program—our latest commitment to understanding AI's impacts on work and the econ…
0
193
0
it’s really important to know Claude’s “EQ”, not just its “IQ” (for lack of better terminology). take a look at the findings below.
New Anthropic Research: How people use Claude for emotional support. From millions of anonymized conversations, we studied how adults use AI for emotional and personal needs—from navigating loneliness and relationships to asking existential questions.
0
6
72
Full conversation:
joinreboot.org
Listen now (60 mins) | Saffron Huang on human agency in AI forecasting
1
0
2
Newest @reboot_hq 🎙️ post: @jessicadai_ and I discuss forecasting, and how people present unhelpful narratives about the future (mostly by picking on AI 2027, sorry guys). Why we should view the future as constructed, not predicted.
4
11
56
RT @IasonGabriel: Check out this work by @saffronhuang – one of the best researchers thinking about the ethical & societal impacts of AGI.
0
2
0
RT @collect_intel: Our Research Director @zarinahagnew just sent out the fourth Global Dialogues round! Some things we're asking people fr…
0
3
0
RT @jackclarkSF: Want to study the economic impact of AI and influence the policy choices a frontier lab makes? I'm building a team to adva…
job-boards.greenhouse.io
0
62
0