Been Kim Profile
Been Kim

@_beenkim

Followers
25K
Following
2K
Media
91
Statuses
787

Research Scientist at Google DeepMind, PhD from MIT. Make machines empower people.

Joined August 2011
Don't wanna be here? Send us removal request.
@_beenkim
Been Kim
6 months
‼️Skibidi for Machines! :) . Developing language 🔠 between humans🧒 and machines🤖 has long been a dream - the language that will help us expand what we know so that we can communicate with machines better, and create machines better align with us. With @johnhewtt's amazing.
@johnhewtt
John Hewitt
6 months
Understanding and control are two sides of the problem of communicating differing concepts between humans and machines. New position paper: Robert Geirhos, @_beenkim, and I argue we must develop neologisms - new words - for human and machine concepts to understand and control AI
Tweet media one
3
12
90
@_beenkim
Been Kim
1 month
RT @ActInterp: The one and only @_beenkim on Agentic Interpretability and Neologism: What LLMs Can Offer Us!
Tweet media one
0
3
0
@grok
Grok
4 days
Join millions who have switched to Grok.
211
240
2K
@_beenkim
Been Kim
1 month
RT @johnhewtt: Come chat with me at our ICML poster about interpretability as a communication problem, and the need to derive new words for….
0
8
0
@_beenkim
Been Kim
1 month
RT @ActInterp: Join us at 09:10 for Been’s talk on Agentic Interpretability and Neologism: what LLMs can offer us.
0
2
0
@_beenkim
Been Kim
1 month
RT @WiMLworkshop: Talk panel“The Unsolved Problems of AI: What We’re Still Getting Wrong” 🤦‍♀️.Featuring Ana Lucic, Been Kim, Furong Huang,….
0
6
0
@_beenkim
Been Kim
1 month
Going to #ICML2025 tomorrow - Sat morning! If you are graduating this year with a PhD, and you are interested in Google DeepMind, send me a message. Where I will be: . 0. Wed 1pm @WiMLworkshop Mentoring Table and 2pm panel discussion. 1. Wed 4:30pm East Exhibition Hall A-B.
2
5
129
@_beenkim
Been Kim
2 months
In this paper, we layout some core properties of agentic interpretability (i.e., proactive assistance, multi-turn interaction, mutual mental model), provide some real and some imagined examples of agentic interpretability (e.g., sec 2.2.3 `open-model surgery’ idea where we can.
0
0
2
@_beenkim
Been Kim
2 months
A method is agentic interpretability if it `proactively assists human understanding in a multi-turn interactive process by developing and leveraging a mental model of the user which in turn enables humans to develop better mental models of the LLM’. In other words, enable.
1
0
6
@_beenkim
Been Kim
2 months
By keeping up with the evolution of machines, we believe that this can help increasing human empowerment and helping reduce bad outcomes like gradual disempowerment (.
1
0
2
@_beenkim
Been Kim
2 months
The idea of agentic interpretability stems from the observation that AI systems are becoming increasingly adept at communicating with us, verbalizing their thoughts, and providing explanations. So why not enable them to help us understand them?.
1
0
2
@_beenkim
Been Kim
2 months
We (@_beenkim @johnhewtt @NeelNanda5 Noah Fiedel Oyvind Tafjord) propose a research direction called 🤖agentic interpretability: we can and should ask and help AI systems to build mental models of us which will help us to build mental models of the LLMs.
Tweet media one
9
32
220
@_beenkim
Been Kim
4 months
Excited to give the opening keynote at bidirectional human AI alignment workshop time at 9am! . And adding a picture with @ylecun after being stopped every second by folks who wanted to take pictures with him (we were just walking out of ICLR board meeting). The peer pressure was
Tweet media one
@huashen218
Hua Shen✨
4 months
🚀 #ICLR2025 & #CHI2025 are just around the corner — and we’re excited to welcome you to our Bidirectional 👫Human-AI🤖 Alignment events!. 🏅 "Golden Sponsors"🏅. A heartfelt thank you to our two generous Golden Sponsors: 🌟@Prolific and 🌟@Layer6AI of @TDBank_US! . Their
Tweet media one
5
6
206
@_beenkim
Been Kim
4 months
RT @iclr_conf: Huge thanks to the #ICLR2025 Organizing Committee (including many who couldn't make it to the conference) 👏👏👏 .
0
27
0
@_beenkim
Been Kim
4 months
See you in Singapore! #ICLR2025.
@huashen218
Hua Shen✨
4 months
🚀 #ICLR2025 & #CHI2025 are just around the corner — and we’re excited to welcome you to our Bidirectional 👫Human-AI🤖 Alignment events!. 🏅 "Golden Sponsors"🏅. A heartfelt thank you to our two generous Golden Sponsors: 🌟@Prolific and 🌟@Layer6AI of @TDBank_US! . Their
Tweet media one
0
1
19
@_beenkim
Been Kim
5 months
RT @PrincetonAInews: As AI becomes increasingly complex, new language is needed to improve communication between humans and machines, @Goog….
0
2
0
@_beenkim
Been Kim
5 months
RT @ziwphd: Thrilled and grateful for the invitation from @HDRUNDP to share our research on proactive AI. Such a valuable opportunity to d….
0
3
0
@_beenkim
Been Kim
5 months
♟️♟️Now our work on teaching superhuman chess strategies to grandmasters (one of whom @DGukesh who became the latest and the youngest world chess champion) is published on PNAS! 🎉🎉. Yes, we can transfer machine knowledge to humans to push the frontier of human. knowledge.
@miouantoinette
Lisa Schut
5 months
Excited to share that our paper "Bridging the human–AI knowledge gap through concept discovery and transfer in AlphaZero" is now out in PNAS! . With @weballergy, @banburismus_, @demishassabis, @ulrichpaquet, @_beenkim 🎉. 📄
0
19
156
@_beenkim
Been Kim
5 months
On the gorgeous Princeton campus, I'm excited to give a distinguished lecture tomorrow! This nice poster they gifted me shows my entire wardrobe—I’ll be wearing that jacket. Again.😅😅
Tweet media one
3
3
130
@_beenkim
Been Kim
5 months
RT @JeffDean: 🥁Introducing Gemini 2.5, our most intelligent model with impressive capabilities in advanced reasoning and coding. Now integ….
0
315
0