
Been Kim
@_beenkim
Followers
25K
Following
2K
Media
91
Statuses
787
Research Scientist at Google DeepMind, PhD from MIT. Make machines empower people.
Joined August 2011
‼️Skibidi for Machines! :) . Developing language 🔠 between humans🧒 and machines🤖 has long been a dream - the language that will help us expand what we know so that we can communicate with machines better, and create machines better align with us. With @johnhewtt's amazing.
Understanding and control are two sides of the problem of communicating differing concepts between humans and machines. New position paper: Robert Geirhos, @_beenkim, and I argue we must develop neologisms - new words - for human and machine concepts to understand and control AI
3
12
90
RT @ActInterp: The one and only @_beenkim on Agentic Interpretability and Neologism: What LLMs Can Offer Us!
0
3
0
RT @johnhewtt: Come chat with me at our ICML poster about interpretability as a communication problem, and the need to derive new words for….
0
8
0
RT @ActInterp: Join us at 09:10 for Been’s talk on Agentic Interpretability and Neologism: what LLMs can offer us.
0
2
0
RT @WiMLworkshop: Talk panel“The Unsolved Problems of AI: What We’re Still Getting Wrong” 🤦♀️.Featuring Ana Lucic, Been Kim, Furong Huang,….
0
6
0
RT @NithishKannen: Say hi 👋to @ziwphd ,@MeeraHahn, @_beenkim, Wenjun Zeng and Rich Galt at #ICML2025 if you're interested in learning more….
arxiv.org
User prompts for generative AI models are often underspecified, leading to a misalignment between the user intent and models' understanding. As a result, users commonly have to painstakingly...
0
4
0
Going to #ICML2025 tomorrow - Sat morning! If you are graduating this year with a PhD, and you are interested in Google DeepMind, send me a message. Where I will be: . 0. Wed 1pm @WiMLworkshop Mentoring Table and 2pm panel discussion. 1. Wed 4:30pm East Exhibition Hall A-B.
2
5
129
We (@_beenkim @johnhewtt @NeelNanda5 Noah Fiedel Oyvind Tafjord) propose a research direction called 🤖agentic interpretability: we can and should ask and help AI systems to build mental models of us which will help us to build mental models of the LLMs.
9
32
220
Excited to give the opening keynote at bidirectional human AI alignment workshop time at 9am! . And adding a picture with @ylecun after being stopped every second by folks who wanted to take pictures with him (we were just walking out of ICLR board meeting). The peer pressure was
🚀 #ICLR2025 & #CHI2025 are just around the corner — and we’re excited to welcome you to our Bidirectional 👫Human-AI🤖 Alignment events!. 🏅 "Golden Sponsors"🏅. A heartfelt thank you to our two generous Golden Sponsors: 🌟@Prolific and 🌟@Layer6AI of @TDBank_US! . Their
5
6
206
RT @iclr_conf: Huge thanks to the #ICLR2025 Organizing Committee (including many who couldn't make it to the conference) 👏👏👏 .
0
27
0
See you in Singapore! #ICLR2025.
🚀 #ICLR2025 & #CHI2025 are just around the corner — and we’re excited to welcome you to our Bidirectional 👫Human-AI🤖 Alignment events!. 🏅 "Golden Sponsors"🏅. A heartfelt thank you to our two generous Golden Sponsors: 🌟@Prolific and 🌟@Layer6AI of @TDBank_US! . Their
0
1
19
RT @PrincetonAInews: As AI becomes increasingly complex, new language is needed to improve communication between humans and machines, @Goog….
0
2
0
♟️♟️Now our work on teaching superhuman chess strategies to grandmasters (one of whom @DGukesh who became the latest and the youngest world chess champion) is published on PNAS! 🎉🎉. Yes, we can transfer machine knowledge to humans to push the frontier of human. knowledge.
Excited to share that our paper "Bridging the human–AI knowledge gap through concept discovery and transfer in AlphaZero" is now out in PNAS! . With @weballergy, @banburismus_, @demishassabis, @ulrichpaquet, @_beenkim 🎉. 📄
0
19
156