
Tom Bush
@_tom_bush
Followers
144
Following
53
Media
12
Statuses
34
AI Alignment and Interpretability | Research Scholar @ MATS | Incoming DPhil @ Oxford
Joined November 2023
Attending #ICLR2025 to present this research - please reach out if you want to chat about interp, reasoning, or anything else!!
🤖 Model-free agents can internally plan!
Sokoban agents develop bidirectional search planning!
🔬 We probe for planning concepts
⚙️ Investigate plan formation
✅ Verify plans impact behavior
Chat with us at #ICLR2025!
📍 Apr 24: Poster 10am + Oral 4:06pm SGT
0
2
17
RT @_fernando_rosas: Preprint time: “AI in a vat: Fundamental limits of efficient world modelling for agent sandboxing and interpretability…
0
20
0
This research would not have been possible without the amazing support and mentorship I received from my brilliant co-authors @Stephen36910351, @usmananwar391, @AdriGarriga and @DavidSKrueger. Paper: Blog post:
0
0
6