
Tom Everitt
@tom4everitt
Followers
2K
Following
10K
Media
14
Statuses
220
AGI safety researcher at @GoogleDeepMind, leading https://t.co/gBAjHPCTcL switching to https://t.co/aIojqeDT0I
London
Joined August 2017
pretty cool effort to distribute economic power in the age of powerful AI.
With @luke_drago_, I’m cofounding Workshop Labs, a public benefit corporation preventing human disempowerment from AI. See below for:.-impact case.-what we’re building.-what we hope the future looks like.-what we’re hiring for.
0
0
3
RT @MichaelD1729: Someone needs to use this as the basis of an unsupervised environment design algorithm to give AI designers direct contro….
0
2
0
Causality is about predicting how interventions affect outcomes. Can we use causality to predict how environment changes affect agent behavior?. We explore this idea in a new paper.
Can we trust a black-box system, when all we know is its past behaviour? 🤖🤔.In a new #ICML2025 paper we derive fundamental bounds on the predictability of black-box agents. This is a critical question for #AgentSafety. 🧵
2
1
20
new world models paper, this time with task-generality rather than robustness.
Are world models necessary to achieve human-level agents, or is there a model-free short-cut?.Our new #ICML2025 paper tackles this question from first principles, and finds a surprising answer, agents _are_ world models… 🧵
1
1
16
Link to paper: Joint work with: @ggarbacea @alexis_bellot_ @jonathanrichens @HenryPapadatos @Simeon_Cps @rohinmshah from @googledeepmind @UChicago and SaferAI.
1
1
6
RT @F_Rhys_Ward: In real-life, agents with different subjective beliefs interact in a shared objective reality. They have higher-order beli….
0
12
0
One thing that I really like about this is that my content is much less determined by who I follow, than by which posts I like. This means I can express my approval for a post, without worrying that similar content will now flood my feed.
Instead, there's a market place of content selection algorithms. My favourites are.* "Following": simple chronological feed (default).* "Quiet posters": posts from less frequent posters in your feed.* "Paper Skygest": posts about papers.
0
0
1