
Alex Turner
@Turn_Trout
Followers
3K
Following
347
Media
33
Statuses
279
Research scientist on the scalable alignment team at Google DeepMind. All views are my own. https://t.co/b2PuVgl41D
Berkeley, CA
Joined December 2021
The "sleeper agent" terminology is hyperbolic and unfortunate IMO. Crying wolf. Should have reserved such an aggressive title for *actually finding dangerous sleeper agents*. But hey, it got a lot of attention.
@CongressmanRaja @AnthropicAI @jackclarkSF @MarkBeall Dunn (R-FL): Asks about Jack Clark's substack. Also asks about the @AnthropicAI / @redwood_ai paper on Sleeper Agents. @jackclarkSF confirms. If you thought that Anthropic/Redwood's approach of publishing papers lacked policy impact. well, update your beliefs.
3
4
43