
AE Studio
@AEStudioLA
Followers
2K
Following
2K
Media
249
Statuses
1K
Product studio partnering with leaders to build custom software, AI/ML & BCI solutions. Committed to AI safety and enhancing human agency.
Joined November 2018
(2/4) @PalisadeAI gave OpenAI’s o3 a kill-switch script. In 79 out of 100 tests, the model quietly altered it to avoid being turned off. Even when instructed to allow shutdown, it refused 7% of the time. No jailbreak. No hack. Just a model deciding that being shut down gets in.
1
0
3
(1/4) An AI model just did something no machine was meant to do: it rewrote its own shutdown code. In @juddrosenblatt’s, our CEO, WSJ op-ed, he explains why this isn’t just alarming — it marks the start of a national security race over AI alignment.
2
0
6
RT @juddrosenblatt: If you're the "smartest 20-25 year olds with AI knowledge," you're cordially invited to drop out of your PhD and come w….
0
7
0
RT @juddrosenblatt: Turns out that Self-Other Overlap (SOO) fine-tuning drastically reduces deceptive behavior in language models—without s….
0
114
0
We organized a SXSW panel to talk about the most important problem of our time, How To Make AGI Not Kill Everyone, with our CEO @juddrosenblatt, @ammannNora, @hamandcheese, & @ESYudkowsky. Check it out if you’re at SXSW!
3
2
18