sarah @littIeramblings X Profile

sarah

@littIeramblings

Followers

3K

Following

4K

Media

409

Statuses

6K

AI worrier & houseplant enthusiast

Joined June 2012

Don't wanna be here? Send us removal request.

sarah

@littIeramblings

7 months

i have a blog now! in my first post, i summarised the safety frameworks released by openAI, anthropic and deepmind, and say a bit about what I thought of them. link below

7

26

414

sarah

@littIeramblings

5 days

pt 2

sarah

@littIeramblings

2 years

so far the most tangible result from my interpersonal AI awareness spreading campaign is my mum texting me every time they talk about it on bbc radio 4 . baby steps

2

0

34

sarah

@littIeramblings

10 days

some news! I’ve joined the comms team at @AISecurityInst . AISI is doing some extremely important things and I’m excited to help tell the world about them.

14

1

255

sarah

@littIeramblings

20 days

I have weather on the brain bc it is insanely hot and I’m dying.

0

4

sarah

@littIeramblings

20 days

what would a benevolent superintelligence do about the weather.

18

0

17

sarah

@littIeramblings

24 days

here's a link to the paper: an arXiv version should be available in a few days - if you'd like to be sent a link when it is, please let me know!.

3

0

9

sarah

@littIeramblings

24 days

RT @AmmannNora: In this piece, @littIeramblings & I argue that technological solutions can lower the barrier to meaningful international de….

0

15

0

sarah

@littIeramblings

24 days

I go into a bunch more detail about how these ideas could be operationalised and name some historical precedents in my report!. but they are still fairly high-level, and I'd love to see more people explore this topic.

2

0

13

sarah

@littIeramblings

24 days

7) A plan for automated research. AGI labs discuss using AIs to accelerate their own research. The Project should make a concrete plan about if and how it will do this. What limits should it impose on AI-enabled acceleration? What safety research could AIs help with?

1

0

10

sarah

@littIeramblings

24 days

6) Intelligence and Scenario Planning Division. I suggest designating an office that would a) collect intelligence on adversary progress towards AGI and b) make detailed scenario plans for crisis scenarios that would require high-stakes diplomacy.

1

0

7

sarah

@littIeramblings

24 days

5) A designated verification project. A simultaneous project could research technologies for verifying future agreements on AGI development (a possibility that the project should actively prepare for!)

1

0

8

sarah

@littIeramblings

24 days

4) Internal audit & risk monitoring. I propose an internal audit function and a (separate!) risk monitoring team to check whether the project is keeping risks below manageable levels.

1

0

8

sarah

@littIeramblings

24 days

3) Board oversight. A Board could be established to approve high-risk actions, like training a model at a new compute threshold or internally deploying one for a new application.

1

0

8

sarah

@littIeramblings

24 days

2) Emergency protocols. If extreme risks look imminent, how should the project react? . I suggest "top-down" and "bottom-up" protocols that could be used in an emergency.

1

0

8

sarah

@littIeramblings

24 days

1) Protected information channels.The project should establish mechanisms to communicate information about emerging and imminent risks. There are existing systems these could be modelled on, incl the CRITIC system and the dissent channels that exist in many govt departments

1

0

8

sarah

@littIeramblings

24 days

Many have asked if a Manhattan Project for AGI would be a good idea, or if one is likely. I wanted to ask the question: if one does happen, how could we make it safer?. I propose 7 "safety features" for a hypothetical AGI project under US government control:.

1

9

sarah

@littIeramblings

24 days

There's a lot of talk about a "Manhattan Project for AGI". Suppose the US government actually decided to pursue such a project. How could we make it go well? . I explored this question for the @pivotal_org fellowship 🧵. (full paper below)

3

10

122

sarah

@littIeramblings

24 days

0

5

sarah

@littIeramblings

24 days

many doubt that international cooperation on AI is feasible. I disagree!. I co-wrote a piece with @AmmannNora for @aif_media on how we can build new technologies to help verify and enforce international agreements. link in next tweet

5

8

90

sarah

@littIeramblings

25 days

😌.

0

12

sarah

@littIeramblings

25 days

Customer due diligence .

1

0

8