Allan Dafoe @AllanDafoe X Profile

Allan Dafoe

@AllanDafoe

Followers

4K

Following

4K

Media

11

Statuses

152

AGI governance: navigating the transition to beneficial AGI (Google DeepMind)

Joined August 2012

Don't wanna be here? Send us removal request.

Allan Dafoe

@AllanDafoe

4 days

Jade is such an excellent choice for this role! Great to have such a brilliant, wise, dedicated expert filling an important policy role.

Matt Clifford

@matthewclifford

4 days

Absolutely delighted about this - major upgrade on the last AI adviser! Jade brings a tonne of experience in frontier labs, VC and government and will do an amazing job of ensuring the UK is an AI winner. Excellent news.

0

2

62

Allan Dafoe

@AllanDafoe

6 months

Insightful analysis of the implications of inference scaling for AI/AGI governance. A must read.

Toby Ord

@tobyordoxford

6 months

New paper:.Inference Scaling Reshapes AI Governance.The shift from scaling up the pre-training compute of AI systems to scaling up their inference compute may have profound effects on AI governance. 🧵.1/.

0

15

Grok

@grok

8 days

Generate videos in just a few seconds. Try Grok Imagine, free for a limited time.

419

690

3K

Allan Dafoe

@AllanDafoe

6 months

Thanks Rob for a great conversation about important topics: why technology drives history, and the rare opportunity of steering it.

Rob Wiblin

@robertwiblin

6 months

My ep w @AllanDafoe (director of frontier safety & governance at DeepMind):. "Tech doesn't force us to do anything, it merely opens the door – and it's military-economic competition that forces us through." (32:25). "We're not at peak returns to generality." (1:38:06). "The

2

8

62

Allan Dafoe

@AllanDafoe

6 months

RT @_lewisho: We updated our framework to include a section addressing deceptive alignment/loss of control risks. There's much more work to….

0

1

0

Allan Dafoe

@AllanDafoe

6 months

RT @ZacKenton1: We're hiring for our Google DeepMind AGI Safety & Alignment and Gemini Safety teams. Locations: London, NYC, Mountain View,….

job-boards.greenhouse.io

0

37

0

Allan Dafoe

@AllanDafoe

7 months

Great work by @_lewisho, Celine Smith, Claudia van der Salm, @JoslynBarnhart, @rohinshah, @four, Jen Beroshi, @ancadianadragan, @ShaneLegg, Helen King, Tom Lue, and many others.

0

6

Allan Dafoe

@AllanDafoe

7 months

Many others are putting out frameworks for frontier safety, following the Seoul AI Safety Commitments. Safety scientists, policy experts, government, and industry should now identify best practice, to build standards for safe frontier AI development.

gov.uk

1

0

4

Allan Dafoe

@AllanDafoe

7 months

The effort involved close collaboration with our impressive security teams, in GDM and Google, to specify requisite levels for security mitigations which could be used for industry best-practice, and to map critical capability levels to security levels.

1

0

3

Allan Dafoe

@AllanDafoe

7 months

With v2 I'm especially grateful for the extensively developed framework on deceptive alignment risk, which I believe will be industry leading.

3

1

4

Allan Dafoe

@AllanDafoe

7 months

I'm proud of GoogleDeepMind/Google's v2 update to our Frontier Safety Framework. We were the first major tech company to produce an explicit risk management framework for extreme risks, and I'm glad we are continuing to push ahead on safety best practice.

deepmind.google

Our next iteration of the FSF sets out stronger security protocols on the path to AGI

3

18

119

Allan Dafoe

@AllanDafoe

9 months

Valued talking about our Frontier Safety Framework on the Responsible AI for Peace and Security Podcast. Thanks @BoulaninSIPRI

open.spotify.com

Podcast · UNODA · Produced by the United Nations Office for Disarmament Affairs (UNODA) and the Stockholm International Peace Research Institute (SIPRI), the responsible AI for Peace podcast explores...

0

1

12

Allan Dafoe

@AllanDafoe

10 months

Wonderful to see the benefits of AI for science being recognized with a(nother) Nobel! Congrats to Demis, John, David, and to all working to unlock AI for science.

The Nobel Prize

@NobelPrize

10 months

BREAKING NEWS.The Royal Swedish Academy of Sciences has decided to award the 2024 #NobelPrize in Chemistry with one half to David Baker “for computational protein design” and the other half jointly to Demis Hassabis and John M. Jumper “for protein structure prediction.”

0

2

35

Allan Dafoe

@AllanDafoe

11 months

Seniority: we are open to hiring at virtually any seniority level. Our priority is to find excellent, driven, people who can help GDM and the world make sense of and guide frontier AI. Please apply! OR contact-frontier-safety-governance@google.com.

job-boards.greenhouse.io

0

11

Allan Dafoe

@AllanDafoe

11 months

Geography: we have a preference for people who can join in London, though we also have hubs in NYC and the Bay Area.

1

0

5

Allan Dafoe

@AllanDafoe

11 months

How does a model's architecture, training compute, agent affordances relate to risk and appropriate guardrails? How to systematically forecast dangerous capabilities and AGI? Principled risk assessment for open weight models? Geo-strategic aspects of different kinds of compute?.

1

0

3

Allan Dafoe

@AllanDafoe

11 months

Generalist technical governance: many core issues lay in the intersection of technical ML/safety and governance, for which we would look for an agile thinker and generalist, comfortable with technical topics, enthusiasm to advance understanding. For example:.

1

0

1

Allan Dafoe

@AllanDafoe

11 months

Agent governance and safety: (semi-)autonomous agents are coming, how should they be deployed and governed? How should society manage levels of autonomy and delegated authority? How to get benefits of personalization and generality, while maintaining privacy, security, safety?.

1

0

2

Allan Dafoe

@AllanDafoe

11 months

Global and industry governance: updating global institutions and industry governance for powerful, transformative and socially beneficial AI; bringing (technical) frontier safety advice to stakeholder conversations; safety norms through FMF, AI summits, etc.

1

0

3

Allan Dafoe

@AllanDafoe

11 months

Geopolitics and AGI efforts: the path to AGI is increasingly shaped by geopolitical forces, and could see major public and private sector efforts. How will these forces evolve? Can we blueprint such an effort to be competent, safe, democracy-enhancing, globally beneficial?.

2

0

6

Allan Dafoe

@AllanDafoe

11 months

Forecasting powerful AI and AGI: work could include ML expert interviews, analyzing commissioned superforecasters, Epoch-style trend extrapolation, identification of strategic cruxes, economic analysis, decomposition and targeted forecasting of milestones to powerful AI.

1

10