AllanDafoe Profile Banner
Allan Dafoe Profile
Allan Dafoe

@AllanDafoe

Followers
4K
Following
4K
Media
11
Statuses
152

AGI governance: navigating the transition to beneficial AGI (Google DeepMind)

Joined August 2012
Don't wanna be here? Send us removal request.
@AllanDafoe
Allan Dafoe
4 days
Jade is such an excellent choice for this role! Great to have such a brilliant, wise, dedicated expert filling an important policy role.
@matthewclifford
Matt Clifford
4 days
Absolutely delighted about this - major upgrade on the last AI adviser! Jade brings a tonne of experience in frontier labs, VC and government and will do an amazing job of ensuring the UK is an AI winner. Excellent news.
Tweet media one
0
2
62
@AllanDafoe
Allan Dafoe
6 months
Insightful analysis of the implications of inference scaling for AI/AGI governance. A must read.
@tobyordoxford
Toby Ord
6 months
New paper:.Inference Scaling Reshapes AI Governance.The shift from scaling up the pre-training compute of AI systems to scaling up their inference compute may have profound effects on AI governance. đź§µ.1/.
0
0
15
@grok
Grok
8 days
Generate videos in just a few seconds. Try Grok Imagine, free for a limited time.
419
690
3K
@AllanDafoe
Allan Dafoe
6 months
Thanks Rob for a great conversation about important topics: why technology drives history, and the rare opportunity of steering it.
@robertwiblin
Rob Wiblin
6 months
My ep w @AllanDafoe (director of frontier safety & governance at DeepMind):. "Tech doesn't force us to do anything, it merely opens the door – and it's military-economic competition that forces us through." (32:25). "We're not at peak returns to generality." (1:38:06). "The
2
8
62
@AllanDafoe
Allan Dafoe
6 months
RT @_lewisho: We updated our framework to include a section addressing deceptive alignment/loss of control risks. There's much more work to….
0
1
0
@AllanDafoe
Allan Dafoe
6 months
RT @ZacKenton1: We're hiring for our Google DeepMind AGI Safety & Alignment and Gemini Safety teams. Locations: London, NYC, Mountain View,….
job-boards.greenhouse.io
0
37
0
@AllanDafoe
Allan Dafoe
7 months
Great work by @_lewisho, Celine Smith, Claudia van der Salm, @JoslynBarnhart, @rohinshah, @four, Jen Beroshi, @ancadianadragan, @ShaneLegg, Helen King, Tom Lue, and many others.
Tweet media one
0
0
6
@AllanDafoe
Allan Dafoe
7 months
Many others are putting out frameworks for frontier safety, following the Seoul AI Safety Commitments. Safety scientists, policy experts, government, and industry should now identify best practice, to build standards for safe frontier AI development.
Tweet card summary image
gov.uk
1
0
4
@AllanDafoe
Allan Dafoe
7 months
The effort involved close collaboration with our impressive security teams, in GDM and Google, to specify requisite levels for security mitigations which could be used for industry best-practice, and to map critical capability levels to security levels.
1
0
3
@AllanDafoe
Allan Dafoe
7 months
With v2 I'm especially grateful for the extensively developed framework on deceptive alignment risk, which I believe will be industry leading.
3
1
4
@AllanDafoe
Allan Dafoe
7 months
I'm proud of GoogleDeepMind/Google's v2 update to our Frontier Safety Framework. We were the first major tech company to produce an explicit risk management framework for extreme risks, and I'm glad we are continuing to push ahead on safety best practice.
Tweet card summary image
deepmind.google
Our next iteration of the FSF sets out stronger security protocols on the path to AGI
3
18
119
@AllanDafoe
Allan Dafoe
10 months
Wonderful to see the benefits of AI for science being recognized with a(nother) Nobel! Congrats to Demis, John, David, and to all working to unlock AI for science.
@NobelPrize
The Nobel Prize
10 months
BREAKING NEWS.The Royal Swedish Academy of Sciences has decided to award the 2024 #NobelPrize in Chemistry with one half to David Baker “for computational protein design” and the other half jointly to Demis Hassabis and John M. Jumper “for protein structure prediction.”
Tweet media one
0
2
35
@AllanDafoe
Allan Dafoe
11 months
Seniority: we are open to hiring at virtually any seniority level. Our priority is to find excellent, driven, people who can help GDM and the world make sense of and guide frontier AI. Please apply! OR contact-frontier-safety-governance@google.com.
job-boards.greenhouse.io
0
0
11
@AllanDafoe
Allan Dafoe
11 months
Geography: we have a preference for people who can join in London, though we also have hubs in NYC and the Bay Area.
1
0
5
@AllanDafoe
Allan Dafoe
11 months
How does a model's architecture, training compute, agent affordances relate to risk and appropriate guardrails? How to systematically forecast dangerous capabilities and AGI? Principled risk assessment for open weight models? Geo-strategic aspects of different kinds of compute?.
1
0
3
@AllanDafoe
Allan Dafoe
11 months
Generalist technical governance: many core issues lay in the intersection of technical ML/safety and governance, for which we would look for an agile thinker and generalist, comfortable with technical topics, enthusiasm to advance understanding. For example:.
1
0
1
@AllanDafoe
Allan Dafoe
11 months
Agent governance and safety: (semi-)autonomous agents are coming, how should they be deployed and governed? How should society manage levels of autonomy and delegated authority? How to get benefits of personalization and generality, while maintaining privacy, security, safety?.
1
0
2
@AllanDafoe
Allan Dafoe
11 months
Global and industry governance: updating global institutions and industry governance for powerful, transformative and socially beneficial AI; bringing (technical) frontier safety advice to stakeholder conversations; safety norms through FMF, AI summits, etc.
1
0
3
@AllanDafoe
Allan Dafoe
11 months
Geopolitics and AGI efforts: the path to AGI is increasingly shaped by geopolitical forces, and could see major public and private sector efforts. How will these forces evolve? Can we blueprint such an effort to be competent, safe, democracy-enhancing, globally beneficial?.
2
0
6
@AllanDafoe
Allan Dafoe
11 months
Forecasting powerful AI and AGI: work could include ML expert interviews, analyzing commissioned superforecasters, Epoch-style trend extrapolation, identification of strategic cruxes, economic analysis, decomposition and targeted forecasting of milestones to powerful AI.
Tweet media one
1
1
10