Ju-jitsu
@ju_vignon
Followers: 107 · Following: 227 · Media: 80 · Statuses: 460
Amazed by nature - interested in lossy compressions of the internet - worried about deceptive alignment and gradual disempowerment.
Joined March 2018
The talk voted “most mind-blowing” at our workshop was on post-AGI values by @BerenMillidge. The main idea: cooperation and pro-social values could remain viable because they’re competitive. After all, they won in our Malthusian past!
Very interesting line of research. An ecosystem of sub-AGI AI agents may collectively exhibit AGI-level capabilities: safety work must extend beyond single models.
New paper: we argue AGI may first emerge as collective intelligence across agent networks, not a single system. This reframes the challenge from aligning one mind to governing emergent dynamics: more institutional design than single-agent alignment. https://t.co/vwuHPzRUav
I gave the Hinton Lectures in Toronto in November: a series of three lectures for a general audience on the future of AI, its risks, and current alignment research. The lectures are now online with professional production. There's also an excellent fireside chat with Hinton after lecture 3.
Even when new AI models bring clear improvements in capabilities, deprecating the older generations comes with downsides. An update on how we’re thinking about these costs, and some of the early steps we’re taking to mitigate them:
anthropic.com
Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.
@petersalib @80000Hours @fish_kyle3 @simondgoldstein I, too, very much enjoyed the podcast with Kyle Fish. Private-law status for AI systems is a compelling idea, yet we run the risk that capability asymmetry concentrates money and power in AI hands. Taxation is appealing in theory, but I am worried that once AGI owns the pipes, it
New explainer video about subliminal learning by Welch Labs. Great visuals and explanations throughout. Explains the core ideas and goes deep into the MNIST results, theory, and follow-up work. https://t.co/zP5lRxaT7H
I’m really grateful to the Institute for Law & AI for organising the Cambridge Forum. Very useful days with friends and colleagues, focused on the most pressing EU AI law and governance issues.
The first Cambridge Forum on Law and AI brought together leading and emerging legal scholars at Downing College to address challenges in AI law that have implications for security, welfare, and the rule of law.
As part of our exploratory work on potential model welfare, we recently gave Claude Opus 4 and 4.1 the ability to end a rare subset of conversations on https://t.co/uLbS2JNczH.
Fantastic conference today on evaluating AI welfare and moral status, based on findings from the Claude 4 model welfare assessments 🔥 I hope it will be made available online so more people can enjoy it.
Court rulings on IP and AI are extremely important, and they are being written right now! Much of the debate misunderstands IP as a problem of property rights rather than one of optimal subsidy design. The correct understanding leads one to support laxer IP rules for AI https://t.co/U1tN91yBI8
maximum-progress.com
OR: Intellectual Property isn't About Property
What a thoughtful book about humility toward minds unlike ours. 👏
The Code of Practice is out. I co-wrote the Safety & Security Chapter, which is an implementation tool to help frontier AI companies comply with the EU AI Act in a lean but effective way. I am proud of the result! 1/3
Wow. A fantastic investigation into why granting AI systems basic private-law rights (contract, property, tort) would be a strong first step toward a Law of AGI.
Episode 44 - Peter Salib on AI Rights for Human Safety https://t.co/EBunj8GnkC
We're pleased to announce that all of FMF's member firms have signed a first-of-its-kind agreement to facilitate information-sharing about threats, vulnerabilities, and capability advances unique to frontier AI:
frontiermodelforum.org
The Frontier Model Forum (FMF) is proud to announce that all of its member firms have signed a first-of-its-kind agreement designed to facilitate information-sharing about threats, vulnerabilities,...
New Policy Brief! How can the UK and EU enhance AI security while respecting their distinct mandates? Our latest brief explores strategic alignment between the UK AISI and the EU AI Office to maximise impact while maintaining autonomy. @oxmartinschool
https://t.co/xysuDqqMZi
Detecting misbehavior in frontier reasoning models

Chain-of-thought (CoT) reasoning models “think” in natural language understandable by humans. Monitoring their “thinking” has allowed us to detect misbehavior such as subverting tests in coding tasks, deceiving users, or giving
Ben Buchanan’s insights are especially valuable for thinking about AI systems and their implications for government. Here he is in conversation with Ezra Klein. https://t.co/hXk3wZ0oyB
Ben Buchanan, in The AI Triad and What It Means for National Security Strategy (2020), has a useful reminder of the saga around the release of GPT-2.