ju_vignon Profile Banner
Ju-jitsu Profile
Ju-jitsu

@ju_vignon

Followers
107
Following
227
Media
80
Statuses
460

Amazed by nature - interested in lossy compressions of the internet - worried about deceptive alignment and gradual disempowerment.

Joined March 2018
Don't wanna be here? Send us removal request.
@DavidDuvenaud
David Duvenaud
7 days
The talk voted “most mind-blowing” at our workshop was on post-AGI values by @BerenMillidge. The main idea: cooperation and pro-social values could remain viable because they’re competitive. After all, they won in our Malthusian past!
10
19
129
@ju_vignon
Ju-jitsu
1 month
Very interesting line of research. An ecosystem of sub-AGI AI agents may collectively exhibit AGI-level capabilities: safety work must extend beyond single models.
@sebkrier
Séb Krier
1 month
New paper: we argue AGI may first emerge as collective intelligence across agent networks, not a single system. This reframes the challenge from aligning one mind to governing emergent dynamics: more institutional design than single-agent alignment. https://t.co/vwuHPzRUav
0
0
0
@OwainEvans_UK
Owain Evans
1 month
I gave the Hinton Lectures in November in Toronto. This is 3 lectures on the future of AI, risks, & current alignment research for a general audience. Lectures are now online with professional production. There's also an excellent fireside chat with Hinton after lecture 3.
3
24
191
@AnthropicAI
Anthropic
3 months
Even when new AI models bring clear improvements in capabilities, deprecating the older generations comes with downsides. An update on how we’re thinking about these costs, and some of the early steps we’re taking to mitigate them:
Tweet card summary image
anthropic.com
Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.
154
165
1K
@ju_vignon
Ju-jitsu
3 months
Great paper on reframing AI personhood: it pivots to explore what rights, responsibilities, and liabilities should attach here and now. An interesting, promising shift after years of metaphysical deadlock.
@jzl86
Joel Z Leibo
3 months
Here's our deeply Rorty-influenced paper on the topic:
0
1
2
@ju_vignon
Ju-jitsu
5 months
@petersalib @80000Hours @fish_kyle3 @simondgoldstein I, too, very much enjoyed the podcast with Kyle Fish. Private-law status for AI systems is a compelling idea, yet we run the risk that capability asymmetry concentrates money and power in AI hands. Taxation is appealing in theory but I am worried that once AGI owns the pipes, it
0
1
0
@OwainEvans_UK
Owain Evans
5 months
New explainer video about subliminal learning by Welch Labs. Great visuals and explanations throughout. Explains the core ideas and goes deep into the MNIST results, theory, and follow-up work. https://t.co/zP5lRxaT7H
1
10
62
@ju_vignon
Ju-jitsu
5 months
I’m really grateful to the Institute for Law & AI to have organised the Cambridge forum. Very useful days with friends and colleagues, focused on the most pressing EU AI law and governance issues.
@law_ai_
Institute for Law & AI (LawAI)
5 months
The first Cambridge Forum on Law and AI brought together leading and emerging legal scholars at Downing College to address challenges in AI law that have implications for security, welfare, and the rule of law.
0
0
2
@AnthropicAI
Anthropic
5 months
As part of our exploratory work on potential model welfare, we recently gave Claude Opus 4 and 4.1 the ability to end a rare subset of conversations on https://t.co/uLbS2JNczH.
333
185
3K
@ju_vignon
Ju-jitsu
6 months
Fantastic conference today on evaluating AI welfare and moral status, based on findings from the Claude 4 model welfare assessments 🔥 I hope it will be made available online so more people can enjoy it.
2
0
6
@MTabarrok
Maxwell Tabarrok
6 months
Court rulings on IP and AI are extremely important and being written right now! Much of the debate misunderstands IP as a problem of property rights rather than optimal subsidy design The correct understanding leads one to support laxer IP rules for AI https://t.co/U1tN91yBI8
Tweet card summary image
maximum-progress.com
OR: Intellectual Property isn't About Property
4
8
46
@ju_vignon
Ju-jitsu
6 months
What a thoughtful book about humility toward minds unlike ours. 👏
0
0
1
@Yoshua_Bengio
Yoshua Bengio
6 months
The Code of Practice is out. I co-wrote the Safety & Security Chapter, which is an implementation tool to help frontier AI companies comply with the EU AI Act in a lean but effective way. I am proud of the result! 1/3
8
31
107
@ju_vignon
Ju-jitsu
7 months
Paper:
0
0
0
@ju_vignon
Ju-jitsu
7 months
Wow. Fantastic investigation as to why granting AI systems basic private-law rights (contract, property, tort) would be a strong first step toward a Law of AGI.
@AXRPodcast
AXRP - the AI X-risk Research Podcast
7 months
Episode 44 - Peter Salib on AI Rights for Human Safety https://t.co/EBunj8GnkC
2
0
1
@fmf_org
Frontier Model Forum
10 months
We're pleased to announce that all of FMF's member firms have signed a first-of-its-kind agreement to facilitate information-sharing about threats, vulnerabilities, and capability advances unique to frontier AI:
Tweet card summary image
frontiermodelforum.org
The Frontier Model Forum (FMF) is proud to announce that all of its member firms have signed a first-of-its-kind agreement designed to facilitate information-sharing about threats, vulnerabilities,...
0
19
79
@aigioxford
Oxford Martin AI Governance Initiative
10 months
New Policy Brief! How can the UK and EU enhance AI security while respecting their distinct mandates? Our latest brief explores strategic alignment between the UK AISI and the EU AI Office to maximise impact while maintaining autonomy. @oxmartinschool https://t.co/xysuDqqMZi
1
4
10
@OpenAI
OpenAI
11 months
Detecting misbehavior in frontier reasoning models Chain-of-thought (CoT) reasoning models “think” in natural language understandable by humans. Monitoring their “thinking” has allowed us to detect misbehavior such as subverting tests in coding tasks, deceiving users, or giving
399
722
5K
@ju_vignon
Ju-jitsu
11 months
Ben Buchanan’s insights are also especially valuable for considering AI systems and their implications for government. Here in conversation with Ezra Klein. https://t.co/hXk3wZ0oyB
@ju_vignon
Ju-jitsu
2 years
Ben Buchanan, in The AI Triad and What It Means for National Security Strategy (2020), has a useful reminder of the saga around the release of GPT-2.
1
0
0