Ryan Lowe 🥞 @ICML

@ryan_t_lowe

Followers
6K
Following
1K
Media
43
Statuses
662

full-stack alignment 🥞 @meaningaligned prev: InstructGPT @OpenAI 🦋 @ ryantlowe

Berkeley, CA
Joined May 2009
@ryan_t_lowe
Ryan Lowe 🥞 @ICML
2 days
Introducing: Full-Stack Alignment 🥞. A research program dedicated to co-aligning AI systems *and* institutions with what people value. It's the most ambitious project I've ever undertaken. Here's what we're doing: 🧵
10
41
176
@ryan_t_lowe
Ryan Lowe 🥞 @ICML
1 day
RT @PhilippKoralus: Excited for the launch of the position paper that resulted from our Oxford HAI Lab 2025 Thick Models of Choice workshop….
0
7
0
@ryan_t_lowe
Ryan Lowe 🥞 @ICML
1 day
RT @xuanalogue: Ever since I started thinking seriously about AI value alignment in 2016-7, I've been frustrated by the inadequacy of utili….
0
19
0
@ryan_t_lowe
Ryan Lowe 🥞 @ICML
1 day
RT @RyanOthKearns: It was terrifically energising to work on this position paper. Floored by the ambition and optimism coming out of the @m….
0
2
0
@ryan_t_lowe
Ryan Lowe 🥞 @ICML
1 day
I expect @j_foerst will do some of the best FSA-relevant research around, particularly on "win-win AI negotiation". If you're about to do a PhD, strongly consider joining him at @FLAIR_Ox!!
@j_foerst
Jakob Foerster
1 day
The term "AI alignment" is often used without specifying "to whom?" and much of the work on AI alignment in practice looks more like "AI controllability" without answering "who controls the controller?" (i.e. user or operator). One key challenge is that alignment is fundamentally.
0
0
7
@ryan_t_lowe
Ryan Lowe 🥞 @ICML
1 day
RT @Dr_Atoosa: Excited to be a contributor to full-stack alignment (FSA) ⭐️ you can read our position paper about the conceptual foundation….
0
4
0
@ryan_t_lowe
Ryan Lowe 🥞 @ICML
2 days
RT @DefenderOfBasic: all of the AI alignment efforts are obviously guaranteed to fail because they're trying to do it in isolation, except….
0
4
0
@ryan_t_lowe
Ryan Lowe 🥞 @ICML
2 days
I guess now is also a good time to announce that I've officially joined @meaningaligned!! I'll be working on field building for full-stack alignment -- helping nurture this effort into a research community with excellent vibes that gets shit done. weeeeeeeeeee 🚀🚀
@ryan_t_lowe
Ryan Lowe 🥞 @ICML
2 days
Introducing: Full-Stack Alignment 🥞. A research program dedicated to co-aligning AI systems *and* institutions with what people value. It's the most ambitious project I've ever undertaken. Here's what we're doing: 🧵
2
3
54
@ryan_t_lowe
Ryan Lowe 🥞 @ICML
2 days
RT @klingefjord: Extremely honored to be working on this project alongside a series of amazing researchers!! This research program is our….
0
3
0
@ryan_t_lowe
Ryan Lowe 🥞 @ICML
2 days
RT @IasonGabriel: Check out this great new initiative + paper led by @ryan_t_lowe, @edelwax, @xuanalogue, @klingefjord & the fine folks @me….
0
9
0
@ryan_t_lowe
Ryan Lowe 🥞 @ICML
2 days
RT @edelwax: In 2017, I was working to change FB News Feed's recommender to use “thick models of value” (per the paper we just released). @….
0
6
0
@ryan_t_lowe
Ryan Lowe 🥞 @ICML
2 days
RT @sydneymlevine: Excited to be part of this exciting vision!
0
3
0
@ryan_t_lowe
Ryan Lowe 🥞 @ICML
2 days
If you're excited about these ideas, drop me a line!! We're looking for researchers to collaborate with -- send an email to research@meaningalignment.org. It's gonna be fun. ✌️
5
1
32
@ryan_t_lowe
Ryan Lowe 🥞 @ICML
2 days
This is a huge project. We'll need lots of help. But if we succeed, the future could be more beautiful than we can possibly imagine today.
1
0
20
@ryan_t_lowe
Ryan Lowe 🥞 @ICML
2 days
Examples of TMV include: resource-rational contractualism by @sydneymlevine et al, self-other overlap by @MarcCarauleanu et al, and our previous work on moral graph elicitation. It's an emerging field, but we think early research is very promising!!
1
0
16
@ryan_t_lowe
Ryan Lowe 🥞 @ICML
2 days
Instead, we call for a new paradigm: "Thick models of value" (TMV). TMV is a broad class of structured approaches to modeling values and norms that:
1. are more robust against distortions
2. have better treatment of collective values and norms
3. have better generalization
1
0
16
@ryan_t_lowe
Ryan Lowe 🥞 @ICML
2 days
It's like programming in a completely dynamic language without any type system. Kinda sketch if we design all of our institutions around unstructured text and hope LLMs interpret them in the way we intended.
2
0
11
@ryan_t_lowe
Ryan Lowe 🥞 @ICML
2 days
In principle, unstructured text could be an improvement; after all, language is how humans naturally express values. But this lack of internal structure becomes a critical weakness when we need *reliability* across contexts and institutions.
1
0
12
@ryan_t_lowe
Ryan Lowe 🥞 @ICML
2 days
In practice this looks like: a desire for "meaningful connection" becomes "engagement metrics" to recommender systems, which becomes "daily active users" to companies, and "quarterly revenue" in markets.
1
0
11
@ryan_t_lowe
Ryan Lowe 🥞 @ICML
2 days
PMV in particular (the dominant paradigm in microeconomics, game theory, mechanism design, social choice theory, etc) fails to capture the richness of human motivation, because preferences bundle all kinds of signals into a flattened ordering.
1
0
13
@ryan_t_lowe
Ryan Lowe 🥞 @ICML
2 days
Current approaches tend to fall into what we call "Preferentist models of Value" (PMV), or "Values-as-text" (VAT). Both have issues preserving the richness of what people care about, as value information propagates up the "societal stack"
1
0
13