Ryan Lowe 🥞 @ICML @ryan_t_lowe X Profile

Ryan Lowe 🥞 @ICML

@ryan_t_lowe

Followers

6K

Following

1K

Media

43

Statuses

662

full-stack alignment 🥞 @meaningaligned prev: InstructGPT @OpenAI 🦋 @ ryantlowe

Berkeley, CA

Joined May 2009

Don't wanna be here? Send us removal request.

Ryan Lowe 🥞 @ICML

@ryan_t_lowe

2 days

Introducing: Full-Stack Alignment 🥞. A research program dedicated to co-aligning AI systems *and* institutions with what people value. It's the most ambitious project I've ever undertaken. Here's what we're doing: 🧵

10

41

176

Ryan Lowe 🥞 @ICML

@ryan_t_lowe

1 day

RT @PhilippKoralus: Excited for the launch of the position paper that resulted from our Oxford HAI Lab 2025 Thick Models of Choice workshop….

0

7

0

Ryan Lowe 🥞 @ICML

@ryan_t_lowe

1 day

RT @xuanalogue: Ever since I started thinking seriously about AI value alignment in 2016-7, I've been frustrated by the inadequacy of utili….

0

19

0

Ryan Lowe 🥞 @ICML

@ryan_t_lowe

1 day

RT @RyanOthKearns: It was terrifically energising to work on this position paper. Floored by the ambition and optimism coming out of the @m….

0

2

0

Ryan Lowe 🥞 @ICML

@ryan_t_lowe

1 day

I expect @j_foerst will do some of the best FSA-relevant research around, particularly on "win-win AI negotiation". if you're about to do a PhD strongly consider joining him at @FLAIR_Ox !!.

Jakob Foerster

@j_foerst

1 day

The term "AI alignment" is often used without specifying "to whom?" and much of the work on AI alignment in practice looks more like "AI controllability" without answering "who controls the controller?" (i.e. user or operator). One key challenge is that alignment is fundamentally.

0

7

Ryan Lowe 🥞 @ICML

@ryan_t_lowe

1 day

RT @Dr_Atoosa: Excited to be a contributor to full-stack alignment (FSA) ⭐️ you can read our position paper about the conceptual foundation….

0

4

0

Ryan Lowe 🥞 @ICML

@ryan_t_lowe

2 days

RT @DefenderOfBasic: all of the AI alignment efforts are obviously guaranteed to fail because they're trying to do it in isolation, except….

0

4

0

Ryan Lowe 🥞 @ICML

@ryan_t_lowe

2 days

I guess now is also a good time to announce that I've officially joined @meaningaligned!!. I'll be working on field building for full-stack alignment -- helping nurture this effort into a research community with excellent vibes that gets shit done . weeeeeeeeeee 🚀🚀.

Ryan Lowe 🥞 @ICML

@ryan_t_lowe

2 days

Introducing: Full-Stack Alignment 🥞. A research program dedicated to co-aligning AI systems *and* institutions with what people value. It's the most ambitious project I've ever undertaken. Here's what we're doing: 🧵

2

3

54

Ryan Lowe 🥞 @ICML

@ryan_t_lowe

2 days

RT @klingefjord: Extremely honored to be working on this project alongside a series of amazing researchers!!. This research program is our….

0

3

0

Ryan Lowe 🥞 @ICML

@ryan_t_lowe

2 days

RT @IasonGabriel: Check out this great new initiative + paper led by @ryan_t_lowe, @edelwax, @xuanalogue, @klingefjord & the fine folks @me….

0

9

0

Ryan Lowe 🥞 @ICML

@ryan_t_lowe

2 days

RT @edelwax: In 2017, I was working to change FB News Feed's recommender to use “thick models of value” (per the paper we just released). @….

0

6

0

Ryan Lowe 🥞 @ICML

@ryan_t_lowe

2 days

RT @sydneymlevine: Excited to be part of this exciting vision!.

0

3

0

Ryan Lowe 🥞 @ICML

@ryan_t_lowe

2 days

If you're excited about these ideas, drop me a line!! We're looking for researchers to collaborate with -- send an email to research@meaningalignment.org. It's gonna be fun. ✌️.

5

1

32

Ryan Lowe 🥞 @ICML

@ryan_t_lowe

2 days

This is a huge project. We'll need lots of help. But if we succeed, the future could be more beautiful than we can possibly imagine today.

1

0

20

Ryan Lowe 🥞 @ICML

@ryan_t_lowe

2 days

Examples of TMV include: resource-rational contractualism by @sydneymlevine et al, self-other overlap by.@MarcCarauleanu et al, and our previous work on moral graph elicitation. It's an emerging field, but we think early research is very promising!!.

1

0

16

Ryan Lowe 🥞 @ICML

@ryan_t_lowe

2 days

Instead, we call for a new paradigm — "Thick models of value" (TMV). TMV is a broad class of structured approaches to modeling values and norms that:. 1. are more robust against distortions.2. have better treatment of collective values and norms.3. have better generalization

1

0

16

Ryan Lowe 🥞 @ICML

@ryan_t_lowe

2 days

It's like programming in a completely dynamic language without any type system. Kinda sketch if we design all of our institutions around unstructured text and hope LLMs interpret them in the way we intended.

2

0

11

Ryan Lowe 🥞 @ICML

@ryan_t_lowe

2 days

In principle, unstructured text could be an improvement; after all, language is how humans naturally express values. But this lack of internal structure becomes a critical weakness when we need *reliability* across contexts and institutions.

1

0

12

Ryan Lowe 🥞 @ICML

@ryan_t_lowe

2 days

In practice this looks like: a desire for "meaningful connection" becomes "engagement metrics" to recommender systems, which becomes "daily active users" to companies, and "quarterly revenue" in markets.

1

0

11

Ryan Lowe 🥞 @ICML

@ryan_t_lowe

2 days

PMV in particular (the dominant paradigm in microeconomics, game theory, mechanism design, social choice theory, etc) fails to capture the richness of human motivation, because preferences bundle all kinds of signals into a flattened ordering.

1

0

13

Ryan Lowe 🥞 @ICML

@ryan_t_lowe

2 days

Current approaches tend to fall into what we call "Preferentist models of Value" (PMV), or "Values-as-text" (VAT). Both have issues preserving the richness of what people care about, as value information propagates up the "societal stack"

1

0

13