An *exclusive* from me about a new startup,
@synth_labs
, which is working to help companies get their AI systems to do what they want (and avoid doing what they don't want). via
@technology
Waited a long time to tweet this 😅 but I'm happy to announce that I am a recipient of the
@StabilityAI
PhD fellowship that'll be funding my studies at
@BrownCSDept
until 2026.
Excited to announce that I've founded a new company! It's dedicated to solving the AI alignment challenge: ensuring AIs are robustly aligned with human intentions & values.
Grateful to have
@NathanThinks
@fdesouza
as cofounders & Seed funding from
@M12vc
&
As of today I have left my role at Stability and Carper. I wish them the best of luck, and I am excited to join some of my closest friends at
@AiEleuther
alongside
@BlancheMinerva
:)
Recent advances with language models have been powered by Reinforcement Learning with Human Feedback (RLHF).
At Carper, we're developing production-ready open-source RLHF tools. Blog post by
@natolambert
, myself,
@lvwerra
and
@Dahoas1
Thanks for the retweet! Are you interested in doing RLHF for story generation but using no human annotation data? Our research direction at Carper has got you covered! We explore the limitations of CLIP-like story critic models ala CARP and show various
.
@carperai
just passed a major milestone a few days ago: it's now a whole year old. I thought we could take this time to recap the achievements we've accomplished over the last year, as well as discuss potential future directions of the lab. A thread 🧵1/N
We're hiring for all roles.
Open science stuff we're working on:
1) RLAIF for pretraining (we're making open source datasets).
2) benchmarks benchmarks benchmarks.
3) collaborating with
@AiEleuther
on some awesome projects.
Work with us.
@geoffreyhinton
Yes, if it can't explain how it works, then how do you know who is to be held liable if something goes wrong? How do you know if its reasoning was flawless but it just so happened to be unlucky circumstances?
doing an open source RLHF/mech interp meet up. Capacity is a bit limited. I'll be accepting people with substantial open source work in this space first and foremost.
Releasing the demo notebook!
You can try CARP-L here, just click "Run all"
You can edit the stories and the critiques. It'll perform prompt softening under the hood using Pegasus.
Cut the CARP: Fishing for zero-shot story evaluation
abs:
a scalable, efficient method for performing qualitatively superior, zero-shot evaluation of stories, and a new corpus composed of 1.3M aligned story-critique pairs derived from over 80,000 stories
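A rough sketch of the CLIP-style contrastive scoring idea the abstract describes (the toy hash-based encoder, embedding size, and function names here are my own stand-ins, not CARP's actual model):

```python
import numpy as np
import zlib

def embed(texts, dim=8):
    # Toy stand-in for CARP's learned story/critique encoders: each text
    # maps deterministically to a pseudo-random unit vector. A real system
    # would use trained transformer encoders instead.
    vecs = []
    for t in texts:
        r = np.random.default_rng(zlib.crc32(t.encode()))
        v = r.normal(size=dim)
        vecs.append(v / np.linalg.norm(v))
    return np.stack(vecs)

def critique_scores(story, critiques):
    # CLIP-style zero-shot scoring: cosine similarity between the story
    # embedding and each candidate critique embedding, softmax-normalized.
    s = embed([story])[0]
    c = embed(critiques)
    sims = c @ s  # rows are unit-norm, so this is cosine similarity
    exp = np.exp(sims - sims.max())
    return exp / exp.sum()

story = "The knight hesitated at the gate, remembering her brother."
critiques = [
    "The pacing drags in the middle.",
    "Strong emotional interiority in the protagonist.",
    "Dialogue feels stilted.",
]
probs = critique_scores(story, critiques)
print(probs)
```

With trained encoders, the highest-probability critique would be the one that best matches the story; here the toy encoder only demonstrates the shape of the computation.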
Zero shot semantic style transfer for editing faces. Original, Spock, James T. Kirk designed by Bauhaus, Low poly Hikaru Sulu.
Edits are performed using a CLIP-guided system with L-grammars and a Faces VQGAN.
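The CLIP-guided editing loop can be sketched with toy linear stand-ins (the random "decoder", "encoder", and dimensions below are my assumptions purely for illustration, not the actual CLIP or VQGAN models): gradient ascent on a latent so the decoded image's embedding aligns with a target text embedding.

```python
import numpy as np

rng = np.random.default_rng(42)
latent_dim, image_dim, embed_dim = 16, 64, 32

# Toy linear stand-ins (assumptions, not the real models):
decoder = rng.normal(size=(image_dim, latent_dim))  # "VQGAN decoder"
encoder = rng.normal(size=(embed_dim, image_dim))   # "CLIP image encoder"
text_emb = rng.normal(size=embed_dim)               # "CLIP text embedding"

A = encoder @ decoder  # latent -> image embedding, composed once
z = rng.normal(size=latent_dim)

def cosine(z):
    e = A @ z
    return e @ text_emb / (np.linalg.norm(e) * np.linalg.norm(text_emb))

start = cosine(z)
for _ in range(200):
    e = A @ z
    ne, nt = np.linalg.norm(e), np.linalg.norm(text_emb)
    # Analytic gradient of cosine similarity w.r.t. e, pulled back to z.
    grad_e = text_emb / (ne * nt) - e * (e @ text_emb) / (ne**3 * nt)
    z += 0.1 * A.T @ grad_e  # gradient ascent on "CLIP" similarity
print(start, cosine(z))
```

In the real pipeline the decoder and encoder are deep networks and the gradient comes from autodiff, but the optimization loop has the same shape.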
Shame on
@WriteSonic
for using Stable Diffusion on their website while (illegally) not mentioning the RAIL license or crediting
@StabilityAI
. I really hope you guys didn't try to pass that model off as your own to VCs ;)
Thanks to everyone who came out to the third New England RLHF Hackers hackathon. We're going to be doing the next RLHF hackathon at NeurIPS 2023. An exact time and date have yet to be decided, but you can join the Discord for more information.
CARP seq2seq (based on GPT-J) is on The Eye. You provide stories and it critiques them. CARP eval, for automated story evaluation, will be out within the coming weeks.
License is MIT. Paper soon.
@colinraffel
“Oh cool some hard math! The work must be right then” it’s so hard to get feedback on my mathematics if no one in my field fully understands what I’m actually doing! Everyone gets it at an intuitive level sure, but I still don’t know if I’m using the right abstractions.
In an effort to mimic the success of
@KordingLab
's recent neuro event, I decided to host my own pure math in deep learning event. Here is the calendar event URL. Please RSVP if you intend to attend. The meeting is tomorrow night (Tuesday) at 10PM EST.
@arankomatsuzaki
Absolutely shameless plug:
I wrote about this exact relationship back in April. I think my theory took it in a different direction than theirs as I borrowed more deeply from the cognitive neuroscience side of things. I should contact the authors.
Trying to figure out interest, who would be interested in a
@StabilityAI
launch event in Boston on October 16th? A dinner party in Cambridge, hosted by yours truly.
@omarsar0
Incredibly misleading. We have no idea how ChatGPT was trained, or whether it was based on a Chinchilla-like approach. Similarly, it's only 15x faster because inference fits on a single GPU, not because of anything the repository does. This is a thin DS wrapper around a naive PPO implementation.
Anyone have a paper to cite that all RLHF is implicitly model based RL? Seems super obvious but I can't recall any papers making this claim, just a few lesswrong posts.
@kchonyc
@GoogleDeepMind
This is misinformation. The model does not know this information and people will take this as evidence that the numbers are correct. Please delete it and refrain from encouraging people to ask models about themselves
Now that we can write Tiny Papers
@iclr_conf
, what should we write about?
I'd like to invite all established researchers to contribute Tiny Ideas as inspirations, seeds for discussions & future collaborations!
#TinyIdeasForTinyPapers
I'll start. Note: bad ideas == good starts.
MUSIQ is a new approach for image quality assessment (IQA) that uses a patch-based multi-scale transformer architecture, bypassing the typical fixed input size constraint of CNN-based architectures, to achieve state-of-the-art IQA performance. Read more →
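The multi-scale patch idea can be illustrated with a short numpy sketch (function names, patch size, and the naive average-pool resizing are my own simplifications, not MUSIQ's actual preprocessing): an image of any resolution is downscaled to several scales and cut into fixed-size patches, yielding one variable-length token sequence.

```python
import numpy as np

def downscale(img, factor):
    # Naive average-pool downscaling (stand-in for proper image resizing).
    h, w = img.shape[0] // factor * factor, img.shape[1] // factor * factor
    img = img[:h, :w]
    return img.reshape(h // factor, factor, w // factor, factor, -1).mean(axis=(1, 3))

def to_patches(img, p=32):
    # Cut the image into non-overlapping p x p patches, dropping remainders.
    h, w = img.shape[0] // p * p, img.shape[1] // p * p
    img = img[:h, :w]
    patches = img.reshape(h // p, p, w // p, p, -1).swapaxes(1, 2)
    return patches.reshape(-1, p, p, img.shape[-1])

def multiscale_patch_sequence(img, scales=(1, 2, 4), p=32):
    # MUSIQ-style idea: one flat sequence of patch tokens drawn from several
    # scales of the same image, so no fixed input resolution is required.
    return np.concatenate([to_patches(downscale(img, s), p) for s in scales])

img = np.random.rand(200, 300, 3)  # arbitrary resolution
seq = multiscale_patch_sequence(img)
print(seq.shape)  # (68, 32, 32, 3): 54 + 12 + 2 patches across the 3 scales
```

A transformer can then attend over this sequence (plus scale/position embeddings in the real model) regardless of the original image size, which is what lifts the fixed-input-size constraint of CNN pipelines.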
In the coming weeks and months, we'll be putting out blog posts outlining our new direction. We want to make RLHF accessible to everyone, no matter how advanced you are. We want to lower the barrier of entry until some person in their basement with a GPU can make ChatGPT. 13/N
Passed part of my quals yesterday, got a new road bike to celebrate (have wanted a new one since last summer anyway and had been saving up for it since then). Never liked drop bars but for some reason on the Kona they're so incredibly comfortable I'm kinda shocked haha
@_joaogui1
@rodrigfnogueira
@carperai
Didn't wanna reply from the CarperAI account since this is my opinion and not my org's opinion but we have some preliminary results already confirming that RL makes a noticeable difference in terms of capabilities.
1/2
Hi 👋 if you are interested in:
🪿 Goose explainers, discussion, and resources
📃 Sharing the top goose related news
🔬 Learning more about geese
😂 Goose-related memes
Consider following me. ✔️
I have lots of great content planned for the new year! Stay tuned!🎉
Hi 👋 if you are interested in:
🤖 AI explainers, discussion, and resources
📃 Sharing the top AI papers every week
🔬 Learning more about medical AI research
😂 AI-related memes
Consider following me. ✔️
I have lots of great content planned for the new year! Stay tuned!🎉
New CARP model just dropped!
Model:
Example script:
Half the number of parameters but matches CARP-L performance across the board. Also perhaps subjective but qualitatively the results it produces are much more satisfying ;)
@pixlpa
Fabula vs. syuzhet: the idea that a written story is different from the underlying raw story (e.g., a story need not be written down to express identical information).
@gabrielpeyre
@jwkritchie
Bill Cook did a whole course on integer programming and solving TSP using DNNs. I recommend you read it! It's super cool. A lot of people are copying his work nowadays and not crediting him, and it really sucks. He did combinatorial deep learning back in 2012-ish.
“Towards a Model-theoretic View of Narratives”
@BlancheMinerva
@recardona
@DavidThue
and myself describe a foundation of narratology in the vocabulary of model and information theory. 1/N
@arankomatsuzaki
1) algorithm distillation
2) evolution through chain of thought to build out extensive behavior cloning datasets
3) Offline RL, results in better reward/TFLOPS
4) synthetic preferences
5) various (many) ways to improve generation diversity and prevent mode collapse.