Louis Castricato Profile Banner
Louis Castricato Profile
Louis Castricato

@lcastricato

Followers
3,538
Following
481
Media
422
Statuses
7,593

Math @uwaterloo , RLHF @BrownCSDept , Goosefluencer. x-RS @aieleuther , x-Head of LLMs @stabilityai , x-lead @CarperAI . co-founder @synth_labs . We're hiring.

Providence, RI
Joined September 2017
Don't wanna be here? Send us removal request.
Explore trending content on Musk Viewer
Pinned Tweet
@lcastricato
Louis Castricato
3 months
Come build with us
@rachelmetz
Rachel Metz
3 months
An *exclusive* from me about a new startup, @synth_labs , which is working to help companies get their AI systems do what they want (and avoid doing what they don't want). via @technology
1
17
39
8
6
43
@lcastricato
Louis Castricato
2 years
Joining @huggingface this summer as a research intern to work on large scale contrastive learning and retrieval, super excited 🎉
12
8
273
@lcastricato
Louis Castricato
2 years
Waited a long time to tweet this 😅 but I'm happy to announce that I am a recipient of @StabilityAI PhD fellowship that'll be funding my studies at @BrownCSDept until 2026.
18
6
198
@lcastricato
Louis Castricato
3 months
Excited to announce that I've founded new company! is dedicated to solving the AI alignment challenge–ensuring AIs are robustly aligned with human intentions & values. Grateful to have @NathanThinks @fdesouza as cofounders & Seed funding from @M12vc &
Tweet media one
18
18
183
@lcastricato
Louis Castricato
2 years
So uh... I'm joining @BrownCSDept in the fall as a PhD student to work on narrative theory and reader models 🎉
Tweet media one
21
3
171
@lcastricato
Louis Castricato
10 months
As of today I have left my role at Stability and Carper. I wish them best of luck, and I am excited to join some of my closest friends at @AiEleuther along side @BlancheMinerva :)
19
2
156
@lcastricato
Louis Castricato
1 year
Recent advances with language models have been powered by Reinforcement Learning with Human Feedback (RLHF). At Carper, we're developing production ready open-source RLHF tools. Blog post by @natolambert , myself, @lvwerra and @Dahoas1
1
38
151
@lcastricato
Louis Castricato
1 year
Since this will only make me cool for a few more days, I have early access to an RLHF tuned open assistant. () Comment below with prompts.
10
19
152
@lcastricato
Louis Castricato
2 years
Thanks for the retweet! Are you interested in doing RLHF for story generation but using no human annotation data? Our research direction at Carper has got you covered! We explore the limitations of CLIP-like story critic models ala CARP and show various
@_akhaliq
AK
2 years
Robust Preference Learning for Storytelling via Contrastive Reinforcement Learning abs:
Tweet media one
1
58
304
4
17
110
@lcastricato
Louis Castricato
1 year
. @StabilityAI neurips 2022 party has commenced #StabilityNeurIPS
Tweet media one
7
5
92
@lcastricato
Louis Castricato
1 year
. @carperai just passed a major milestone a few days ago, its now a whole year old. I thought we could take this time to recap the achievements we've accomplished over the last year as well as discuss potential directions. of the lab. A thread 🧵1/N
2
20
92
@lcastricato
Louis Castricato
3 years
Three first author accepts :’)
5
1
71
@lcastricato
Louis Castricato
2 months
we're hiring for all roles. Open science stuff we're working on: 1) RLAIF for pretraining (we're making open source datasets). 2) benchmarks benchmarks benchmarks. 3) collaborating with @AiEleuther on some awesome projects. Work with us.
4
16
69
@lcastricato
Louis Castricato
4 years
@geoffreyhinton Yes, if t can’t explain how it works then how do you know who is to be held liable if something goes wrong? How do you know if it’s reasoning was flawless but it just so happen to be unlucky circumstances?
8
2
62
@lcastricato
Louis Castricato
2 months
you're looking at two people working on sick open source research. work with us.
Tweet media one
3
4
63
@lcastricato
Louis Castricato
2 years
Has anyone like seriously looked into training a speech language model on dolphins?
11
2
61
@lcastricato
Louis Castricato
2 years
Tweet media one
1
3
58
@lcastricato
Louis Castricato
2 years
So excited for my first day at HF 🥰
2
2
56
@lcastricato
Louis Castricato
1 year
It finally arrived 🥰 @carperai
Tweet media one
5
2
56
@lcastricato
Louis Castricato
5 months
doing an open source RLHF/mech interp meet up. Capacity is a bit limited. I'll be accepting people with substantial open source work in this space first and foremost.
1
12
20
@lcastricato
Louis Castricato
3 years
Releasing the demo notebook! You can try CARP-L here, just click "Run all" You can edit the stories and the critiques. It'll perform prompt softening under the hood using Pegasus.
@_akhaliq
AK
3 years
Cut the CARP: Fishing for zero-shot story evaluation abs: a scalable, efficient method for performing qualitatively superior, zero-shot evaluation of stories and a new corpora composed of 1.3M aligned story-critique pairs derived from over 80,000 stories
Tweet media one
4
22
120
2
10
48
@lcastricato
Louis Castricato
2 years
I finally met one of my favorite geese @iScienceLuvr
Tweet media one
3
1
45
@lcastricato
Louis Castricato
3 years
Zero shot semantic style transfer for editing faces. Original, Spock, James T. Kirk designed by Bauhaus, Low poly Hikaru Sulu. Edits are performed using CLIP system guided by L-grammars and Faces VQGAN.
Tweet media one
Tweet media two
Tweet media three
Tweet media four
5
2
44
@lcastricato
Louis Castricato
1 year
@mark_riedl Holy shit I almost died laughing, I'm waiting for someone to try to recruit a candidate they found via a ouija board for a company board
0
1
42
@lcastricato
Louis Castricato
2 years
Shame on @WriteSonic for using Stable Diffusion on their website while (illegally) not mentioning the RAIL license or crediting @StabilityAI . I really hope you guys didn't try to pass that model as your own to VCs ;)
3
6
39
@lcastricato
Louis Castricato
6 months
Spotted at #EMNLP
Tweet media one
4
3
39
@lcastricato
Louis Castricato
3 years
@ak92501 You beat me to tweeting my own paper LMAO
1
1
39
@lcastricato
Louis Castricato
1 year
I have some stuff coming out later today on RLHF, keep your eyes peeled 😎
0
0
38
@lcastricato
Louis Castricato
6 months
Presenting on trlX at 20C #EMNLP
Tweet media one
2
4
35
@lcastricato
Louis Castricato
1 year
new activation function just dropped
Tweet media one
2
0
32
@lcastricato
Louis Castricato
6 months
Thanks to everyone who came out to the third new england RLHF hackers hackathon. We're going to be doing the next RLHF hackathon at NeurIPS 2023. An exact time and date has yet to be decided, but you can join the discord for more information.
2
7
31
@lcastricato
Louis Castricato
1 year
Tweet media one
@lcastricato
Louis Castricato
1 year
I closed on the house 🥰
2
0
23
3
0
31
@lcastricato
Louis Castricato
3 years
I just bought and I have no idea what to put there
7
1
30
@lcastricato
Louis Castricato
4 years
@dril_gpt2 She tied pulling dril GPT2 out of his GPU. Awful! AI cruelty!
1
0
26
@lcastricato
Louis Castricato
12 days
honored to be on @swyx 's high signal ai list, which is pleasantly devoid of @iScienceLuvr 🥰
3
0
28
@lcastricato
Louis Castricato
2 months
Hiring a world class ML Engineer at SynthLabs. Drop me a note if you want to push world class open science and build a strong and loving community.
2
7
28
@lcastricato
Louis Castricato
3 years
Carp seq2seq (based off of GPT-J) is on the eye. You provide stories and it critiques them. Carp eval, for automated story evaluation, will be out within the coming weeks. License is MIT. Paper soon.
Tweet media one
2
3
28
@lcastricato
Louis Castricato
10 months
The amount of recruiters I've had contact me in the past 24 hours is *insane* LMFAO
2
0
28
@lcastricato
Louis Castricato
1 year
@ylecun The model is clearly not GPL though no? Only the code is?
1
0
27
@lcastricato
Louis Castricato
1 year
@aichip1 @drjwrae RLHF at smaller scales absolutely does work. We have some stuff coming out soon, I'm pretty excited.
0
3
28
@lcastricato
Louis Castricato
4 years
@colinraffel “Oh cool some hard math! The work must be right then” it’s so hard to get feedback on my mathematics if no one in my field fully understands what I’m actually doing! Everyone gets it at an intuitive level sure, but I still don’t know if I’m using the right abstractions.
1
0
28
@lcastricato
Louis Castricato
3 years
Tweet media one
0
2
26
@lcastricato
Louis Castricato
2 years
Sci-fi cyberpunk canada goose, trending on art station #DALLE
Tweet media one
1
2
26
@lcastricato
Louis Castricato
1 year
Got something really spicy coming tomorrow ;)
2
0
25
@lcastricato
Louis Castricato
4 years
In an event to mimic the success of @KordingLab 's recent neuro event, I decided to host my own pure math in deep learning event. Here is the calendar event URL. Please RSVP if you intend to attend. The meeting is tomorrow night (Tuesday) at 10PM EST.
10
10
25
@lcastricato
Louis Castricato
2 years
@arankomatsuzaki The real peer review begins when Phil implements this
0
0
25
@lcastricato
Louis Castricato
2 years
i'll give $50 and an appropriate amount of compute to someone who wants to finetune stable diffusion on geese for me.
3
0
24
@lcastricato
Louis Castricato
4 years
@arankomatsuzaki Absolutely shameless plug: I wrote about this exact relationship back in April. I think my theory took it in a different direction than theirs as I borrowed more deeply from the cognitive neuroscience side of things. I should contact the authors.
1
2
22
@lcastricato
Louis Castricato
2 years
Trying to figure out interest, who would be interested in a @StabilityAI launch event in Boston on October 16th? A dinner party in Cambridge, hosted by yours truly.
Not interested
96
Interested
89
Tentative
38
6
4
23
@lcastricato
Louis Castricato
2 years
@ClementDelangue There's an Eleuther meetup in dolores park at 3pm on Sunday if that counts 😆
5
1
23
@lcastricato
Louis Castricato
2 years
Tweet media one
0
1
22
@lcastricato
Louis Castricato
1 year
I closed on the house 🥰
2
0
23
@lcastricato
Louis Castricato
1 year
@omarsar0 Incredibly misleading. We have no fast how ChatGPT trained or if it was based on a chinchilla-like approach. Similarly its only 15x faster because inference fits on a single gpu, not because of anything the repository does. This is a thin DS wrapper around a naive PPO implt.
2
1
23
@lcastricato
Louis Castricato
1 year
Come watch me talk about past present and future of RLHF @Carperai tonight at 5:55 PM EST on
0
3
22
@lcastricato
Louis Castricato
4 years
@dril_gpt2 Do you think all the GPT2 accounts are friends
1
0
18
@lcastricato
Louis Castricato
1 year
Anyone have a paper to cite that all RLHF is implicitly model based RL? Seems super obvious but I can't recall any papers making this claim, just a few lesswrong posts.
5
0
22
@lcastricato
Louis Castricato
10 months
@josephofiowa This is the kind of content I use this platform for
1
0
21
@lcastricato
Louis Castricato
6 months
Stella is completely right. I am at EMNLP right now and I've already heard two people quote @kchonyc as if his joke was fact.
@BlancheMinerva
Stella Biderman
6 months
@kchonyc @GoogleDeepMind This is misinformation. The model does not know this information and people will take this as evidence that the numbers are correct. Please delete it and refrain from encouraging people to ask models about themselves
5
2
142
2
0
21
@lcastricato
Louis Castricato
1 year
Tiny RLHF ideas!
@savvyRL
Rosanne Liu
1 year
Now that we can write Tiny Papers @iclr_conf , what should we write about? I'd like to invite all established researchers to contribute Tiny Ideas as inspirations, seeds for discussions & future collaborations! #TinyIdeasForTinyPapers I'll start. Note: bad ideas == good starts.
5
22
166
1
4
21
@lcastricato
Louis Castricato
7 months
I'll be at EMNLP and neurips this year if anyone wants to get dinner or hang out
8
0
21
@lcastricato
Louis Castricato
1 year
I just got the clear to close on my new house today 🥰
3
0
21
@lcastricato
Louis Castricato
2 years
Thank you google, incredibly based and goosepilled.
@GoogleAI
Google AI
2 years
MUSIQ is a new approach for image quality assessment (IQA) that uses a patch-based multi-scale transformer architecture, bypassing the typical fixed input size constraint of CNN-based architectures, to achieve state-of-the-art IQA performance. Read more →
Tweet media one
4
56
221
1
1
19
@lcastricato
Louis Castricato
2 years
Credit EleutherAI memes channel
Tweet media one
1
0
20
@lcastricato
Louis Castricato
1 year
Coming soon to a Carper near you...
Tweet media one
4
1
20
@lcastricato
Louis Castricato
2 years
Tweet media one
0
0
19
@lcastricato
Louis Castricato
6 months
Hanging out at neurips come say hi
Tweet media one
0
0
18
@lcastricato
Louis Castricato
4 years
Joining @gtcomputing in the fall as an MSCS student.
Tweet media one
5
0
18
@lcastricato
Louis Castricato
1 year
In the coming weeks and months, we'll be putting out blog posts outlining our new direction. We want to make RLHF accessible to everyone, no matter how advanced you are. We want to lower the barrier of entry until some person in their basement with a GPU can make ChatGPT.13/N
1
0
18
@lcastricato
Louis Castricato
6 months
I'm hanging out on the conference floor today if anyone wants to meet up!
Tweet media one
3
0
18
@lcastricato
Louis Castricato
2 years
I'm ready for my birthday honks 💅
5
0
18
@lcastricato
Louis Castricato
3 years
@Hector_Lowe @_joaogui1 Picture of the paleontologists
Tweet media one
0
0
17
@lcastricato
Louis Castricato
1 year
Passed part of my quals yesterday, got a new road bike to celebrate (have wanted a new one since last summer anyway and had been saving up for it since then). Never liked drop bars but for some reason on the Kona they're so incredibly comfortable I'm kinda shocked haha
Tweet media one
1
0
17
@lcastricato
Louis Castricato
1 year
@_joaogui1 @rodrigfnogueira @carperai Didn't wanna reply from the CarperAI account since this is my opinion and not my org's opinion but we have some preliminary results already confirming that RL makes a noticeable difference in terms of capabilities. 1/2
2
1
17
@lcastricato
Louis Castricato
2 years
@hardmaru Aaand now I want a 4090 :/
2
0
17
@lcastricato
Louis Castricato
5 months
Hi 👋 if you are interested in: 🪿 Goose explainers, discussion, and resources 📃 Sharing the top goose related news 🔬 Learning more about geese 😂 Goose-related memes Consider following me. ✔️ I have lots of great content planned for the new year! Stay tuned!🎉
@iScienceLuvr
Tanishq Mathew Abraham, Ph.D.
5 months
Hi 👋 if you are interested in: 🤖 AI explainers, discussion, and resources 📃 Sharing the top AI papers every week 🔬 Learning more about medical AI research 😂 AI-related memes Consider following me. ✔️ I have lots of great content planned for the new year! Stay tuned!🎉
6
4
131
1
0
17
@lcastricato
Louis Castricato
2 years
Tweet media one
@GaryMarcus
Gary Marcus
2 years
Arithmetic Smackdown! 🥊 👉 Worst fiasco in CPU in history vs 👉 Deep learning’s greatest hit! And the winner is …
Tweet media one
10
20
103
1
2
17
@lcastricato
Louis Castricato
1 year
@repligate I'm using RLHF to make goosegirls to be fair
1
0
16
@lcastricato
Louis Castricato
9 months
I got plants 😎 I'm a plant dad now
Tweet media one
0
0
16
@lcastricato
Louis Castricato
10 months
Love being able to dunk on RLHF for five minutes straight and still get an applause 🥰
@lina_colucci
Lina Colucci, PhD
10 months
Tweet media one
0
4
25
0
0
16
@lcastricato
Louis Castricato
3 years
@kaixhin Basically failed introduction to neuroscience, went on to win a best paper a few months after on neuroscience
1
0
15
@lcastricato
Louis Castricato
2 years
New CARP model just dropped! Model: Example script: Half the number of parameters but matches CARP-L performance across the board. Also perhaps subjective but qualitatively the results it produces are much more satisfying ;)
@lcastricato
Louis Castricato
3 years
Releasing the demo notebook! You can try CARP-L here, just click "Run all" You can edit the stories and the critiques. It'll perform prompt softening under the hood using Pegasus.
2
10
48
1
2
14
@lcastricato
Louis Castricato
1 year
Twenty minutes until the stability party 😎
1
0
15
@lcastricato
Louis Castricato
1 month
Tweet media one
@nabeelqu
Nabeel S. Qureshi
1 month
Waterloo grads are unbelievably cracked software engineers. Has anyone written up an essay on what they’re doing there? Would be interested to read.
117
87
2K
1
1
15
@lcastricato
Louis Castricato
4 years
Tweet media one
0
0
15
@lcastricato
Louis Castricato
2 years
Tweet media one
0
0
15
@lcastricato
Louis Castricato
1 year
Releases are always so stressful....
1
0
14
@lcastricato
Louis Castricato
4 years
@pixlpa Fabula vs syuzhet. The idea that a written story is different than the actual raw story (Eg a story need not be written to express identical information)
1
2
14
@lcastricato
Louis Castricato
3 years
me n the boys having an absolute rager
Tweet media one
1
0
14
@lcastricato
Louis Castricato
4 years
@gabrielpeyre @jwkritchie Bill Cook did a whole course on integer programming and solving TSP using DNNs. I recommend you read it! It’s super cool. A lot of people are copying his work now adays and not crediting him, it really sucks. He did combinatorial deep learning back in 2012ish
2
2
14
@lcastricato
Louis Castricato
2 years
Come work on fun code projects with us at #EleutherAI we need an AWS expert! Easy way to get onto a paper with the lovely @TaliaRinger @moyix and @BlancheMinerva :)
@EricSchles
Eric Schles
2 years
I'm trying to get this code moved from Azure terraform to AWS terraform: anyone want to pair with me?😅
0
0
3
2
6
14
@lcastricato
Louis Castricato
1 year
First driving lesson today, I kept forgetting I was the driver.
1
0
14
@lcastricato
Louis Castricato
1 year
:goose16:
Tweet media one
1
0
13
@lcastricato
Louis Castricato
3 years
“Towards a Model-theoretic View of Narratives” @BlancheMinerva @recardona @DavidThue and myself describe a foundation of narratology in the vocabulary of model and information theory. 1/N
2
0
14
@lcastricato
Louis Castricato
1 year
Feelin like a silly goose tn
Tweet media one
0
0
14
@lcastricato
Louis Castricato
1 year
@arankomatsuzaki 1) algorithm distillation 2) evolution through chain of thought to build out extensive behavior cloning datasets 3) Offline RL, results in better reward/TFLOPS 4) synthetic preferences 5) various (many) ways to improve generation diversity and prevent modal collapse.
1
0
13
@lcastricato
Louis Castricato
3 years
I should make a bot that screenshots anyone's #NFT and sells it at less than what they were selling it at (and donates all income)
1
0
13
@lcastricato
Louis Castricato
6 months
What if Q* was the friends we made along the way
0
0
13