raulpuri.eth Profile
raulpuri.eth

@TheRealRPuri

Followers
6,829
Following
329
Media
19
Statuses
432

AI things @ OpenAI - Her, GPT4V, GPT4, GPT3.5, Codex | past: NVIDIA - megatron, sentiment neurons | go bears 🐻

Joined March 2014
Don't wanna be here? Send us removal request.
Explore trending content on Musk Viewer
@TheRealRPuri
raulpuri.eth
8 months
Fun fact: I’m a terrible web dev. I used gpt-4V at some point to write code for some of its own prototype GUIs.
@mckaywrigley
Mckay Wrigley
8 months
You can give ChatGPT a picture of your team’s whiteboarding session and have it write the code for you. This is absolutely insane.
677
5K
31K
22
50
632
@TheRealRPuri
raulpuri.eth
21 days
We fuckin sent it
@OpenAI
OpenAI
21 days
Say hello to GPT-4o, our new flagship model which can reason across audio, vision, and text in real time: Text and image input rolling out today in API and ChatGPT with voice and video in the coming weeks.
3K
14K
65K
17
7
411
@TheRealRPuri
raulpuri.eth
16 days
@BUNNYEXPLODER Men like being called “beautiful”/“gorgeous” too!
12
4
402
@TheRealRPuri
raulpuri.eth
7 months
All us @OpenAI employees coming back from PTO after dev day
8
5
148
@TheRealRPuri
raulpuri.eth
5 years
We Just released a cool #PyTorch #NaturalLanguageProcessing project we've been working on: training an 8.3B GPT2 model with model parallelism. Check it out... Details: Training Code:
3
51
149
@TheRealRPuri
raulpuri.eth
6 months
Every time I open this image I end up having to play where’s Waldo again to find myself
@gdb
Greg Brockman
6 months
we are so back
Tweet media one
2K
4K
52K
9
35
112
@TheRealRPuri
raulpuri.eth
7 months
Team is committed to the mission
@MattPRD
Matt Schlicht
7 months
This is by far the most impressive part of @OpenAI this weekend. No matter what is going on, customers are being treated as #1 . ❤️
Tweet media one
37
74
872
6
26
107
@TheRealRPuri
raulpuri.eth
7 months
@gdb @sama I liked working with y’all. I hope our paths cross again so that we may work together once more. It was a pleasure
1
2
100
@TheRealRPuri
raulpuri.eth
23 days
Something magical is happening in SF this weekend for sure. The universe is winking.
Tweet media one
@sama
Sam Altman
24 days
not gpt-5, not a search engine, but we’ve been hard at work on some new stuff we think people will love! feels like magic to me. monday 10am PT.
1K
3K
28K
5
6
99
@TheRealRPuri
raulpuri.eth
8 months
If you wonder what we’ve been doing with vision since march’s gpt-4 announcement: Safety!
@_lamaahmad
Lama Ahmad لمى احمد
8 months
We've included a system card focused on the vision capabilities, building on the work from the GPT-4 system card. Thank you to all our expert testers and red teamers for helping to inform this work!
4
12
110
15
6
91
@TheRealRPuri
raulpuri.eth
8 months
Also very big shout out to @DustMason for helping us get this out to all you users
@gdb
Greg Brockman
8 months
It’s been really amazing watching @TheRealRPuri work so hard across the whole stack to make image inputs a reality. Congrats to core contributors @shgusdngogo , Jamie Kiros, @longouyang , @daniellevy__ , @choniong , @SandhiniAgarwal , awesome work all.
11
21
221
8
8
76
@TheRealRPuri
raulpuri.eth
6 months
Prompt engineering is just technical/document writing for robots instead of humans
@goodside
Riley Goodside
6 months
Is prompt engineering dead? No, it’s SoTA. GPT-4 with good prompts (dynamic k-shot + self-generated CoT + choice-shuffled ensembles) beats Med-PaLM 2 on all nine of the MultiMedQA benchmarks it was fine-tuned for, without fine-tuning:
Tweet media one
23
160
1K
4
41
43
@TheRealRPuri
raulpuri.eth
8 months
Shipping multimodal AI is hard. Urban planning is harder
@OwainEvans_UK
Owain Evans
8 months
The most implausible prediction from the movie "Her" is not the AI but high-density walkable Los Angeles.
Tweet media one
21
78
2K
1
5
41
@TheRealRPuri
raulpuri.eth
6 months
Gpt-4V for your dating life
@levelsio
@levelsio
6 months
✨ My first 100% ChatGPT startup is now monetized: ❤️‍🔥 - $9.99 to get a full dating profile review for your Tinder, Bumble or Hinge The site has NO landing page, which is new for me too, I just use the Stripe payment link as the landing page, which after
Tweet media one
Tweet media two
Tweet media three
175
80
2K
2
33
37
@TheRealRPuri
raulpuri.eth
6 months
Galaxy brain
@wgussml
william
6 months
CMU ml phd really coming in clutch for building gmail snooze button never thought i'd be using nonhomogenous poisson processes for email scheduling but 2024 here we come
1
1
9
2
33
36
@TheRealRPuri
raulpuri.eth
6 months
To my colleagues at google: WELCOME TO THE ARENA! @xiao_ted @drjwrae @vedantmisra
2
1
31
@TheRealRPuri
raulpuri.eth
1 year
@NandoDF I remember our rejections after submitting to sysML. One reviewer thought we weren’t a systems paper. Another reviewer pointed out that our architecture/model wasn’t novel. …… wat
0
0
30
@TheRealRPuri
raulpuri.eth
4 years
Excited to share our #acl2020 work on Large Scale Multi-Actor Generative Dialog Modeling, done with amazing coauthors Alex Boyd, @MostofaPatwary , Mohammad Shoeybi, @ctnzr ! Join our QA session 5-6 UTC tonight and 21-22pm UTC tomorrow.
1
6
24
@TheRealRPuri
raulpuri.eth
3 years
This was so fun to work on. So proud of the whole team 🎉🥳
@OpenAI
OpenAI
3 years
Welcome, @github Copilot — the first app powered by OpenAI Codex, a new AI system that translates natural language into code. Codex will be coming to the API later this summer.
69
784
3K
0
1
23
@TheRealRPuri
raulpuri.eth
8 months
@DrJimFan Ahhh how the goalposts move
2
0
22
@TheRealRPuri
raulpuri.eth
5 months
Congrats yall !
@_weiping
Wei Ping
5 months
ChatQA can outperform GPT-4 on a wide range of conversational QA tasks: - ChatQA and GPT-4 take the same top-5 chunks from our best retriever, when long documents are involved. - ChatQA performs very well on tabular data, arithmetic calculation, and “unanswerable” cases!
1
21
87
3
9
19
@TheRealRPuri
raulpuri.eth
6 months
@OpenAI tender achieved (chicken tender)
3
0
21
@TheRealRPuri
raulpuri.eth
7 months
@ilyasut Apologizing doesn’t mean one is right or wrong, it does mean however that one values their relationships more than their ego. Feeling valued. ❤️ OpenAi is nothing without its people.
1
0
20
@TheRealRPuri
raulpuri.eth
1 year
🥲
@trashh_dev
trash
1 year
F in chat
Tweet media one
68
407
5K
1
0
15
@TheRealRPuri
raulpuri.eth
5 years
Just read "Fine-Tuning Language Models from Human Preferences" from @OpenAI . I feel like this isn't being talked about as much as it should. Dope use case/implementation of GPT-2 and #NLProc . blog: github:
1
8
18
@TheRealRPuri
raulpuri.eth
8 months
Def need to do a longer list here, but also wanted to shout out @MadelaineBoyd for bringing y’all a safe n trustworthy user experience !
1
0
16
@TheRealRPuri
raulpuri.eth
6 months
@filippie509 Bruh don’t say that. Now you added it to the training data 🙃
1
0
16
@TheRealRPuri
raulpuri.eth
5 years
It’s amazing to see how much the #NaturalLanguageProcessing community has grown in the past few years and the problems we’re tacklin
@OpenAI
OpenAI
5 years
GPT-2 6-month follow-up: we're releasing the 774M parameter model, an open-source legal doc organizations can use to form model-sharing partnerships, and a technical report about our experience coordinating to form new publication norms:
31
343
945
0
3
17
@TheRealRPuri
raulpuri.eth
4 years
Stop by our poster "Zero-shot Text Classification With Generative Language Models" today at the #neurips19 Meta-Learning workshop. Learn about the intersection of Meta Learning/NLP and some of the challenges going forward. paper:
Tweet media one
1
7
16
@TheRealRPuri
raulpuri.eth
8 months
@mckaywrigley @mckaywrigley hi author here 👋. Can you purposefully try messing up and see if it can correct code based on a screenshot of the messed up deploy. Really curious to see examples of iterative/interactive usage
1
0
14
@TheRealRPuri
raulpuri.eth
1 year
🥲
@mattturck
Matt Turck
1 year
ChatGPT is clearly the child of immigrants because they keep forcing it to be a doctor or lawyer when all it wants to do is creative writing and hallucinating
24
134
1K
0
0
13
@TheRealRPuri
raulpuri.eth
7 months
@soumithchintala What qualifies as a step function though? Like bigger model/capability numbers go up? Personally I feel like the GPT-4V api is a step function and will change what devs build, we just promoted content related to changing how devs build instead. play with it, lmk...
2
1
10
@TheRealRPuri
raulpuri.eth
8 months
Y’all 👀
@E0M
Evan Morikawa
8 months
I’ll be giving a behind the scenes look at how we scaled ChatGPT at 10:30. Most of this we’ve never talked publicly about before and I’m super excited to share it here at @TheLeadDev West Coast.
Tweet media one
5
7
113
0
2
11
@TheRealRPuri
raulpuri.eth
5 years
If any #NLProc researcher is thinking of doing this, please don’t be an idiot and read this insightful thread first.
I've seen several different #NLProc folks suggesting today that it would fun/interesting/worthwhile to use BERT or GPT-2 to fill in the redacted bits of the Mueller report. A short thread on why this is a terrible idea /1
9
119
346
1
5
10
@TheRealRPuri
raulpuri.eth
6 months
@xiangyue96 First, love this work Second, hold my beer. Lemme cook for a year.
0
0
9
@TheRealRPuri
raulpuri.eth
5 years
More awesome tools from @nvidia to help create fast #PyTorch models.
@marekinfo
Marek Kolodziej
5 years
Proud to be the co-author of a new @PyTorch GPU performance analysis tool that just got merged into @nvidia Apex:
3
31
173
0
5
8
@TheRealRPuri
raulpuri.eth
2 years
@rtaori13 4x the tuition and also has a total student body a fraction of the size
0
0
9
@TheRealRPuri
raulpuri.eth
7 months
@DrJimFan 180$ for now*
1
0
6
@TheRealRPuri
raulpuri.eth
1 year
Perks of working at OAI: cool paper clips
@tszzl
roon
1 year
oh shit it’s starting
Tweet media one
53
52
1K
0
0
8
@TheRealRPuri
raulpuri.eth
8 months
Correct half the time is when something becomes more useful than unuseful. 📈✈️ Everyone's at the starting line of something new and wonderful. ✨💫
@lupantech
Pan Lu
8 months
🔥 Exciting Update! We've manually evaluated #GPT4V using the playground chatbot on #MathVista , our newest benchmark for visual mathematical reasoning. 🚀 #GPT4V soared with a 15.1%⬆️ improvement over #Bard , setting a new record at 49.9%! 🎉 🌐 Yet,
Tweet media one
3
28
136
0
3
8
@TheRealRPuri
raulpuri.eth
2 years
@kevin_zakka Another take - Maybe a healthy way to make an impact on robotics isn’t to work directly on robotics
0
0
7
@TheRealRPuri
raulpuri.eth
5 years
MEGATRON Update: 4x more data leads to new WikiText and LAMBADA sota. Check out our updated blog post and @WeCNLP poster ( #21 ). #NaturalLanguageProcessing
1
6
7
@TheRealRPuri
raulpuri.eth
3 years
We need an osmosis jones reboot for COVID. Thank you for coming to my TED talk.
Tweet media one
0
3
7
@TheRealRPuri
raulpuri.eth
7 months
@wangzjeff Damn man, I hope you’re kids never get a 740 on their SAT
1
0
6
@TheRealRPuri
raulpuri.eth
1 year
Academic graduations, other ceremonies, etc. are beautiful. You feel so connected to the fabric of humanity. Wish I could bottle the feeling and share it with others.
0
0
6
@TheRealRPuri
raulpuri.eth
7 months
@mezaoptimizer A story from creating gpt-4V: 2 years ago while moving apartments I almost threw away my old college notes. Ppl didn’t understand why I wanted to hold on to them. Back then I knew where we were headed. I knew I’d eventually be able to plug them into an AI capable of reading them.
1
0
6
@TheRealRPuri
raulpuri.eth
22 days
Aight yall. The team sent it on this one. Sleepy time now. Will see yall on the other side.
1
0
6
@TheRealRPuri
raulpuri.eth
5 years
@eukaryote314 ~800M gpt2. this is largely due to the memory required to House parameters + optimizer states. If one uses a smaller optimizer than Adam training something larger should be possible. Make sure to turn on activation checkpointing with —checkpoint-activations
0
0
6
@TheRealRPuri
raulpuri.eth
23 days
@kimmonismus “All that remains is to wait for the roll-out not to take another 6 months.” Bruh, like can I get a vacation…
0
0
5
@TheRealRPuri
raulpuri.eth
8 months
@tszzl @tszzl lies. I saw him give access to gpt-7 not 5
2
0
6
@TheRealRPuri
raulpuri.eth
8 months
@gdb I feel this so hard rn 😭
0
0
5
@TheRealRPuri
raulpuri.eth
5 years
@alexeev_eu @Miles_Brundage We’re working on user friendliness before getting to that point. As you can imagine massive models like this aren’t readily accessible to everyone.
1
0
5
@TheRealRPuri
raulpuri.eth
7 months
Big alpha
@ChrisJBakke
Chris Bakke
7 months
BREAKING: OpenAI and Burger King to merge. They have tapped the CEO of Mattress Firm to run the combined company. The former CEO of Mattress Firm (4th CEO of OpenAI in 72 hours) was fired after 7 minutes for Slacking the entire company (the remaining 9 employees) a gif of
131
170
4K
3
0
5
@TheRealRPuri
raulpuri.eth
4 years
@joeddav @srchvrs @joeddav adding some additional context to the sliding window/tokenization normalization discussion. We cover both in appendix E.1 of Megatron-LM (including the exact values for normalization and window size we got from openai)
1
1
5
@TheRealRPuri
raulpuri.eth
11 months
@Miles_Brundage Liking 10 tweets rq
0
0
2
@TheRealRPuri
raulpuri.eth
5 years
Awesome audio. Awesome authors. All around awesome. 🎶🎵🎶🎵
@RafaelValleArt
Rafael Valle
5 years
We just released the paper and code for Mellotron: a multispeaker voice synthesis model that can make a voice emote and sing without emotive or singing training data.
3
119
391
0
1
5
@TheRealRPuri
raulpuri.eth
6 months
@joannejang Felt this…
0
0
5
@TheRealRPuri
raulpuri.eth
4 years
We pretrain large generative language models on multi-actor reddit conversations. Furthermore, by conditioning on users' past reddit conversations we use in-sequence meta learning to control and personalize the dialogue model's response.
1
2
5
@TheRealRPuri
raulpuri.eth
3 years
Ken griffin buying the constitution is a depressingly poetic reflection of the times: capitalist overlord buys out the us constitution. Orgs like @ConstitutionDAO are more necessary than ever. … wagmi
0
0
4
@TheRealRPuri
raulpuri.eth
1 year
@nishrreddy So cute beta
0
0
4
@TheRealRPuri
raulpuri.eth
6 months
@tszzl Spotify called me out on adhd 🥲
Tweet media one
2
0
4
@TheRealRPuri
raulpuri.eth
1 year
@soumithchintala I think ppl also underestimate how much harder it is to go from 80->90% on a task than 70->80% on a task. That asymptote at 100% is nasty business. Reaching the asymptote is also necessary for prod reliability. Add the difficulty of generality compounding on top of that as well.
1
1
4
@TheRealRPuri
raulpuri.eth
1 year
I feel so exposed rn
@Quasilocal
Steve McCormick
1 year
The true academic path
Tweet media one
21
412
3K
0
0
4
@TheRealRPuri
raulpuri.eth
4 years
@egrefen @PyTorch Wooo no more memory leaks
1
0
4
@TheRealRPuri
raulpuri.eth
1 year
@hardmaru How about globe-centric? Japan-centric feels like it’s missing the point…
0
0
4
@TheRealRPuri
raulpuri.eth
4 years
@DeepMind @ai_ucl Is there a syllabus or website with a list of speakers/talks?
1
0
4
@TheRealRPuri
raulpuri.eth
6 months
@unusual_whales Moving the money supply closer to the coke and lambo supply 💸💸💸
1
0
4
@TheRealRPuri
raulpuri.eth
7 months
@gdb @gdb noooo our image platform isn't ready to turn into a video platform.😭
0
0
4
@TheRealRPuri
raulpuri.eth
1 year
@RandolphCarterZ @tszzl Having been part of both parties’ work… nah. Researchers believe in 99.9% of the words they write. Researchers care deeply about what we work on. We don’t publish something we’re unhappy with. The remaining .1% is disagreement with coauthors, who we ultimately believe in…
0
0
4
@TheRealRPuri
raulpuri.eth
6 months
@saranormous Over the last few years we got computers that could use fuzzy logic with discreet inputs. Now we get fuzzy logic and fuzzy inputs. Never before in history have we had a computer that could see
1
0
4
@TheRealRPuri
raulpuri.eth
5 months
@brokekatmtn Scientology?…
0
0
4
@TheRealRPuri
raulpuri.eth
1 year
@konet @karpathy At what timestamp does he say that? Not tryna sit through this whole vid😅
0
0
4
@TheRealRPuri
raulpuri.eth
7 months
@gdb @gdb bruh didn’t you demo this 🙃…
1
0
3
@TheRealRPuri
raulpuri.eth
4 years
By pretraining the model with a N-way multiple choice title prediction class we can achieve up to a 45% absolute accuracy boost in common downstream text classification tasks.
1
1
3
@TheRealRPuri
raulpuri.eth
7 months
@landay @OpenAI @StanfordHAI Academia politics exist and are not fun either
1
0
3
@TheRealRPuri
raulpuri.eth
3 years
@agazdecki @saastr totally not a founder, just an AI researcher, but would love to join. Would love to see some change in the funding/acquisition landscape for more capex-intense science/eng ventures
0
0
3
@TheRealRPuri
raulpuri.eth
3 years
@WhiteClaw the claw is the law. I feel like this should be the new logo
Tweet media one
1
0
3
@TheRealRPuri
raulpuri.eth
7 months
@aaron_defazio It’s def easier to model and theorize about, that’s for sure
0
0
1
@TheRealRPuri
raulpuri.eth
5 years
@eukaryote314 I mean they’re not mutually exclusive. If you make a model 10x larger while making it 10x more efficient that’s a win in both axis. The point is that now these models can be trained
3
0
3
@TheRealRPuri
raulpuri.eth
8 months
@xiao_ted @xiao_ted this shit was delicious
0
0
2
@TheRealRPuri
raulpuri.eth
17 days
@SmokeAwayyy @ChatGPTapp “- Add a counter or symbol as you approach a text or image usage cap. - Add a timer showing time until caps are lifted.” Lmao, ChatGPT becomes clash of clans
0
0
3
@TheRealRPuri
raulpuri.eth
6 months
@jeremyphoward I mean like … damn, can we take a vacation
1
0
3
@TheRealRPuri
raulpuri.eth
11 months
@gdb Looking at the full log stream let’s us leverage the in context learner inside of us all 🙃
0
0
2
@TheRealRPuri
raulpuri.eth
3 years
So cool to finally see this go live. Can't wait to see what we'll do next.
@OpenAI
OpenAI
3 years
We’ve developed two neural networks which have learned by associating text and images. CLIP maps images into categories described in text, and DALL-E creates new images, like this, from text. A step toward systems with deeper understanding of the world.
Tweet media one
100
951
3K
0
0
2
@TheRealRPuri
raulpuri.eth
2 years
@sama Re: think very long term and execute relentlessly in the short term…… a pragmatic version of this from Jensen Huang is ~”think long term, but make sure you don’t die in the short term”
0
0
3
@TheRealRPuri
raulpuri.eth
4 years
@royschwartz02 @nlpnoah Congrats on the really cool paper. Did you guys explore connections to tensor-product transformers ? An input dependent manipulation of attention heads is used in both. This would be a really interesting ablation to inform future architecture designs.
1
0
3
@TheRealRPuri
raulpuri.eth
7 months
@Scobleizer Gpt-9 with vision > gpt-4 with vision
2
0
3
@TheRealRPuri
raulpuri.eth
1 year
@ariskonstant IKEA instruction writers are prompt engineers and I’m an overparameterized furniture assembling LM🙃
0
0
3
@TheRealRPuri
raulpuri.eth
4 years
Sharing some recent work with awesome collaborators :)
@sazoo_nlp
Sashank
4 years
Happy to announce that my internship work @NVIDIAAI has resulted in this publication. Big thanks to @wpingnet @TheRealRPuri @MohammadShoeybi @MostofaPatwary @ctnzr for being awesome mentors 😁 #NLProc
1
0
7
0
0
2
@TheRealRPuri
raulpuri.eth
5 years
@wgussml @rsalakhu Dope to see the evolution of your work from undergrad. Congrats Will <3!
0
0
3
@TheRealRPuri
raulpuri.eth
4 years
Come find out about our plans to incorporate text-to-text generative LMs with fewshot learning and other NLP tasks.
0
1
3
@TheRealRPuri
raulpuri.eth
4 years
Great mental model for processing any piece of twitter hype. ❤️
@deliprao
Delip Rao e/σ
4 years
It's out! The first @pagestlabs issue is on how to think about the buzz in breakthrough technologies like GPT-3 while living in the midst of it. Thanks everyone who subscribed early. Hope you like reading long posts 😅🖖
4
74
234
0
0
3
@TheRealRPuri
raulpuri.eth
2 years
Big one
@rjrshr
Rajarshi Roy
2 years
Arithmetic circuits were once the craft of human experts, and are now designed by AI in NVIDIA GPUs. H100 chips have nearly 13,000 AI designed circuits! How is this possible? Blog + a thread 🧵👇
25
415
2K
0
0
3
@TheRealRPuri
raulpuri.eth
5 months
@AMAZlNGNATURE This is just the grad night party scene from happy feet
0
0
2