pguptacs Profile Banner
Prateek Gupta Profile
Prateek Gupta

@pguptacs

Followers
204
Following
394
Media
3
Statuses
163

Postdoc | Max Planck Institute | AI for science | Exploring comp social science | Deep Learning | @UniofOxford @turinginst @MilaMontreal Views are my own.

Berlin, Germany
Joined August 2011
Don't wanna be here? Send us removal request.
@pguptacs
Prateek Gupta
4 days
LLMs trained with just 8-bit numbers? That led me down a rabbit hole. How does something so complex still work with such coarse formats?. Wrote about mixed-precision training, loss scaling, and what it might say about intelligence:.📚
0
0
1
@pguptacs
Prateek Gupta
12 days
Coverage by Scientific American 🥳.
@iyadrahwan
Iyad Rahwan | إياد رهوان
13 days
Great coverage by @sciam of our research on how ChatGPT is influencing the words we use in conversation!. Work led by @hiromu1996 & @LevinBrinkmann.
0
0
0
@pguptacs
Prateek Gupta
14 days
Excited to present 2 papers at #ICML!. 🧠 Dualities via ML: We rediscover the Kramers–Wannier duality using neural nets.📚 🌍 RICE-N: Agents learn to negotiate climate deals.📚
openreview.net
Global cooperation on climate change mitigation is essential to limit temperature increases while supporting long-term, equitable economic growth and sustainable development. Achieving such...
0
0
3
@pguptacs
Prateek Gupta
19 days
📚 Blog: 💻 Code: I wrote this to understand the nuts and bolts of LLM infra —.If you're on the same path, this might help. #PyTorch #LLMEngineering #DistributedTraining #MLInfra.
0
0
0
@pguptacs
Prateek Gupta
19 days
What’s inside 🛠️. ✅ Communication primitives (broadcast, all_reduce, etc.).✅ Data Parallelism — naive, hooks, bucketed.✅ Tensor Parallelism — MLP & attention, custom autograd ops.✅ Pipeline Parallelism — AFAB, 1F1B, Gantt charts.
1
0
0
@pguptacs
Prateek Gupta
19 days
So I built a series of toy implementations —.not for scale, but to explore how data, gradients, and layers move across devices. This is a playground for understanding:.- No clusters.- No DeepSpeed or Megatron.- Just PyTorch + Docker.
1
0
0
@pguptacs
Prateek Gupta
19 days
🧵 I just published:.Distributed Training for Dummies — a 4-part blog series. I was curious how LLMs are trained across 1000s of GPUs. Reading HuggingFace’s UltraScale Playbook helped —.but I wanted to see:.👉 What does this actually look like in code?.📚
1
0
0
@pguptacs
Prateek Gupta
1 month
I learnt a lot by “delving” into various topics around how word usage evolves with time — and learning about the fast-growing world of podcasting was fun. There is quite a lot of research being conducted to understand how podcast topics evolve with time.
0
0
0
@pguptacs
Prateek Gupta
1 month
I loved reading the empirical studies with societal impact, but this time I contributed to one. Our paper on the impact of ChatGPT on spoken communication ( just got featured in The Verge (.
Tweet card summary image
theverge.com
AI isn’t just impacting how we write — it’s changing how we speak and interact with others. And there’s only more to come.
1
0
0
@pguptacs
Prateek Gupta
9 months
Exciting to see our work come to life! This was such a fun collaboration. Looking forward to more insights!.
@nblqbl
Nabil Iqbal
9 months
Something a little bit different from my usual: with Andrea Ferrari and @pguptacs, we investigated whether we can use machine learning to find *dualities* in statistical physics. A short thread:
Tweet media one
0
1
4
@pguptacs
Prateek Gupta
9 months
RT @nblqbl: Something a little bit different from my usual: with Andrea Ferrari and @pguptacs, we investigated whether we can use machine l….
0
24
0
@pguptacs
Prateek Gupta
11 months
RT @hiromu1996: 📢 New Preprint!.Now it's known ChatGPT overuses words like 'delve' and 'adept.' This raises the possibility that through th….
0
10
0
@pguptacs
Prateek Gupta
2 years
Grateful for the opportunity to guide engineering minds at G-Research through the fascinating world of Attention, Transformers, and LLMs. The blend of academia and industry sparks great innovation! Let's keep the AI momentum going! #AI #LLMs
Tweet media one
0
1
6
@pguptacs
Prateek Gupta
2 years
Can't wait for the AI4GCC workshop! Over 2 years in the making, this competition bridges the gap between researchers and policymakers to tackle the lack of cooperation among nations. It's just the beginning of something big! #AI4GCC #bridgingthegap #researchers #policymakers.
@AI4ClimateCoop
ClimateAICompetition
2 years
📅 Save the Date!.Join us for the #AI4GCC2023 Workshop on April 26 at 10 a.m. EST. Delighted to have @alicelepissier & @a_tacchetti as our speakers. Witness teams showcasing innovative negotiation protocols! 🤖💡📎Join from here: #ClimateAction #AI
Tweet media one
0
0
1
@pguptacs
Prateek Gupta
2 years
RT @AI4ClimateCoop: 📅 Save the Date!.Join us for the #AI4GCC2023 Workshop on April 26 at 10 a.m. EST. Delighted to have @alicelepissier &….
0
4
0
@pguptacs
Prateek Gupta
3 years
RT @AI4ClimateCoop: We’re thrilled to announce that @DeepMind is now an official partner of AI4GCC, providing jury members and logistical s….
0
13
0
@pguptacs
Prateek Gupta
3 years
RT @DeepMind: Today in @Nature: #AlphaTensor, an AI system for discovering novel, efficient, and exact algorithms for matrix multiplication….
0
2K
0
@pguptacs
Prateek Gupta
3 years
RT @AI4ABM: We're excited to support the "AI for Global Climate Cooperation", a competition and collaborative research effort for everyone….
0
5
0