Sanyam Bhutani Profile Banner
Sanyam Bhutani Profile
Sanyam Bhutani

@bhutanisanyam1

Followers
34,654
Following
996
Media
933
Statuses
8,009

👨‍💻 Sr Data Scientist @h2oai | Previously: @weights_biases 🎙 Podcast Host @ctdsshow 👨‍🎓 International Fellow @fastdotai 🎲 Grandmaster @Kaggle

India
Joined October 2016
Don't wanna be here? Send us removal request.
Explore trending content on Musk Viewer
Pinned Tweet
@bhutanisanyam1
Sanyam Bhutani
2 years
This is the best week of my life 🙏 ✅ Reached Kaggle Grandmaster tier ✅ My ML Hero & Guru: @jeremyphoward was kind enough to host me for an interview about my journey I promise to continue creating ML content to the best of my ability & sincerely take up competitions next 🍵
@jeremyphoward
Jeremy Howard
2 years
This week I'm filling in for regular "chai time data science" podcast host @bhutanisanyam1 , with a very special interview with a recently-anointed Kaggle grandmaster...
10
40
267
19
10
268
@bhutanisanyam1
Sanyam Bhutani
1 year
“Transformers from scratch” by Brandon Rohrer 🤖 This is one of the best write ups, that starts from 0 and explains every single detail of the model architecture. Even if you need a refresher or don’t, I would still highly recommend reading it:
Tweet media one
38
555
2K
@bhutanisanyam1
Sanyam Bhutani
10 months
Easily the best paper on current State of LLMs! 🙏 A 50 page read but it’s not “just another” survey paper, that only documents facts. The authors actually add very useful commentary capturing all aspects of building Large Language Models. Hence the result is a collection of
Tweet media one
40
321
2K
@bhutanisanyam1
Sanyam Bhutani
2 years
An absolute masterclass by World's Top Data Scientists 🙏 The awesome Kaggle Grandmaster Team at NVIDIA shares their winning tips and tricks in this series:
Tweet media one
5
340
2K
@bhutanisanyam1
Sanyam Bhutani
4 years
Google Colab has now a Subscription model for Power users: - Faster GPUs - Longer runtimes - More memory It's for $9.99/Month. I know many power users might enjoy it:
23
380
1K
@bhutanisanyam1
Sanyam Bhutani
9 months
How to become an expert at any thing 🙏 I rediscovered this gem by @karpathy in my bookmarks today
Tweet media one
21
159
1K
@bhutanisanyam1
Sanyam Bhutani
9 months
My favourite LLM paper is finally open source! 🙏 Running a single Large Language Model agent is easy Running multiple is hard Running multiple over days of sustained interactions is really hard I’ve spent the last 3 days reading through the code of the paper that solved
Tweet media one
13
208
1K
@bhutanisanyam1
Sanyam Bhutani
1 year
This is the best resource to get started in NLP in 2023 🙏 In 2 days, I will be kicking off a weekly study group to learn with everyone: @_lewtun will kindly join us for an opening AMA
Tweet media one
18
120
967
@bhutanisanyam1
Sanyam Bhutani
9 months
CS 324 Notes are a LLM Book! 🙏 The Large Language Model course notes are a crispy book covering the foundations Perfectly structured like an onion, starting at an overview & then levelling up It’s Mixture of Expert section is one of the best:
Tweet media one
10
226
926
@bhutanisanyam1
Sanyam Bhutani
1 year
The best summary of Transformers and it’s evolution 🙏 I found @giffmana ’s slides from 2022 to be the best “pocket reference” on the topic. Many posts have covered Transformers however this one also covers the state of field before and how it got adopted to different domains.
Tweet media one
11
158
907
@bhutanisanyam1
Sanyam Bhutani
10 months
Best tutorial on setting up LLMs locally! 🙏 @Rob_Mulla made an end to end video teaching how to install, run with GUI and connect a Large Language Model to your own data on your own machine. All open source, running offline:
Tweet media one
13
158
864
@bhutanisanyam1
Sanyam Bhutani
1 year
Watching “State of GPT” by @karpathy is the best 40 minutes you will spend this week 🙏 I actually found it really helpful for filling a lot of my knowledge gaps: - Comparisons against human brain and LLM brain - Why prompting works and why is it helpful to ask a model to “be
Tweet media one
17
124
837
@bhutanisanyam1
Sanyam Bhutani
1 year
Officially wrapping up the @kaggle Top Solutions Series 🙏 I’ve hosted over 25 videos sharing and explaining tricks, secrets of Kagglers for all domains of machine learning. The series is quite complete & I’m graduating to more challenges:
Tweet media one
13
174
822
@bhutanisanyam1
Sanyam Bhutani
1 year
“Pretend you’re an Indian parent” 😂
Tweet media one
33
73
807
@bhutanisanyam1
Sanyam Bhutani
1 year
The best tutorials on building LLM powered applications 📚 @GregKamradt is an incredible teacher of @LangChainAI : ✅ Top down & applied series ✅ Amazing teaching style ✅ Very practical examples
Tweet media one
20
158
797
@bhutanisanyam1
Sanyam Bhutani
1 year
Papers I’ve read in the last month! 🙏 I’m currently writing a LLM roadmap along with my notes. If anyone is interested in reviewing and providing early feedback-please reach out!
Tweet media one
85
31
771
@bhutanisanyam1
Sanyam Bhutani
5 years
MAJOR personal update: I’ll be starting my work in a full time role @h2oai today as a Machine Learning Engineer and AI Content Creator! I’m really excited to be a part of a team of many of my “ML Heroes” and THE best kagglers. Recap on my ML journey:
73
60
749
@bhutanisanyam1
Sanyam Bhutani
8 months
Implementing LLaMA from scratch! 🙏 This implements a LLM in the style that Karpathy implemented nanoGPT Even though it focuses on LLaMA-1, it’s a refreshing code first read Perfect for a Sunday crispy read:
Tweet media one
5
160
728
@bhutanisanyam1
Sanyam Bhutani
1 year
Arxiv Chat: Chat w the latest papers 🙏 I made a really simple demo that makes it easy for me to understand the latest papers. The whole app is <100 lines of code: ✅ @LangChainAI for the main logic ✅ @h2oai Wave for the UI ✅ ChatGPT for asking Qs
27
93
730
@bhutanisanyam1
Sanyam Bhutani
4 years
I'll say this out loud since no one does. I studied CS at college, it didn't make me a better programmer. Practising Programming makes you a better programmer, not studying it. If you're starting your ML Journey,trust me a "CS background" won't be as helpful as practising code
36
89
714
@bhutanisanyam1
Sanyam Bhutani
11 months
Run 13B model on an iPhone! 🤯 Just finished reading @Tim_Dettmers ’ amazing work on SpQR. SpQR unlocks 3.35 bit quantisation which lets us run 33B models on 3090s and 13B models on an iPhone. Here are my notes from the paper: - Quantisation is basically like compressing the
Tweet media one
29
113
704
@bhutanisanyam1
Sanyam Bhutani
2 years
Finally got my @kaggle Grandmaster hoodie! 🙏🍵
Tweet media one
19
5
690
@bhutanisanyam1
Sanyam Bhutani
7 months
Insanely detailed notes on Training LLMs! 🙏 @StasBekman has shared his field notes on training foundational models. These are insanely detailed, cover a lot of gotchas and caveats. A crispy read with well documented code, in depth discussions:
Tweet media one
6
129
692
@bhutanisanyam1
Sanyam Bhutani
4 years
If you're looking for the best resources to prepare for ML interviews in 2020, Here's a wiki from the @fastdotai forums: Also, all contributions are welcomed!
Tweet media one
6
155
681
@bhutanisanyam1
Sanyam Bhutani
6 years
"Start by learning the basics really well [..] Most advanced research projects require you to be excellent at the basics [..] @AndrewYNg always told me to work on thorough mastery of these basics" Read the complete interview w @goodfellow_ian @hackernoon :
4
185
662
@bhutanisanyam1
Sanyam Bhutani
10 months
NLP for absolute beginners 🙏 @jeremyphoward kindly shared the Stanford materials which are incredibly high signal resource for NLP. Here’s another tutorial teaching you the absolute NLP basics upto how to make a submission on Kaggle, by Jeremy himself!
Tweet media one
3
142
649
@bhutanisanyam1
Sanyam Bhutani
1 year
We can now train a 7B model from scratch on a single GPU! 🤯 DeepSpeed Chat: a framework offering insane optimisations and speed ups for training RLHF models: ✅ Efficient and more affordable ✅ Insane Scalability ✅ Easy to use scripts
Tweet media one
9
117
618
@bhutanisanyam1
Sanyam Bhutani
9 months
Simulating a software company with LLMs! 🚀 Remember the 25 agents living in a simulation? This does the same but for a software company ChatDev asks the questions around effectively getting Large Language Model agents collaborate on writing entire code bases: - Writing a
30
123
622
@bhutanisanyam1
Sanyam Bhutani
2 years
I'm tea-ry eyed. I can't believe this :') I've reached the @kaggle Grandmaster tier today! Thank you so much everyone!🍵 + My sincerest gratitude to @jeremyphoward for introducing me to Kaggle and to @vopani for pushing me to pursue it! 🙏
65
15
614
@bhutanisanyam1
Sanyam Bhutani
2 years
The kind people @NVIDIAAI sent a welcome gift 🙏
Tweet media one
21
11
583
@bhutanisanyam1
Sanyam Bhutani
7 months
The definitive guide to RAG in production! 🙏 @GokuMohandas walks us through implementing RAG from scratch, building a scalable app It now has updated discussion on embedding fine-tuning, re-ranking and effectively routing requests I think this is easily the most complete
Tweet media one
12
92
575
@bhutanisanyam1
Sanyam Bhutani
1 year
Outperforming LLMs with 2000x smaller models! 🚀 “Distilling Step-by-Step!” is an incredible paper showcasing the promise of using CoT prompting with LLMs to generate steps of logical thinking and high quality labels that can produce great smaller models: ✅ Outperforms both
Tweet media one
7
117
566
@bhutanisanyam1
Sanyam Bhutani
8 months
My favourite LLM blogs! 🙏 For your weekend learning, here’s an opinionated list of my favourite Large Language Model educators Pick any or all of their articles and read them cover to cover
Tweet media one
16
84
562
@bhutanisanyam1
Sanyam Bhutani
1 year
The Deep Learning book study group 🙏 Starting this Saturday, we will be going through the Bible for understanding the basics of DL 📚
Tweet media one
12
52
554
@bhutanisanyam1
Sanyam Bhutani
4 years
This is just surreal! I just won the @hackernoon Contributor of the year award for 2 categories! 🙏🍵 - Machine Learning - Tutorial
Tweet media one
Tweet media two
44
31
548
@bhutanisanyam1
Sanyam Bhutani
9 months
CS 25 has a great roadmap of LLM papers! 🙏 Transformers United has great guest lectures spanning the foundations of Large Language Models An underrated aspect of the course is the curated list of papers on every topic Perfect for your weekend reads:
Tweet media one
10
122
548
@bhutanisanyam1
Sanyam Bhutani
11 months
Combining Knowledge Graphs and LLMs! 🙏 To my surprise, this paper is extremely detailed around training strategies of building such models and goes beyond “just prompting” ChatGPT and GPT-4. In fact, I realised after reading-it doesn’t even mention these models in most of the
Tweet media one
24
80
549
@bhutanisanyam1
Sanyam Bhutani
2 years
I finally got my copy of Deep Learning w Python by @fchollet ! 📚 I couldn't be more excited about this 🙏 I'll be starting a reading group on Jan 8, and Francois has kindly agreed to join for an AMA! 🍵 Please send your Qs around Keras/the book as here! Links TBD Soon!
Tweet media one
23
29
533
@bhutanisanyam1
Sanyam Bhutani
10 months
Easily one of the biggest announcements for DL! 🙏 @fchollet announced Keras 3.0, a complete re-write making Keras the front end for TF, JAX and PyTorch. This means some amazing things. My mentor and core contributor @A_K_Nain gave me a rundown: - Unified framework: This is
Tweet media one
11
83
530
@bhutanisanyam1
Sanyam Bhutani
8 months
A 50 page book on LLM Agents! 🙏 This is my new favourite survey paper It reads like a perfect book on the why we need different techniques to make Large Language Model agents work and how different papers approached it
Tweet media one
9
96
523
@bhutanisanyam1
Sanyam Bhutani
10 months
The most detailed and practical write up on applying LLMs! 🙏 This reads like a survey paper but written for the industry and applications @eugeneyan is known as the best NLP writer for a reason. It’s the most comprehensive overview of patterns on building Large Language Models
Tweet media one
7
97
510
@bhutanisanyam1
Sanyam Bhutani
1 year
CS 25: Transformers United! 🦾 One of the best courses covering concepts of Transformers in the context of LLMs along with applications and secrets to building these models. My favourite part is the guest lectures from the best of our field!
Tweet media one
7
108
495
@bhutanisanyam1
Sanyam Bhutani
1 year
After all the travelling, I’ve invested my remaining savings into more 3090 GPUs for Kaggle 🙏
Tweet media one
31
13
496
@bhutanisanyam1
Sanyam Bhutani
2 years
I finally met my ML hero who’s taught me everything: @jeremyphoward 🙏
Tweet media one
12
8
503
@bhutanisanyam1
Sanyam Bhutani
9 months
A masterpiece on applying LLM agents! 🙏 MetaGPT paper is a golden treat on effectively applying Large Language Model agents. It takes inspiration from how humans work. Here’s my summary: - Assembly line: Every agent has a role assigned to it - Software Engineering: The above
Tweet media one
12
84
490
@bhutanisanyam1
Sanyam Bhutani
5 years
Personal Update Thread on The @GoogleAI Residency: Earlier this year I got a life-changing email. My Google AI Residency application had made it to the final interview rounds! This spring, Google flew me out to NYC where I gave my "On-site" interviews.
15
44
460
@bhutanisanyam1
Sanyam Bhutani
10 months
Annotated PyTorch papers! 🔥 @labmlai has the best resources for learning how to really implement ideas This is a no-nonsense website with a side-by-side implementation of papers in @PyTorch The transformers section is my fav:
Tweet media one
9
123
452
@bhutanisanyam1
Sanyam Bhutani
4 years
"Machine Learning doesn’t have to be a black box anymore. What use is a good model if we cannot explain the results to others. Interpretability is as important as creating a model." A neat kernel on "Intrepreting Machine Learning models" by @pandeyparul
5
99
444
@bhutanisanyam1
Sanyam Bhutani
3 years
I'm super excited to share that I've joined @weights_biases ! 🍵 I've been a fan of their community since the early days, I'm really looking forward to contributing to it further. Please expect study groups, events, Kaggle deep dives, and much more! 🙏
56
20
444
@bhutanisanyam1
Sanyam Bhutani
4 years
The mindset of "Completing an online course" isn't right: It's not a college degree-hacking your way to completion shouldn't be the goal and def won't be helpful Take your time, even build a project midway: Gaining knowledge & building Projects/Solving Prob should be the goal!
12
55
440
@bhutanisanyam1
Sanyam Bhutani
9 months
Incredible recap of key Transformer concepts! 🙏 What I really like about this write up is it covers 30 key papers and flows really well as a recap @lilianweng has written so many incredible posts, this one captures all key architectural concepts: - Transformer basics:
Tweet media one
6
74
433
@bhutanisanyam1
Sanyam Bhutani
8 months
The most underrated LLM Cookbook! 🙏 @OpenAI ’s guide is an incredibly underrated resource My favourite bit is the practical advice and guides sprinkled throughout the examples It also has the highest quality code of many learning resources The examples cover all important
Tweet media one
4
80
425
@bhutanisanyam1
Sanyam Bhutani
5 years
Great tutorial on Deploying Deep Learning Models On Web And Mobile (Along with a working demo!) by @reshamas and Nidhin P. They've used the library, but the tutorial can be used to create a web and mobile app using any framework
1
105
427
@bhutanisanyam1
Sanyam Bhutani
1 year
GitHub GPT: Understand any repository! 🚀 Here is a demo where I played with connecting GPT-4 to any repository. The main logic is <20 lines of code: ✅ @LangChainAI for the main logic ✅ @activeloopai for storing embeddings ✅ Simple App that runs in the terminal
15
62
423
@bhutanisanyam1
Sanyam Bhutani
10 months
I’m in happy tears to awarded “Top GenAI Scientist” award by @AnalyticsVidhya 🙏 I feel really honoured by the recognition. Will make this one count!
Tweet media one
31
9
422
@bhutanisanyam1
Sanyam Bhutani
1 year
A truly open source assistant chabot: GPT4All-J 🙏 A new model based was shipped to the GPT4All family. This one permits commercial usage and is completely open source: ✅ Model weights ✅ Training logs ✅ Training dataset
6
96
408
@bhutanisanyam1
Sanyam Bhutani
1 year
The NLP study group is back after a break 🙏 Today, I’ll explain and summarise the BloombergGPT paper. This was an incredible read since the authors have kindly shared a fair bit of model details along with the reasons for their architectural choices:
Tweet media one
5
46
409
@bhutanisanyam1
Sanyam Bhutani
8 months
Another great roadmap of LLM papers! 👌 CS224n has a really good curated list of papers to read for Large Language Models I would recommend starting with the papers before the slides and lectures:
Tweet media one
0
72
407
@bhutanisanyam1
Sanyam Bhutani
5 months
I’m writing a guide on building Multi-GPU machines! 🙏 Over the past few years, I’ve spent a lot of time learning how to build ML servers I’ve decided to write a guide on the topic What questions/topics would you want covered? TIA!
Tweet media one
58
37
398
@bhutanisanyam1
Sanyam Bhutani
3 months
A perfect intro to open source LLMs! 🙏 The course by @asangani7 is now my top recommendation for getting started with Large Language Models: - Just enough theory for a whole picture - Teaches prompting, special tokens and conversational agents - Perfectly abstracts the
Tweet media one
0
56
338
@bhutanisanyam1
Sanyam Bhutani
7 months
Efficient Deep Learning course! 👌 The lectures cover various techniques relevant to LLMs Happy Sunday learning:
Tweet media one
1
54
395
@bhutanisanyam1
Sanyam Bhutani
8 months
The best NLP lectures! 🙏 @chrmanning ’s latest CS224n lectures are finally live! The 14 hours of new content covers Large Language Models, Interpretability, and some crispy framework tutorials:
Tweet media one
4
76
393
@bhutanisanyam1
Sanyam Bhutani
2 years
My next goal: I will spend at least 500 hours this year competing on @kaggle 🍵 If I fail to do it, I will not drink chai for an entire year and giveaway all my GPUs 🙏
43
12
381
@bhutanisanyam1
Sanyam Bhutani
1 year
Studying the LLaMA paper at Little LLaMA Café 😋🙏
Tweet media one
17
7
382
@bhutanisanyam1
Sanyam Bhutani
10 months
The definitive guide to Multimodal deep learning! 🙏 Since the GPT-4 demo, multimodal has become one of the coolest domains in our field. This is a 240 page no-nonsense book to the domain, it starts from the basics of individual modalities upto the key details of the domain.
Tweet media one
5
72
381
@bhutanisanyam1
Sanyam Bhutani
10 months
Very practical course on applying LLMs! 🙏 @HamelHusain had mentioned that langchain makes for a great cookbook of cutting edge ideas. This course is a refreshingly applied one teaching how to use @LangChainAI to build different applications. My favourite part is it’s
Tweet media one
6
63
370
@bhutanisanyam1
Sanyam Bhutani
8 months
The most comprehensive series I’ve read on Vector databases! 💾 Most of us got exposed to vector dbs via Langchain or llamaindex documentation However, There’s a lot of nuance and options to select from when building Large Language Model apps @tech_optimist has written a 4
Tweet media one
9
59
372
@bhutanisanyam1
Sanyam Bhutani
1 year
If you’re interested in diving into ML research & understanding more papers this year: The Deep Learning book is an incredible resource teaching the basics & math behind DL I’ve created a 5 part series explaining chapters here:
Tweet media one
4
56
365
@bhutanisanyam1
Sanyam Bhutani
1 year
Terrific tutorial on fine-tuning LLMs to your own data 👨‍🔬 Tomas Bratnic has shared a really crispy write up on creating a Cypher generating LLM: ✅ All Open Source tools ✅ Walkthrough of setup ✅ Detailed steps on how to solve this @h2oai LLMStudio
5
66
360
@bhutanisanyam1
Sanyam Bhutani
9 months
Masterclass of Pythonic Thinking: PyTudes 🤌 Its the highest quality resource for learning “the Pythonic way” and problem solving The large number of problems cater to everyone at all levels Every revisit, there’s something new to learn
Tweet media one
2
76
362
@bhutanisanyam1
Sanyam Bhutani
2 years
Today is a glorious day for @kaggle community! 🍵 Kaggle legend: @sudalairajkumar has conquered all categories and become the newest 4x Grandmaster! 🙏
Tweet media one
11
19
357
@bhutanisanyam1
Sanyam Bhutani
1 year
“A cookbook of Self-Supervised Learning” @ylecun et al👩‍🍳👨‍🍳 SSL is the tasty sauce behind a lot of the success in Language models, Computer Vision and beyond. It permits working with limited data by allowing you to include unlabelled data in your workflow. Hence becoming “the
Tweet media one
5
64
341
@bhutanisanyam1
Sanyam Bhutani
1 year
NLP with Transformers Study Group 🤗 Starting next week, I’m hosting a study group on the absolute gem book by @huggingface team 🙏 @_lewtun has kindly agreed to join the kickoff session. I can’t think of a better way to learn NLP:
Tweet media one
10
48
337
@bhutanisanyam1
Sanyam Bhutani
8 months
AutoAgents: Autonomously generate LLM agents for any goal! 🤖 This tries to solve the need for strong prompting and role definition by autogenerating agents The code is sparsely documented but readable:
5
77
335
@bhutanisanyam1
Sanyam Bhutani
2 years
Weekly @kaggle Top Solutions Study group 🙏 Starting this Sunday, I will be going through top solutions of recently ended competitions that might be relevant to the ongoing ones:
Tweet media one
4
58
330
@bhutanisanyam1
Sanyam Bhutani
1 year
Am I doing @karpathy and chill right? Cafe overlooking Himalayas, tasty breakfast and lecture rewatch 😋🙏
Tweet media one
19
2
328
@bhutanisanyam1
Sanyam Bhutani
5 years
This is THE BEST CAREER ADVICE for Data Science that I’ve ever read: IMO @kaggle forums often have write ups/advice of *much* higher quality than most of the blogposts out there I’d highly recommend reading all of @ryan_chesler ’s write ups on Kaggle.
0
70
326
@bhutanisanyam1
Sanyam Bhutani
1 year
A hands on guide to train LLaMA with RLHF 🤗 It’s one of the most complete tutorials on the topic with detailed explanations around why and how to follow the fine-tuning approaches.
2
76
326
@bhutanisanyam1
Sanyam Bhutani
6 months
This really helped me understand why LLMs work! 🙏 - Why next word prediction is powerful - Why prompting works - What we know about emergence Thanks @_jasonwei for the gem:
Tweet media one
1
39
328
@bhutanisanyam1
Sanyam Bhutani
3 years
The next 30 days, everyday w/o exception, I will: - Wake up at 4 AM - Workout for 2 hr - Kaggle for 3 hr - Move onto other tasks after 10 AM Extra rules: - No 📱before 5 PM - No Emails before 12 PM I'll post a video announcing the micro-resolution tom
22
17
321
@bhutanisanyam1
Sanyam Bhutani
2 years
🧵 Top Kaggle solutions always feature many great insights and hidden details: I spent the past two days reading top solutions from the recently ended Great Barrier reef competition There were many fascinating tricks shared, my short summary TL;DR👇
4
49
318
@bhutanisanyam1
Sanyam Bhutani
1 year
This is best Prompt Engineering guide 🙏 @omarsar0 and team have kindly been curating a very extensive and complete guide on the topic. Like anything covered by @dair_ai , it’s really high quality: ✅ Introduction & Basics ✅ Zero-Shot, Few-Shot ✅ Chain of Thought ✅ ReAct
14
55
318
@bhutanisanyam1
Sanyam Bhutani
11 months
The first open source Financial LLM! 🚀 BloombergGPT was the first proprietary financial LLM. This week, we witness the first open source one. FinLLM takes a “data centric” approach towards finance by building on top of multiple APIs/resources. Here’s an overview of its
Tweet media one
4
48
313
@bhutanisanyam1
Sanyam Bhutani
1 year
My 15 day LLM Study Vacation! 🚀 The plan for next 2 weeks: ✅ Hike/Visit a Himalayan mountain daily with a paper to read ✅ Build 15 @LangChainAI apps ✅ Finish catching up on LLM research
Tweet media one
22
12
314
@bhutanisanyam1
Sanyam Bhutani
3 years
I just sat for 5 minutes straight smiling and trying not tear up after today's interview. It's done. I was able to record enough interviews to complete my dream goal of publishing 2 episodes-every Sunday and Thursday, at 9AM PT. No exceptions in 2020.
Tweet media one
20
9
313
@bhutanisanyam1
Sanyam Bhutani
9 months
An extremely crispy intro to Vector Databases! 🙏 Have you watched those wired videos explaining concepts at incremental levels of detail? @helloiamleonie done the same for Vector Dbs She teaches the topic using Feynman technique, in 3 levels of detail
Tweet media one
5
52
308
@bhutanisanyam1
Sanyam Bhutani
7 months
The hardest challenge I’ve done! 🙏 Last week, I completed 200 days of writing everyday about Large Language Models After ~2k hours of learning, I’m ready to make crispy videos:
Tweet media one
19
16
310
@bhutanisanyam1
Sanyam Bhutani
3 years
If you're looking for a code first NLP course 👨‍💻 There is an NLP course by @fastdotai covering 🕵️‍♂️ - What is NLP - Topic Modelling - Sentiment Classification - Regex - LM - RNNs - Transformers - Bias & Ethics Blog: YT Playlist:
5
64
302
@bhutanisanyam1
Sanyam Bhutani
9 months
Strong LLM blog recommendation! 🙏 For any practitioner interested in Large Language Models, this is the best blog @eugeneyan is magical at combining industrial patterns, experiments & research ideas very clearly His weekend exp are my fav read:
Tweet media one
3
40
304
@bhutanisanyam1
Sanyam Bhutani
8 months
This is my favourite type of tutorial! 🙏 Remember the awesome fastai tutorials that share only the necessary theory and quickly dive into applying it? @Sentdex teaches us QLoRA in the exact manner by applying it to give llama-2 more personality
Tweet media one
6
42
301
@bhutanisanyam1
Sanyam Bhutani
2 years
2022 Goals: 1 Workout for 350 hours 2 Compete on @kaggle for 200 Hours 3 Write 3 Kaggle Kernels 4 Host 50 meetups & interviews @weights_biases 5 Spend 500 hours reading 6 Write 4 High-Quality @PyTorch blogposts 7 Publish 1 Open Source Repo
11
15
295
@bhutanisanyam1
Sanyam Bhutani
1 year
ControlNet + @PyTorch 🔥 Many thanks to @nischay_twt for tag-teaming 🙏
8
30
288
@bhutanisanyam1
Sanyam Bhutani
1 year
150 interview with ML heroes 🙏 I’ve actively hosted interviews with the best Kagglers, Researchers and practitioners from 2019-22. The questions discuss the guest’s journey and approach in a timeless and educational way:
Tweet media one
7
50
292
@bhutanisanyam1
Sanyam Bhutani
4 years
Huge congratulations to @pandeyparul on becoming the 1st Woman @kaggle Kernels Grandmaster from India. And to the best of my knowledge, 2nd one in the world. Although, I really hope she won't stop sharing her amazing kernels with us🍵
Tweet media one
4
22
288
@bhutanisanyam1
Sanyam Bhutani
1 year
Tutorial on how to efficiently read papers 🙏 For today’s NLP study group, I will show you how to parse papers by going through the LLaMa paper 🦙 We’ll also learn text generation with 🤗 transformers
Tweet media one
2
40
287
@bhutanisanyam1
Sanyam Bhutani
1 year
Official guide on the GPT-4 API 👨‍🍳 @OpenAI cookbook has a set of crispy examples teaching how to build on the API. While, it doesn’t have a “0 to 100” flow, it’s worth going through in an evening. The code is very readable.
Tweet media one
3
48
286
@bhutanisanyam1
Sanyam Bhutani
10 months
The biggest LLM release!🚨 @AnthropicAI have given us the “dream” Large Language model with Claude 2: - Trained till early 2023 - Context length upto 200k tokens - Ability to upload documents - Better code abilities - Free beta for public Please see my demo below of comparing
5
48
285
@bhutanisanyam1
Sanyam Bhutani
1 year
20 really high quality channels to learn ML & Python from 📚👨‍💻 This is a list of my favourite creators that produce high signal technical education content:
Tweet media one
12
50
285
@bhutanisanyam1
Sanyam Bhutani
1 year
This was another gem of advice by @karpathy 🙏
Tweet media one
3
22
279