serious_mehta Profile Banner
Tanay Mehta Profile
Tanay Mehta

@serious_mehta

Followers
10K
Following
39K
Media
440
Statuses
6K

AI eng @Aleph__Alpha | prev: Postgraduate @UniofBath | @Kaggle Notebooks Grandmaster | in the CUDA trenches | opinions are my own

Joined July 2020
Don't wanna be here? Send us removal request.
@serious_mehta
Tanay Mehta
4 months
are people actually so stupid? what do they think Deepseek-r1 was trained on? potato batteries?.
@spectatorindex
The Spectator Index
4 months
BREAKING: NVIDIA is now down 10% in premarket trading.
1K
2K
47K
@serious_mehta
Tanay Mehta
4 months
@tugot17 everytime someone sells a share of TSMC, the average IQ dips by 2 points.
20
41
3K
@serious_mehta
Tanay Mehta
4 months
@Samhanknr Oh that I agree with, but people selling $NVDA because they think Deepseek somehow bypassed GPU requirement is hilarious. If anything it shows that $NVDA should be more valued because even with export ban, Chinese companies are able to get their hands on Nvidia chips.
65
43
2K
@serious_mehta
Tanay Mehta
2 years
@stats_feed 1. Parts of China were colonies of several European nations.2. Nepal and Bhutan, though not directly controlled by Britain, they were British Protectorates.3. Liberia was an American colony (so technically not Europe but they were colonised).4. Mongolia was under the control of.
85
31
1K
@serious_mehta
Tanay Mehta
16 days
I think one mistake I made when learning ML during my undergrad was that I had put far too much focus and time on implementing obsolete ML algorithms from scratch instead of using that time to focus on learning ML performance optimisation and CUDA.
18
62
1K
@serious_mehta
Tanay Mehta
2 months
I scored 7 marks in JEE Mains in 2018 and ended up going to a tier-3 private university no one had ever heard of. Most people around me thought I would end up in a 7 LPA job somewhere, if I am lucky. I just kept solving problems, one at a time. Never kill yourself.
@Some1UKnow25
Raghav
2 months
I failed JEE badly my rank was like 3lakh smth. I haven't achieved much till now but surely have done quite well (sih24 win, 2x internships) & life is going pretty well & I know I will figure everything out eventually. This JEE race has completely hollowed down the Indian youth
Tweet media one
30
46
991
@serious_mehta
Tanay Mehta
1 year
I don't wanna get a PhD but wanna work as a Machine Learning Engineer. Dilemma of the Century.
73
45
808
@serious_mehta
Tanay Mehta
11 months
I’m convinced ML math is easier than dating in this day and age lol.
48
48
759
@serious_mehta
Tanay Mehta
9 months
Done with my Masters, secured a job, and now waiting for a visa which means I am now on a well-deserved vacation after an year of non-stop grind. Problem is, I am so addicted to working towards something that I now dread vacations, and it only took me 6 hours to realise that.
23
8
612
@serious_mehta
Tanay Mehta
2 months
to all the new folks who followed me in the last two days - sorry lads, there won't be any JEE rant posting . I am a 24 yo ai engineer who has long grown out of my JEE-hate era (sorry to disappoint?). however, do stick around if you are into AI, CompSci and low-level sys eng.
6
7
573
@serious_mehta
Tanay Mehta
3 years
I never for once imagined that I'll be writing these lines on a piece of paper but apparently Indian CS degree had other plans in mind.
Tweet media one
47
24
562
@serious_mehta
Tanay Mehta
3 years
Indian colleges should also have Open-Source contribution as an option for the Final Year major project. Like say, the rules can be that the Open-Source project should have more than 100 stars and the contribution should be of more than 100 lines of code and should be merged.
37
45
534
@serious_mehta
Tanay Mehta
2 years
@UpdatingOnRome Oh I miss those times when the Germanic people fought with the Romans over the control of Moscow.
2
2
506
@serious_mehta
Tanay Mehta
2 years
@O42nl Imagine multiplying matrices like hell, left and right, only for the results to start with, “As a Large Language Model trained by OpenAI, I cannot…”.
9
12
481
@serious_mehta
Tanay Mehta
1 year
What I read: Globalisation is making Indian youth realise they deserve more which is causing considerable concern among the Indian companies who now can’t exploit the gullible for laughable pay, even by Indian standards.
@sadaashree
Lone Wolf Ratnakar
1 year
Interviewing candidates for our firm, most of them demanding very high salaries. Now that is not an issue, if these people were extraordinarily brilliant, or some IIT, NIT passout. Most of them are from ordinary engineering college, and forget about being extraordinary, they are.
0
52
474
@serious_mehta
Tanay Mehta
3 years
Is there a Linux distribution that comes pre-installed with all Data Science and Machine Learning libraries and Conda and everything exactly in place?. The aim would be to resume working on your ML work within 30 minutes of installing the OS. I would honestly pay for that.
60
40
470
@serious_mehta
Tanay Mehta
9 months
Came in the mail today. Time to be CUDA-Cracked
Tweet media one
20
14
447
@serious_mehta
Tanay Mehta
1 year
It’s currently 1.53 AM and I am reading the RWKV paper at Abu Dhabi International Airport waiting for my connecting flight. The grind will not stop, not even at midnight.
Tweet media one
22
7
443
@serious_mehta
Tanay Mehta
8 days
@hkproj I am sorry Umar, but you are wrong here. I can live all I want for my "ideals" until a terrorist comes to my home, asks me to remove my pants and then shoot me because I am a Hindu in front of my loved ones. My maternal great grandfather was killed the same way by Pakistani.
4
8
454
@serious_mehta
Tanay Mehta
3 years
Did I just get a 10 GPA in my final semester of CS??? 🤯🤯
Tweet media one
48
9
421
@serious_mehta
Tanay Mehta
11 months
Gonna focus *solely* on getting better at ML engineering and core CS for the next three months until my masters graduation in early October. It's time to be very, very cracked. Also, looking forward to chatting with other cool MLEs / research engineers in the domain!.
11
8
387
@serious_mehta
Tanay Mehta
1 year
Should I make a YouTube channel where I share, among other things, how to contribute to Open Source Machine Learning projects + other Machine Learning and NLP knowledge?.
46
8
361
@serious_mehta
Tanay Mehta
2 years
@stats_feed source: trust me bro.
11
0
354
@serious_mehta
Tanay Mehta
3 years
Practical ML Idea: An ML model that can generate a good “Subject” based on the email's body. I don't know why, but it's always a pain for me to write good email subjects.
24
21
340
@serious_mehta
Tanay Mehta
2 years
@Gerashchenko_en Buy 1 - Get 1 FREE.
2
8
299
@serious_mehta
Tanay Mehta
3 years
If you, like me, get turned on by heavily Optimized and memory-efficient PyTorch code then you are in luck because I am in process of writing a thread on how to write super-duper efficient PyTorch code that can run pretty large models on Colab & Kaggle Kernels 🙃.
14
18
331
@serious_mehta
Tanay Mehta
4 months
@Ashwin_S18 Quite the opposite, no?.If even one independent company replicates Deepseek-r1 and verifies a training cost in under $6M, this will mean even lesser funded startups can compete. They may even be able to distill the RL trained models too (not the SFT models like Deepseek has done).
17
4
302
@serious_mehta
Tanay Mehta
2 years
@HSajwanization No it’s not?! Our talent, our money, our engineering. Period.
1
1
274
@serious_mehta
Tanay Mehta
3 years
This morning, I became a Kaggle Grandmaster ✨🥺. Looking forward to continuing my work on making informative training and inference kernels in different competitions and datasets and helping the community 🚀. Also, I don't think someone has ever become a GM at #69th rank 😅
Tweet media one
26
4
288
@serious_mehta
Tanay Mehta
3 years
Github Co-pilot for Jupyter Notebooks, please!!.
13
17
277
@serious_mehta
Tanay Mehta
11 months
Machine learning mustn’t stop, not even in empty university halls during vacations. Build the future or die trying.
Tweet media one
9
10
273
@serious_mehta
Tanay Mehta
4 years
ML Research Idea: Model that can annotate and explain ML papers.
9
27
262
@serious_mehta
Tanay Mehta
3 years
I turned down an offer from a company in ML and AI space recently. Why? Because despite being Open Source itself, one of the conditions was that I will not be allowed to do Open Source contributions anywhere else while I am an employee. Seriously?. Not trading my freedom!.
17
10
250
@serious_mehta
Tanay Mehta
3 years
Let's make 2022 the year for Open Source Contributions 🚀.
8
15
240
@serious_mehta
Tanay Mehta
2 years
@TheAnkurTyagi No amount of experience justifies a 3 LPA job in India in 2023. A roadside pani puri seller earns way more than that. For the love of god please don’t justify these low paying exploitative jobs and toxic work culture as “great potential to grow”.
6
4
231
@serious_mehta
Tanay Mehta
3 years
Kept this under the wraps for some time but saying this out loud now: I got acceptance for a Master in CS from a German TU last week 🥺🌟.
45
2
243
@serious_mehta
Tanay Mehta
2 years
Here you go! I have published my GPT training notebook on kaggle. It features a *new* way of Data loading using PyTorch data loaders and is powered by @LightningAI for quick, clean and elegant model training along with @weights_biases logging!.
4
35
241
@serious_mehta
Tanay Mehta
15 days
They are gonna ruin entire batches of Comp Sci students aren't they. Students SHOULD learn to code and reason by themselves to develop the critical thinking ability. Now they will be writing code they don't themselves understand.
@cursor_ai
Cursor
16 days
Cursor is now free for students. Enjoy!.
8
6
243
@serious_mehta
Tanay Mehta
3 years
Super happy to announce that I just became an Open Source contributor @huggingface transformers🤗. I have added the Poolformer model (from paper: "Metaformer is actually all you need for Vision") by Sea AI Labs. Example snippet down below 👇
Tweet media one
16
22
230
@serious_mehta
Tanay Mehta
1 year
When Machine Learning meets esoteric Hinduism.
@nonvonmon
computerist
1 year
Tweet media one
1
12
221
@serious_mehta
Tanay Mehta
3 years
If you are someone who wants to start reading Research papers in ML and looking for some motivation to start, then I have something for you!. Here's the Notion database of the papers I've read with their summaries. You can follow my learning journey!.
8
41
229
@serious_mehta
Tanay Mehta
1 year
Now I have become SGD, accelerator of negative gradient down the slope
Tweet media one
8
11
215
@serious_mehta
Tanay Mehta
3 years
I recently switched to a MacBook Pro and I am already so surprised by its battery life!?. 5 Chrome tabs, including Youtube playing music for about an hour, and my battery is down from 100% to 98%??.What sorcery is it?.
25
5
217
@serious_mehta
Tanay Mehta
1 year
Got my acceptance at Oxford Machine Learning Summer School 2024, in-person mode!. Can’t wait to meet you all amazing people this Summer at Oxford 🚀
Tweet media one
14
2
203
@serious_mehta
Tanay Mehta
3 years
Just found out that @kaggle has actually included one of my notebooks that used Jax + @huggingface transformers + @weights_biases tracking for Sentiment Classification as one of the example notebooks for the upcoming Google Open-Source Expert Prize!.
14
7
202
@serious_mehta
Tanay Mehta
7 months
When fine-tuning an LM with newly added tokens, set the new token embedding to be the average of existing embeddings, which bounds the KL-divergence. This will make fine-tuning smoother. I added this in my last @huggingface transformers contribution:.
11
13
196
@serious_mehta
Tanay Mehta
3 months
@deepseek_ai You see boys, THIS is Open AI.
2
2
191
@serious_mehta
Tanay Mehta
3 years
-> Got out of post-COVID depression.-> Became a Kaggle Master and now in top-100 in the category.-> Bagged an ML Internship from NVIDIA.-> Improved my Grade point average by .4 points in one go.-> Got featured in my college's annual tech newsletter for the 3rd consecutive time!.
13
6
174
@serious_mehta
Tanay Mehta
6 months
@therealnaomib Dumbest take I have read this week. “Have high pollution, stop growing” is like saying “Have trouble understanding maths, start solving easier problems”.
1
1
171
@serious_mehta
Tanay Mehta
2 years
When you have no clue what LLMs actually are and what we mean by parameters but you still tweet —.
@danapke
Daniel Apke
2 years
@nhutter28 Here is a scary look at where we are vs the knowledge GPT 4 will have.
Tweet media one
13
8
179
@serious_mehta
Tanay Mehta
2 years
@latestinspace Just imagine how big a star that once was if it "collapsed" and can still fit 30 Billion suns!.
8
3
171
@serious_mehta
Tanay Mehta
4 years
Finally, I am a Research Intern!.
0
0
171
@serious_mehta
Tanay Mehta
3 years
Got offered an ML Intern position at a non-Indian company. How much are they gonna pay me a month for 40 hours per week? . $225. Yup. My (remarkably excellent) plumbing guy makes more than that 🤯. Workplace exploitation is real folks.
13
5
162
@serious_mehta
Tanay Mehta
2 years
Officer on UK Border Immigration yesterday: “…So you have worked in AI right? Do you think we can deploy AI on Blockchain?”. Me: *proceeds to explain him how Blockchain, GPT and generally LLMs work for 20 straight mins*. Him: “😳 Have a fantastic education, sir! *stamped*”. did.
11
2
164
@serious_mehta
Tanay Mehta
1 year
Announcing the LLM Adventures Notebook series on @kaggle, where I will be making notebooks on various interesting use-cases of LLMs and RAG pipelines using Open LLMs and datasets from Kaggle ✨. Check it out:
Tweet media one
1
26
164
@serious_mehta
Tanay Mehta
11 months
Time to be cracked af, re-starting the Linear Algebra and Stats playlists from MIT OCW to go back to the basics.
3
1
163
@serious_mehta
Tanay Mehta
1 year
One of those days
Tweet media one
5
1
151
@serious_mehta
Tanay Mehta
4 months
Graduated yesterday with a distinction and stuff
Tweet media one
16
0
158
@serious_mehta
Tanay Mehta
1 year
I remember learning CUDA C like 2 years ago and then never using it after my internship at NVIDIA was done. Biggest mistake. Now I am going to learn it again (because ✨TRITON✨) and it feels like starting from scratch :(.
6
1
154
@serious_mehta
Tanay Mehta
2 years
So technically I have worked at FAANG?!.
@mbdailyshow
Morning Brew Daily
2 years
How it started How it's going
Tweet media one
Tweet media two
2
1
147
@serious_mehta
Tanay Mehta
2 years
The notebook that followed my talk in London is now out!. If you want to understand, code and train your own GPT, take a look at it! Modify it, pull it apart and change it as you see fit 🚀. The data loading part is now chill thanks to @lancedb!.
2
35
149
@serious_mehta
Tanay Mehta
3 years
I've published yet another PyTorch training notebook in the AI4Code competition on #Kaggle. This one's using Microsoft's CodeBERT model (thanks to @huggingface🤗). It includes optimizations, a trainer module and @weights_biases logging & exp. tracking 🚀.
1
17
146
@serious_mehta
Tanay Mehta
3 years
Update on an experiment I did a month ago: I asked here on twitter if I should add a separate "Open Source Contributions" section on my CV, which I did. Of the 3 companies I had applied to with this new CV, I received an initial interview call from 2. Success!. cc @amuldotexe.
5
4
145
@serious_mehta
Tanay Mehta
2 years
Thrilled to announce that I will be joining @tuBraunschweig as a Masters's Student in Data Science for the Summer Semester of 2023!. Can't wait to move to Germany and embark on this new journey 🚀
Tweet media one
22
0
142
@serious_mehta
Tanay Mehta
1 year
Although the video is coming soon; if you want to pre-train or fine-tune the Mamba model as a Code-completion LLM (like Github Copilot) using a Lance dataset, I have created a repository with training scripts for you 🚀. Only supports single GPU for now .
4
19
135
@serious_mehta
Tanay Mehta
2 years
@Ravisutanjani I don’t know what’s worse, people doing the actual thing or the ones justifying it in this comment section.
3
6
129
@serious_mehta
Tanay Mehta
1 year
Oh Istanbul, you have my heart ❤️🇹🇷
Tweet media one
Tweet media two
Tweet media three
Tweet media four
12
3
134
@serious_mehta
Tanay Mehta
4 years
Started my transition from PyTorch to Jax-Flax. Hoping to complete this transition in a few months. Also, will soon start pushing dominantly JAX-based TPU notebooks on Kaggle, details will follow!.
5
6
135
@serious_mehta
Tanay Mehta
4 years
Ok hear me out: "Code Reading Groups". Just like Paper Reading groups, except we collectively read and try to make sense of big opensource projects like Tensorflow and PyTorch. I'm sure I'm not the only one who tried reading code from such a repo and couldn't understand anything.
17
4
130
@serious_mehta
Tanay Mehta
3 years
My Pull Request for adding the Hinge Loss function to @DeepMind's Optax has been merged today! Going to add many more loss functions to Optax (for all you JAX geeks out there 😉)
Tweet media one
6
3
131
@serious_mehta
Tanay Mehta
3 years
Final exams of my engineering degree are over. Tanay is a free elf now 🌚.
12
0
123
@serious_mehta
Tanay Mehta
2 years
OMG! Kaggle Models is finally a thing 😍. I remember talking to @Rob13Ell a few months ago where we brainstormed a lot of interesting ways this could turn out and it's so fun seeing it live after all!. Job well-done @kaggle team 👏🏼
Tweet media one
5
11
126
@serious_mehta
Tanay Mehta
8 months
EU Blue Card visa 🇪🇺 issued, moving to Berlin 🇩🇪 in a couple weeks!.
11
3
124
@serious_mehta
Tanay Mehta
1 year
life update: reinforcement learning assignment has me coding 12+ hours everyday for the last week. what even is this.
13
0
119
@serious_mehta
Tanay Mehta
2 years
It's official: I am now a Computer Science graduate! Time to celebrate with a slice of pizza and a well-deserved Netflix marathon🍕 . But seriously, all jokes aside, I'm grateful to have made it through this program and can't wait to see what the future holds 🚀
Tweet media one
13
1
121
@serious_mehta
Tanay Mehta
2 years
Been writing CUDA kernels since 5 AM, I can't see straight anymore someone please send help.
10
1
117
@serious_mehta
Tanay Mehta
2 years
What's common is that they all left India to make a better life for themselves because our terrible politics, workplace exploitation, casteism and misogny won't let them build one here.
@varinder_bansal
Varinder Bansal 🇮🇳
2 years
What’s common???. CEO of Google .CEO of Microsoft.CEO of Adobe.CEO of YouTube.CEO of Mastercard.CEO of Pepsi.CEO of IBM.CEO of Netapp.CEO of Nokia.CEO of Novartis .CEO of Deloitte.
7
6
118
@serious_mehta
Tanay Mehta
3 years
The real reason why I am doing Open source contributions aggressively is so that I can have all these organizations as infinity stones xD
Tweet media one
2
2
111
@serious_mehta
Tanay Mehta
1 year
Sundays are for creating datasets using @lancedb on @LightningAI Studio so they can be released on Monday 🚀⚡️
Tweet media one
4
7
107
@serious_mehta
Tanay Mehta
1 year
The grind stoppeth not. Open Source ML 🙌🏻
Tweet media one
6
1
99
@serious_mehta
Tanay Mehta
2 years
@JohnArnoldFndtn “that’s how an RBMK reactor explodes”.
3
0
97
@serious_mehta
Tanay Mehta
1 year
@cmizzy1 Well for starters, we weren’t going through major historical events like every other week.
3
0
95
@serious_mehta
Tanay Mehta
5 months
A weekend in Paris and I want to visit again but for longer. Maybe meet with ML community here next time!
Tweet media one
Tweet media two
Tweet media three
Tweet media four
3
2
96
@serious_mehta
Tanay Mehta
11 months
@WagmanSam It’s the Chamber of Secrets if you ordered it from Temu.
0
0
91
@serious_mehta
Tanay Mehta
4 years
Twitter fam, I need suggestion for an Ubuntu based distribution! .I am going to wipe Windows 11 this weekend and install an Ubuntu based distro so please suggest your personal favorites!. P.S: No, I don't need to try Arch Linux, Linux Mint or Vanilla Debian, please stay away 😂.
66
4
97
@serious_mehta
Tanay Mehta
4 years
Writing PyTorch training pipelines is something I will *never* get bored of.
6
2
91
@serious_mehta
Tanay Mehta
2 months
@Txz67 I was working for US and Israeli startups when I was in India during my undergrad. And I paid for much of my foreign college tuition with the money I saved from working for the startups during Undergrad and Postgrad. Try harder.
3
3
98
@serious_mehta
Tanay Mehta
4 years
Staying up all night in a hackathon, coding with people while interacting and making friends was such a nice feeling. It all feels like a long time ago. Honestly, if you are not on a video call with your team, staying up at night in an online hackathon, you are missing out!.
5
6
93
@serious_mehta
Tanay Mehta
3 years
These Product design and UI/UX people are pure artists 💯.
6
11
87
@serious_mehta
Tanay Mehta
4 years
We. Need. More. Jax. Tutorials.
5
1
95
@serious_mehta
Tanay Mehta
3 years
Everyone! Quickly enjoy your life, GITHUB IS DOWN. I REPEAT GITHUB IS DOWN
Tweet media one
4
22
92
@serious_mehta
Tanay Mehta
12 days
@sidhant @CapitalsLad Me and him are not that different.
1
0
92