Tanay Mehta @serious_mehta profile

Tanay Mehta

@serious_mehta

Followers

10K

Following

39K

Media

440

Statuses

6K

AI eng @Aleph__Alpha | prev: Postgraduate @UniofBath | @Kaggle Notebooks Grandmaster | in the CUDA trenches | opinions are my own

Joined July 2020

Tanay Mehta

@serious_mehta

4 months

are people actually so stupid? what do they think Deepseek-r1 was trained on? potato batteries?

The Spectator Index

@spectatorindex

4 months

BREAKING: NVIDIA is now down 10% in premarket trading.

1K

2K

47K

Tanay Mehta

@serious_mehta

4 months

@tugot17 every time someone sells a share of TSMC, the average IQ dips by 2 points.

20

41

3K

Tanay Mehta

@serious_mehta

4 months

@Samhanknr Oh that I agree with, but people selling $NVDA because they think Deepseek somehow bypassed GPU requirement is hilarious. If anything it shows that $NVDA should be more valued because even with export ban, Chinese companies are able to get their hands on Nvidia chips.

65

43

2K

Tanay Mehta

@serious_mehta

2 years

@stats_feed 1. Parts of China were colonies of several European nations. 2. Nepal and Bhutan, though not directly controlled by Britain, were British protectorates. 3. Liberia was an American colony (so technically not Europe but they were colonised). 4. Mongolia was under the control of.

85

31

1K

Tanay Mehta

@serious_mehta

16 days

I think one mistake I made when learning ML during my undergrad was that I had put far too much focus and time on implementing obsolete ML algorithms from scratch instead of using that time to focus on learning ML performance optimisation and CUDA.

18

62

1K

Tanay Mehta

@serious_mehta

2 months

I scored 7 marks in JEE Mains in 2018 and ended up going to a tier-3 private university no one had ever heard of. Most people around me thought I would end up in a 7 LPA job somewhere, if I am lucky. I just kept solving problems, one at a time. Never kill yourself.

Raghav

@Some1UKnow25

2 months

I failed JEE badly my rank was like 3lakh smth. I haven't achieved much till now but surely have done quite well (sih24 win, 2x internships) & life is going pretty well & I know I will figure everything out eventually. This JEE race has completely hollowed down the Indian youth

30

46

991

Tanay Mehta

@serious_mehta

1 year

I don't wanna get a PhD but wanna work as a Machine Learning Engineer. Dilemma of the Century.

73

45

808

Tanay Mehta

@serious_mehta

11 months

I’m convinced ML math is easier than dating in this day and age lol.

48

759

Tanay Mehta

@serious_mehta

9 months

Done with my Masters, secured a job, and now waiting for a visa which means I am now on a well-deserved vacation after a year of non-stop grind. Problem is, I am so addicted to working towards something that I now dread vacations, and it only took me 6 hours to realise that.

23

8

612

Tanay Mehta

@serious_mehta

2 months

to all the new folks who followed me in the last two days - sorry lads, there won't be any JEE rant posting. I am a 24 yo ai engineer who has long grown out of my JEE-hate era (sorry to disappoint?). however, do stick around if you are into AI, CompSci and low-level sys eng.

6

7

573

Tanay Mehta

@serious_mehta

3 years

I never for once imagined that I'll be writing these lines on a piece of paper but apparently Indian CS degree had other plans in mind.

47

24

562

Tanay Mehta

@serious_mehta

3 years

Indian colleges should also have Open-Source contribution as an option for the Final Year major project. Like say, the rules can be that the Open-Source project should have more than 100 stars and the contribution should be of more than 100 lines of code and should be merged.

37

45

534

Tanay Mehta

@serious_mehta

2 years

@UpdatingOnRome Oh I miss those times when the Germanic people fought with the Romans over the control of Moscow.

2

506

Tanay Mehta

@serious_mehta

2 years

@O42nl Imagine multiplying matrices like hell, left and right, only for the results to start with, “As a Large Language Model trained by OpenAI, I cannot…”.

9

12

481

Tanay Mehta

@serious_mehta

1 year

What I read: Globalisation is making Indian youth realise they deserve more which is causing considerable concern among the Indian companies who now can’t exploit the gullible for laughable pay, even by Indian standards.

Lone Wolf Ratnakar

@sadaashree

1 year

Interviewing candidates for our firm, most of them demanding very high salaries. Now that is not an issue, if these people were extraordinarily brilliant, or some IIT, NIT passout. Most of them are from ordinary engineering college, and forget about being extraordinary, they are.

0

52

474

Tanay Mehta

@serious_mehta

3 years

Is there a Linux distribution that comes pre-installed with all Data Science and Machine Learning libraries and Conda and everything exactly in place? The aim would be to resume working on your ML work within 30 minutes of installing the OS. I would honestly pay for that.

60

40

470

Tanay Mehta

@serious_mehta

9 months

Came in the mail today. Time to be CUDA-Cracked

20

14

447

Tanay Mehta

@serious_mehta

1 year

It’s currently 1.53 AM and I am reading the RWKV paper at Abu Dhabi International Airport waiting for my connecting flight. The grind will not stop, not even at midnight.

22

7

443

Tanay Mehta

@serious_mehta

8 days

@hkproj I am sorry Umar, but you are wrong here. I can live all I want for my "ideals" until a terrorist comes to my home, asks me to remove my pants and then shoot me because I am a Hindu in front of my loved ones. My maternal great grandfather was killed the same way by Pakistani.

4

8

454

Tanay Mehta

@serious_mehta

3 years

Did I just get a 10 GPA in my final semester of CS??? 🤯🤯

48

9

421

Tanay Mehta

@serious_mehta

11 months

Gonna focus *solely* on getting better at ML engineering and core CS for the next three months until my masters graduation in early October. It's time to be very, very cracked. Also, looking forward to chatting with other cool MLEs / research engineers in the domain!

11

8

387

Tanay Mehta

@serious_mehta

1 year

Should I make a YouTube channel where I share, among other things, how to contribute to Open Source Machine Learning projects + other Machine Learning and NLP knowledge?

46

8

361

Tanay Mehta

@serious_mehta

2 years

@stats_feed source: trust me bro.

11

0

354

Tanay Mehta

@serious_mehta

3 years

Practical ML Idea: An ML model that can generate a good “Subject” based on the email's body. I don't know why, but it's always a pain for me to write good email subjects.

24

21

340

Tanay Mehta

@serious_mehta

2 years

@Gerashchenko_en Buy 1 - Get 1 FREE.

2

8

299

Tanay Mehta

@serious_mehta

3 years

If you, like me, get turned on by heavily optimized and memory-efficient PyTorch code then you are in luck because I am in the process of writing a thread on how to write super-duper efficient PyTorch code that can run pretty large models on Colab & Kaggle Kernels 🙃

14

18

331

Tanay Mehta

@serious_mehta

4 months

@Ashwin_S18 Quite the opposite, no? If even one independent company replicates Deepseek-r1 and verifies a training cost of under $6M, this will mean even lesser-funded startups can compete. They may even be able to distill the RL trained models too (not the SFT models like Deepseek has done).

17

4

302

Tanay Mehta

@serious_mehta

2 years

@HSajwanization No it’s not?! Our talent, our money, our engineering. Period.

1

274

Tanay Mehta

@serious_mehta

3 years

This morning, I became a Kaggle Grandmaster ✨🥺. Looking forward to continuing my work on making informative training and inference kernels in different competitions and datasets and helping the community 🚀. Also, I don't think someone has ever become a GM at #69th rank 😅

26

4

288

Tanay Mehta

@serious_mehta

3 years

GitHub Copilot for Jupyter Notebooks, please!!

13

17

277

Tanay Mehta

@serious_mehta

11 months

Machine learning mustn’t stop, not even in empty university halls during vacations. Build the future or die trying.

9

10

273

Tanay Mehta

@serious_mehta

4 years

ML Research Idea: Model that can annotate and explain ML papers.

9

27

262

Tanay Mehta

@serious_mehta

3 years

I turned down an offer from a company in ML and AI space recently. Why? Because despite being Open Source itself, one of the conditions was that I will not be allowed to do Open Source contributions anywhere else while I am an employee. Seriously? Not trading my freedom!

17

10

250

Tanay Mehta

@serious_mehta

3 years

Let's make 2022 the year for Open Source Contributions 🚀

8

15

240

Tanay Mehta

@serious_mehta

2 years

@TheAnkurTyagi No amount of experience justifies a 3 LPA job in India in 2023. A roadside pani puri seller earns way more than that. For the love of god please don’t justify these low paying exploitative jobs and toxic work culture as “great potential to grow”.

6

4

231

Tanay Mehta

@serious_mehta

3 years

Kept this under wraps for some time but saying this out loud now: I got acceptance for a Master's in CS from a German TU last week 🥺🌟

45

2

243

Tanay Mehta

@serious_mehta

2 years

Here you go! I have published my GPT training notebook on Kaggle. It features a *new* way of Data loading using PyTorch data loaders and is powered by @LightningAI for quick, clean and elegant model training along with @weights_biases logging!

4

35

241

Tanay Mehta

@serious_mehta

15 days

They are gonna ruin entire batches of Comp Sci students, aren't they? Students SHOULD learn to code and reason by themselves to develop the critical thinking ability. Now they will be writing code they don't themselves understand.

Cursor

@cursor_ai

16 days

Cursor is now free for students. Enjoy!

8

6

243

Tanay Mehta

@serious_mehta

3 years

Super happy to announce that I just became an Open Source contributor @huggingface transformers🤗. I have added the Poolformer model (from paper: "MetaFormer Is Actually What You Need for Vision") by Sea AI Lab. Example snippet down below 👇

16

22

230

Tanay Mehta

@serious_mehta

1 year

When Machine Learning meets esoteric Hinduism.

computerist

@nonvonmon

1 year

1

12

221

Tanay Mehta

@serious_mehta

3 years

If you are someone who wants to start reading Research papers in ML and looking for some motivation to start, then I have something for you! Here's the Notion database of the papers I've read with their summaries. You can follow my learning journey!

8

41

229

Tanay Mehta

@serious_mehta

1 year

Now I have become SGD, accelerator of negative gradient down the slope

8

11

215

Tanay Mehta

@serious_mehta

3 years

I recently switched to a MacBook Pro and I am already so surprised by its battery life!? 5 Chrome tabs, including YouTube playing music for about an hour, and my battery is down from 100% to 98%?? What sorcery is it?

25

5

217

Tanay Mehta

@serious_mehta

1 year

Got my acceptance at Oxford Machine Learning Summer School 2024, in-person mode! Can’t wait to meet you all amazing people this Summer at Oxford 🚀

14

2

203

Tanay Mehta

@serious_mehta

3 years

Just found out that @kaggle has actually included one of my notebooks that used Jax + @huggingface transformers + @weights_biases tracking for Sentiment Classification as one of the example notebooks for the upcoming Google Open-Source Expert Prize!

14

7

202

Tanay Mehta

@serious_mehta

7 months

When fine-tuning an LM with newly added tokens, set the new token embedding to be the average of existing embeddings, which bounds the KL-divergence. This will make fine-tuning smoother. I added this in my last @huggingface transformers contribution:

11

13

196
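A minimal sketch of the averaging step described in the post above, in plain Python so it runs without any ML framework. It does not reproduce the actual @huggingface transformers code; the function name `extend_with_mean_rows` and the toy table are hypothetical, chosen only for illustration. Each new token row is set to the element-wise mean of the existing embedding rows.

```python
def extend_with_mean_rows(embeddings, num_new_tokens):
    """Return a copy of `embeddings` with `num_new_tokens` extra rows.

    Every added row is the element-wise mean of the existing rows.
    `embeddings` is a list of equal-length lists of floats (hypothetical
    stand-in for a real embedding matrix).
    """
    if not embeddings:
        raise ValueError("need at least one existing embedding row")
    n = len(embeddings)
    dim = len(embeddings[0])
    # Element-wise mean over all existing token embeddings.
    mean_row = [sum(row[j] for row in embeddings) / n for j in range(dim)]
    # Copy the old rows, then append one independent copy of the mean per new token.
    extended = [list(row) for row in embeddings]
    extended.extend(list(mean_row) for _ in range(num_new_tokens))
    return extended


# Toy vocabulary of 3 tokens with 2-dimensional embeddings.
old_table = [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0]]
new_table = extend_with_mean_rows(old_table, num_new_tokens=2)
print(len(new_table))   # 5
print(new_table[-1])    # [3.0, 4.0]
```

In a real model the same idea would be applied to the embedding weight tensor after resizing the vocabulary, instead of to nested lists.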

Tanay Mehta

@serious_mehta

3 months

@deepseek_ai You see boys, THIS is Open AI.

2

191

Tanay Mehta

@serious_mehta

3 years

-> Got out of post-COVID depression.
-> Became a Kaggle Master and now in top-100 in the category.
-> Bagged an ML Internship from NVIDIA.
-> Improved my Grade point average by .4 points in one go.
-> Got featured in my college's annual tech newsletter for the 3rd consecutive time!

13

6

174

Tanay Mehta

@serious_mehta

6 months

@therealnaomib Dumbest take I have read this week. “Have high pollution, stop growing” is like saying “Have trouble understanding maths, start solving easier problems”.

1

171

Tanay Mehta

@serious_mehta

2 years

When you have no clue what LLMs actually are and what we mean by parameters but you still tweet —.

Daniel Apke

@danapke

2 years

@nhutter28 Here is a scary look at where we are vs the knowledge GPT 4 will have.

13

8

179

Tanay Mehta

@serious_mehta

2 years

@latestinspace Just imagine how big a star that once was if it "collapsed" and can still fit 30 Billion suns!

8

3

171

Tanay Mehta

@serious_mehta

4 years

Finally, I am a Research Intern!

0

171

Tanay Mehta

@serious_mehta

3 years

Got offered an ML Intern position at a non-Indian company. How much are they gonna pay me a month for 40 hours per week? $225. Yup. My (remarkably excellent) plumbing guy makes more than that 🤯. Workplace exploitation is real folks.

13

5

162

Tanay Mehta

@serious_mehta

2 years

Officer on UK Border Immigration yesterday: “…So you have worked in AI right? Do you think we can deploy AI on Blockchain?”. Me: *proceeds to explain to him how Blockchain, GPT and generally LLMs work for 20 straight mins*. Him: “😳 Have a fantastic education, sir! *stamped*”. did.

11

2

164

Tanay Mehta

@serious_mehta

1 year

Announcing the LLM Adventures Notebook series on @kaggle, where I will be making notebooks on various interesting use-cases of LLMs and RAG pipelines using Open LLMs and datasets from Kaggle ✨. Check it out:

1

26

164

Tanay Mehta

@serious_mehta

11 months

Time to be cracked af, re-starting the Linear Algebra and Stats playlists from MIT OCW to go back to the basics.

3

1

163

Tanay Mehta

@serious_mehta

1 year

One of those days

5

1

151

Tanay Mehta

@serious_mehta

4 months

Graduated yesterday with a distinction and stuff

16

0

158

Tanay Mehta

@serious_mehta

1 year

I remember learning CUDA C like 2 years ago and then never using it after my internship at NVIDIA was done. Biggest mistake. Now I am going to learn it again (because ✨TRITON✨) and it feels like starting from scratch :(.

6

1

154

Tanay Mehta

@serious_mehta

2 years

So technically I have worked at FAANG?!

Morning Brew Daily

@mbdailyshow

2 years

How it started How it's going

2

1

147

Tanay Mehta

@serious_mehta

2 years

The notebook that followed my talk in London is now out! If you want to understand, code and train your own GPT, take a look at it! Modify it, pull it apart and change it as you see fit 🚀. The data loading part is now chill thanks to @lancedb!

2

35

149

Tanay Mehta

@serious_mehta

3 years

I've published yet another PyTorch training notebook in the AI4Code competition on #Kaggle. This one's using Microsoft's CodeBERT model (thanks to @huggingface🤗). It includes optimizations, a trainer module and @weights_biases logging & exp. tracking 🚀

1

17

146

Tanay Mehta

@serious_mehta

3 years

Update on an experiment I did a month ago: I asked here on Twitter if I should add a separate "Open Source Contributions" section on my CV, which I did. Of the 3 companies I had applied to with this new CV, I received an initial interview call from 2. Success! cc @amuldotexe

5

4

145

Tanay Mehta

@serious_mehta

2 years

Thrilled to announce that I will be joining @tuBraunschweig as a Master's student in Data Science for the Summer Semester of 2023! Can't wait to move to Germany and embark on this new journey 🚀

22

0

142

Tanay Mehta

@serious_mehta

1 year

Although the video is coming soon; if you want to pre-train or fine-tune the Mamba model as a Code-completion LLM (like GitHub Copilot) using a Lance dataset, I have created a repository with training scripts for you 🚀. Only supports single GPU for now.

4

19

135

Tanay Mehta

@serious_mehta

2 years

@Ravisutanjani I don’t know what’s worse, people doing the actual thing or the ones justifying it in this comment section.

3

6

129

Tanay Mehta

@serious_mehta

1 year

Oh Istanbul, you have my heart ❤️🇹🇷

12

3

134

Tanay Mehta

@serious_mehta

4 years

Started my transition from PyTorch to Jax-Flax. Hoping to complete this transition in a few months. Also, will soon start pushing predominantly JAX-based TPU notebooks on Kaggle, details will follow!

5

6

135

Tanay Mehta

@serious_mehta

4 years

Ok hear me out: "Code Reading Groups". Just like Paper Reading groups, except we collectively read and try to make sense of big open-source projects like TensorFlow and PyTorch. I'm sure I'm not the only one who tried reading code from such a repo and couldn't understand anything.

17

4

130

Tanay Mehta

@serious_mehta

3 years

My Pull Request for adding the Hinge Loss function to @DeepMind's Optax has been merged today! Going to add many more loss functions to Optax (for all you JAX geeks out there 😉)

6

3

131

Tanay Mehta

@serious_mehta

3 years

Final exams of my engineering degree are over. Tanay is a free elf now 🌚

12

0

123

Tanay Mehta

@serious_mehta

2 years

OMG! Kaggle Models is finally a thing 😍. I remember talking to @Rob13Ell a few months ago where we brainstormed a lot of interesting ways this could turn out and it's so fun seeing it live after all! Job well done @kaggle team 👏🏼

5

11

126

Tanay Mehta

@serious_mehta

8 months

EU Blue Card visa 🇪🇺 issued, moving to Berlin 🇩🇪 in a couple weeks!

11

3

124

Tanay Mehta

@serious_mehta

1 year

life update: reinforcement learning assignment has me coding 12+ hours every day for the last week. what even is this.

13

0

119

Tanay Mehta

@serious_mehta

2 years

It's official: I am now a Computer Science graduate! Time to celebrate with a slice of pizza and a well-deserved Netflix marathon 🍕. But seriously, all jokes aside, I'm grateful to have made it through this program and can't wait to see what the future holds 🚀

13

1

121

Tanay Mehta

@serious_mehta

2 years

Been writing CUDA kernels since 5 AM, I can't see straight anymore someone please send help.

10

1

117

Tanay Mehta

@serious_mehta

2 years

What's common is that they all left India to make a better life for themselves because our terrible politics, workplace exploitation, casteism and misogyny won't let them build one here.

Varinder Bansal 🇮🇳

@varinder_bansal

2 years

What’s common???
CEO of Google
CEO of Microsoft
CEO of Adobe
CEO of YouTube
CEO of Mastercard
CEO of Pepsi
CEO of IBM
CEO of Netapp
CEO of Nokia
CEO of Novartis
CEO of Deloitte

7

6

118

Tanay Mehta

@serious_mehta

3 years

The real reason why I am doing Open source contributions aggressively is so that I can have all these organizations as infinity stones xD

2

111

Tanay Mehta

@serious_mehta

1 year

Sundays are for creating datasets using @lancedb on @LightningAI Studio so they can be released on Monday 🚀⚡️

4

7

107

Tanay Mehta

@serious_mehta

1 year

The grind stoppeth not. Open Source ML 🙌🏻

6

1

99

Tanay Mehta

@serious_mehta

2 years

@JohnArnoldFndtn “that’s how an RBMK reactor explodes”.

3

0

97

Tanay Mehta

@serious_mehta

1 year

@cmizzy1 Well for starters, we weren’t going through major historical events like every other week.

3

0

95

Tanay Mehta

@serious_mehta

5 months

A weekend in Paris and I want to visit again but for longer. Maybe meet with the ML community here next time!

3

2

96

Tanay Mehta

@serious_mehta

11 months

@WagmanSam It’s the Chamber of Secrets if you ordered it from Temu.

0

91

Tanay Mehta

@serious_mehta

4 years

Twitter fam, I need suggestions for an Ubuntu-based distribution! I am going to wipe Windows 11 this weekend and install an Ubuntu-based distro so please suggest your personal favorites! P.S: No, I don't need to try Arch Linux, Linux Mint or Vanilla Debian, please stay away 😂

66

4

97

Tanay Mehta

@serious_mehta

4 years

Writing PyTorch training pipelines is something I will *never* get bored of.

6

2

91

Tanay Mehta

@serious_mehta

2 months

@Txz67 I was working for US and Israeli startups when I was in India during my undergrad. And I paid for much of my foreign college tuition with the money I saved from working for the startups during Undergrad and Postgrad. Try harder.

3

98

Tanay Mehta

@serious_mehta

4 years

Staying up all night in a hackathon, coding with people while interacting and making friends was such a nice feeling. It all feels like a long time ago. Honestly, if you are not on a video call with your team, staying up at night in an online hackathon, you are missing out!

5

6

93

Tanay Mehta

@serious_mehta

3 years

These Product design and UI/UX people are pure artists 💯

6

11

87

Tanay Mehta

@serious_mehta

4 years

We. Need. More. Jax. Tutorials.

5

1

95

Tanay Mehta

@serious_mehta

3 years

Everyone! Quickly enjoy your life, GITHUB IS DOWN. I REPEAT GITHUB IS DOWN

4

22

92

Tanay Mehta

@serious_mehta

12 days

@sidhant @CapitalsLad Me and him are not that different.

1

0

92