Sandesh Katakam @sandeshkatakam X Profile

Sandesh Katakam

@sandeshkatakam

Followers

203

Following

9K

Media

6

Statuses

477

BS-MS'25 (Math Major + CS Minor) @IISER_Berhampur. Sometimes I care too much about the interpretability of machine learning models. 😅 Generative Models, RL, MLSys

https://t.co/DFOCgIQafG

Hyderabad, India

Joined December 2015

Andrew Gordon Wilson

@andrewgwils

8 months

Now and then I miss the days of doing Gaussian process research. ML/AI was a simpler world, driven by clear ideas, principles, and goals. People were there out of passion, not because it's what everyone is doing, not as a ticket to a glamorous job. Less restlessness, less fomo.

39

28

634

Pi School

@picampusschool

10 months

Session 15 of the School of AI started yesterday! We are delighted to announce the start of Session 15 of the School of AI, our flagship programme designed to foster cutting-edge innovation and address transformative challenges in artificial intelligence. This session will

0

1

2

Sathvik Redrouthu

@_sathvikr

11 months

Our @ycombinator cofounder video! In 6 weeks, we built state-of-the-art (SOTA) chip design agents, and caught 5 bugs in a certain company's next AI chip (saving an estimated $5M). Check us out: https://t.co/MlaOno6In3

67

99

2K

Garry Tan

@garrytan

1 year

One of the craziest O1 YC hackathon entries was a service that would give you a Jupyter notebook with runnable examples for ML papers tailored to your own personal understanding of ML. You can definitely see hyper personalized instruction is inevitable yet likely pushed back

6

7

191

The Nobel Prize

@NobelPrize

1 year

BREAKING NEWS The Royal Swedish Academy of Sciences has decided to award the 2024 #NobelPrize in Physics to John J. Hopfield and Geoffrey E. Hinton “for foundational discoveries and inventions that enable machine learning with artificial neural networks.”

1K

13K

33K

Gilbert Strang

@GilStrangMIT

1 year

went to MIT career fair today, all the quant shops threw away my resume. citadel seems orthogonal to my post-teaching plans. GS

92

267

7K

Sahar Abdelnabi 🕊

@sahar_abdelnabi

1 year

NeurIPS reviewer: solution is not new and was heavily studied. @NeurIPSConf #NeurIPS24

9

71

927

Ilya Sutskever

@ilyasut

1 year

After almost a decade, I have made the decision to leave OpenAI. The company’s trajectory has been nothing short of miraculous, and I’m confident that OpenAI will build AGI that is both safe and beneficial under the leadership of @sama, @gdb, @miramurati and now, under the

1K

2K

26K

Sandesh Katakam

@sandeshkatakam

1 year

Excited to attend the Oxford Machine Learning Summer School! Kicked off with MLx Fundamentals online today, and can't wait for the in-person sessions on MLx Rep. Learning & Gen AI, and MLx Health/Bio at Oxford University in July. Looking forward to meeting fellow participants! #OxML

0

8

Andrej Karpathy

@karpathy

1 year

Clearly LLMs must one day run in Space. Step 1: we harden llm.c to pass the NASA code standards and style guides, certifying that the code is super safe, safe enough to run in Space. https://t.co/tYGrfdka4X (see the linked PDF) LLM training/inference in principle should be super

307

459

5K

Nicholas Mancuso

@nmancuso_

1 year

I've been toying around with stochastic trace estimation using linear operators, and ended up collecting things into an easy-to-use Python module, traceax. It's built on top of lineax, which is a wonderful linear op library built on top of JAX primitives 👇 https://t.co/vAjlXB48Wp

github.com

mancusolab/traceax: Stochastic trace estimation using JAX.

2

8

56

Daniel Johnson

@_ddjohnson

1 year

Excited to share Penzai, a JAX research toolkit from @GoogleDeepMind for building, editing, and visualizing neural networks! Penzai makes it easy to see model internals and lets you inject custom logic anywhere. Check it out on GitHub: https://t.co/mas2uiMqj9

39

398

2K

MIT CSAIL

@MIT_CSAIL

2 years

19 years ago today MIT researchers got a computer-generated gibberish paper accepted to a predatory journal: https://t.co/kEQY9GJI3Z Generate your own here: https://t.co/mP4LW13kAk

9

43

187

Peter J. Liu

@peterjliu

2 years

What do you call the disparity between GPU-rich and GPU-poor? Jensen's inequality

16

145

1K

aamer AR

@aamer_ar1

2 years

Happy to share our recent work, "Empowering Clinicians with MeDT: A Framework for Sepsis Treatment", presented as a spotlight at the recently concluded #Neurips2023 Goal-Conditioned Reinforcement Learning Workshop. #RL #Neurips @Mila_Quebec

1

6

25

Alexandre Variengien

@A_Variengien

2 years

Over the past months, I worked on a new approach to the interpretability of LMs. Instead of zooming in, I decided to zoom out: are there such things as "organs" inside LLMs? 🧠... the answer might be yes! Very excited to share my new paper: https://t.co/0a1IZR1OAF 🥳🥳

3

14

106

Patrick Kidger

@PatrickKidger

2 years

🌟Time for another blog post! :D🌟 "No more shape errors! Type annotations for the shape+dtype of tensors/arrays." Link: https://t.co/wFuzq51OmO I think the audience for this one is nearly everyone who uses PyTorch / NumPy / JAX / TensorFlow.🤖

kidger.site

Personal Website. Math, SciML, scuba diving!

7

32

216

Sandesh Katakam

@sandeshkatakam

2 years

Test of Time Award @ NeurIPS 2023 to the "Word2Vec" paper (2013), published 10 years ago at ICLR. @JeffDean's talk! 🤩

0

Sandesh Katakam

@sandeshkatakam

2 years

Attending NeurIPS 2023 for the first time (on a Virtual Pass!) Excited!!🤩 Hopefully attend next NeurIPS in-person 🤞 #NeurIPS2023

0

6

Michal Valko

@misovalko

2 years

Fast-forward ⏩ alignment research from @GoogleDeepMind ! Our latest results enhance alignment outcomes in Large Language Models (LLMs). Presenting NashLLM!

4

129

809