
Sandesh Katakam
@sandeshkatakam
Followers: 203 | Following: 9K | Media: 6 | Statuses: 477
BS-MS'25 (Math Major + CS Minor) @IISER_Berhampur. Sometimes I care too much about the interpretability of machine learning models. 😅 Generative Models, RL, MLSys
Hyderabad, India
Joined December 2015
Now and then I miss the days of doing Gaussian process research. ML/AI was a simpler world, driven by clear ideas, principles, and goals. People were there out of passion, not because it's what everyone is doing, not as a ticket to a glamorous job. Less restlessness, less fomo.
39
28
634
Session 15 of the School of AI started yesterday! We are delighted to announce the start of Session 15 of the School of AI, our flagship programme designed to foster cutting-edge innovation and address transformative challenges in artificial intelligence. This session will
0
1
2
Our @ycombinator cofounder video! In 6 weeks, we built state-of-the-art (SOTA) chip design agents, and caught 5 bugs in a certain company's next AI chip (saving an estimated $5M) Check us out: https://t.co/MlaOno6In3
67
99
2K
One of the craziest O1 YC hackathon entries was a service that would give you a Jupyter notebook with runnable examples for ML papers tailored to your own personal understanding of ML You can definitely see hyper personalized instruction is inevitable yet likely pushed back
6
7
191
BREAKING NEWS The Royal Swedish Academy of Sciences has decided to award the 2024 #NobelPrize in Physics to John J. Hopfield and Geoffrey E. Hinton “for foundational discoveries and inventions that enable machine learning with artificial neural networks.”
1K
13K
33K
went to MIT career fair today, all the quant shops threw away my resume. citadel seems orthogonal to my post-teaching plans. GS
92
267
7K
After almost a decade, I have made the decision to leave OpenAI. The company’s trajectory has been nothing short of miraculous, and I’m confident that OpenAI will build AGI that is both safe and beneficial under the leadership of @sama, @gdb, @miramurati and now, under the
1K
2K
26K
Excited to attend the Oxford Machine Learning Summer School! Kicked off with MLx Fundamentals online today, and can't wait for the in-person sessions on MLx Rep. Learning & Gen AI and MLx Health/Bio at Oxford University in July. Looking forward to meeting fellow participants! #OxML
0
0
8
Clearly LLMs must one day run in Space Step 1 we harden llm.c to pass the NASA code standards and style guides, certifying that the code is super safe, safe enough to run in Space. https://t.co/tYGrfdka4X (see the linked PDF) LLM training/inference in principle should be super
307
459
5K
I've been toying around with stochastic trace estimation using linear operators, and ended up collecting things into an easy-to-use Python module, traceax. It's built on top of lineax, which is a wonderful linear op library built on top of JAX primitives 👇 https://t.co/vAjlXB48Wp
github.com
Stochastic trace estimation using JAX. Contribute to mancusolab/traceax development by creating an account on GitHub.
2
8
56
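For readers unfamiliar with the idea in the post above, here is a minimal sketch of stochastic trace estimation (the classic Hutchinson estimator) in plain Python. It does not use or reproduce the traceax or lineax APIs; the function name `hutchinson_trace`, its parameters, and the example matrix `A` are all hypothetical and chosen only for illustration. The point is that the trace of a matrix can be estimated from matrix-vector products alone, without ever reading its diagonal.

```python
import random


def hutchinson_trace(matvec, n, num_probes=1000, seed=0):
    """Estimate tr(A) using only the map v -> A @ v.

    Uses Rademacher probes (entries are +1 or -1 with equal probability),
    for which E[v^T A v] = tr(A).
    """
    rng = random.Random(seed)
    total = 0.0
    for _ in range(num_probes):
        v = [rng.choice((-1.0, 1.0)) for _ in range(n)]
        Av = matvec(v)
        # v^T (A v) is one unbiased sample of the trace.
        total += sum(vi * avi for vi, avi in zip(v, Av))
    return total / num_probes


# Hypothetical 3x3 symmetric matrix, exposed only through its matvec.
A = [
    [2.0, 0.5, 0.0],
    [0.5, 3.0, 0.1],
    [0.0, 0.1, 4.0],
]


def matvec(v):
    return [sum(a * x for a, x in zip(row, v)) for row in A]


estimate = hutchinson_trace(matvec, n=3)
exact = sum(A[i][i] for i in range(3))
print(f"estimate={estimate:.3f} exact={exact:.3f}")
```

The estimate is noisy: its variance depends on the off-diagonal entries and shrinks as `num_probes` grows. For a diagonal operator every Rademacher probe returns the exact trace.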
Excited to share Penzai, a JAX research toolkit from @GoogleDeepMind for building, editing, and visualizing neural networks! Penzai makes it easy to see model internals and lets you inject custom logic anywhere. Check it out on GitHub: https://t.co/mas2uiMqj9
39
398
2K
19 years ago today MIT researchers got a computer-generated gibberish paper accepted to a predatory journal: https://t.co/kEQY9GJI3Z Generate your own here: https://t.co/mP4LW13kAk
9
43
187
What do you call the disparity between GPU-rich and GPU-poor? Jensen's inequality
16
145
1K
Happy to share our recent work, "Empowering Clinicians with MeDT: A Framework for Sepsis Treatment," presented as a spotlight at the recently concluded #Neurips2023 Goal-Conditioned Reinforcement Learning Workshop. #RL #Neurips @Mila_Quebec
1
6
25
Over the past months, I worked on a new approach to the interpretability of LMs. Instead of zooming in, I decided to zoom out: are there such things as "organs" inside LLMs? 🧠... the answer might be yes! Very excited to share my new paper: https://t.co/0a1IZR1OAF 🥳🥳
3
14
106
🌟Time for another blog post! :D🌟 "No more shape errors! Type annotations for the shape+dtype of tensors/arrays." Link: https://t.co/wFuzq51OmO I think the audience for this one is nearly everyone who uses PyTorch / NumPy / JAX / TensorFlow.🤖
kidger.site
Personal Website. Math, SciML, scuba diving!
7
32
216
Test of Time Award @ NeurIPS 2023 to the "Word2Vec" paper (2013), published 10 years ago at ICLR. @JeffDean's talk! 🤩
0
0
0
Attending NeurIPS 2023 for the first time (on a Virtual Pass!) Excited!! 🤩 Hopefully I'll attend the next NeurIPS in person 🤞 #NeurIPS2023
0
0
6
Fast-forward ⏩ alignment research from @GoogleDeepMind! Our latest results enhance alignment outcomes in Large Language Models (LLMs). Presenting NashLLM!
4
129
809