
Anish Mudide
@amudide
Followers
346
Following
485
Media
9
Statuses
42
RT @AchyutaBot: 🧵Can we understand vision language models by interpreting linear directions in their latents? Yes! In our new paper, Line…
0
26
0
I'm at ICLR to present Switch SAEs. Come by 3pm - 5:30pm today at Hall 3 + Hall 2B #272.
Sparse autoencoders (SAEs) allow us to peer into the inner workings of language models, but scaling them to frontier models is expensive. In our new paper, we introduce Switch Sparse Autoencoders, a novel architecture aimed at reducing the cost of training SAEs. 🧵 (1/13):
4
2
24
RT @match_ten: (1/11) New paper! “Low-rank adapting models for Sparse Autoencoders.” While SAEs find interpretable latents, they hurt downs…
0
15
0
Our paper on Switch Sparse Autoencoders has been accepted to ICLR 2025 – see you in 🇸🇬!
Sparse autoencoders (SAEs) allow us to peer into the inner workings of language models, but scaling them to frontier models is expensive. In our new paper, we introduce Switch Sparse Autoencoders, a novel architecture aimed at reducing the cost of training SAEs. 🧵 (1/13):
7
14
173
RT @ethrbt_design: 🦜Introducing the Stochastic Parrot 🦜: An AI-powered motivational companion! The Stochastic Parrot sits on your shoulder…
0
5
0
RT @ericjmichaud_: Since the internal structure of neural networks, through training, comes to reflect the structure of the external world,…
0
2
0
RT @TransluceAI: Announcing Transluce, a nonprofit research lab building open source, scalable technology for understanding AI systems and…
0
147
0
RT @JoshAEngels: 1/11: New paper! "Decomposing the Dark Matter of Sparse Autoencoders." We find that SAE errors and error norms are linear…
0
37
0
This work would not have been possible without support from the @MATSprogram. I'd also like to thank my collaborators @JoshAEngels, @ericjmichaud_, @tegmark, and @casdewitt! (13/13)
0
0
11