Ben
@SolidlySheafy
Followers 373 · Following 282 · Media 0 · Statuses 32
Understanding intelligence @tilderesearch // prev math @Penn and @Cambridge_Uni
Joined March 2024
Please apply!! We are very excited about this program and are looking forward to working with folks on some really interesting questions
Today we're very happy to announce that we’re launching the Tilde Fellowship Program to support research in a mechanistic understanding of pre-training science (arch, optimizers, learning dynamics, etc.). Much of modern ML progress has come from scaling models and empirically
0 · 0 · 11
We really enjoyed the @thinkymachines post on Manifold Muon from a couple of weeks ago and decided to share some thoughts. Make sure to check out the original post as well:
thinkingmachines.ai: A geometric framework for co-designing neural net optimizers with manifold constraints.
Modern optimizers can struggle with unstable training. Building off of Manifold Muon, we explore more lenient mechanisms for constraining the geometry of a neural network's weights directly through their Gram matrix 🧠 A 🧵… ~1/6~
2 · 13 · 214
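A rough sketch of what "constraining the geometry of a network's weights through their Gram matrix" can look like in code: a soft Frobenius penalty pulling W^T W toward the identity, so orthonormality is encouraged rather than enforced. The penalty form and coefficient here are illustrative assumptions, not the mechanism from the thread or from Manifold Muon.

```python
import torch

def gram_penalty(weight: torch.Tensor, coeff: float = 1e-3) -> torch.Tensor:
    """Soft penalty pulling the Gram matrix W^T W toward the identity,
    i.e. encouraging (rather than enforcing) orthonormal columns."""
    gram = weight.T @ weight                        # (in_features, in_features)
    eye = torch.eye(gram.shape[0], device=weight.device, dtype=weight.dtype)
    return coeff * (gram - eye).pow(2).sum()

# Usage: add the penalty for each constrained layer to the task loss.
layer = torch.nn.Linear(256, 512, bias=False)
x = torch.randn(8, 256)
task_loss = layer(x).pow(2).mean()                  # stand-in for a real loss
loss = task_loss + gram_penalty(layer.weight)
loss.backward()
```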
Have really enjoyed learning from Alec, hope people like the post!
Vignette #2 is here! Join @AlecDewulf to: • Learn about circuit complexity theory • Derive theoretical capabilities and limitations of transformers • Discuss the future of theoretical computer science in architecture design A thread 🧵
0 · 1 · 5
Come by our booth! We are giving out shirts to anyone who can solve our challenge problems :)
We'll be at the Berkeley EECS Career Fair this Tuesday & Wednesday with cool custom puzzles (and prizes). @berkeley_ai @UCBerkeley
0 · 0 · 8
Really enjoyed reading this!
Today we’re launching our weekly vignette series! 📚 Vignette #1: Attention through a regression lens 📈 • derive attention from regression • analyze the geometry of kernel smoothers • introduce Epanechnikov attention (feature map + recurrent form) A thread 🧵
0 · 0 · 5
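For readers who want the regression lens in code: attention can be read as a Nadaraya-Watson kernel smoother, and swapping the exponential kernel for an Epanechnikov one gives compactly supported weights. This toy sketch is my own paraphrase, not the vignette's exact construction (in particular it skips the feature-map and recurrent forms).

```python
import numpy as np

def kernel_attention(Q, K, V, kernel):
    """Nadaraya-Watson smoother: each query's output is a kernel-weighted
    average of the values, which is the 'regression lens' on attention."""
    W = kernel(Q, K)                                   # (n_q, n_k) nonnegative weights
    W = W / (W.sum(axis=-1, keepdims=True) + 1e-9)     # normalize rows
    return W @ V

def softmax_kernel(Q, K):
    # exp(<q, k> / sqrt(d)) recovers ordinary softmax attention after normalization
    return np.exp(Q @ K.T / np.sqrt(Q.shape[-1]))

def epanechnikov_kernel(Q, K, h=2.0):
    # Epanechnikov kernel max(0, 1 - ||q - k||^2 / h^2): compact support,
    # so sufficiently distant keys receive exactly zero weight
    d2 = ((Q[:, None, :] - K[None, :, :]) ** 2).sum(-1)
    return np.maximum(0.0, 1.0 - d2 / h**2)

Q, K, V = np.random.randn(4, 8), np.random.randn(6, 8), np.random.randn(6, 8)
out_soft = kernel_attention(Q, K, V, softmax_kernel)
out_epan = kernel_attention(Q, K, V, epanechnikov_kernel)
```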
We believe there is a lot of interesting work that is not traditionally labeled as interpretability, and we also wanted to share a few words about our overall approach.
At Tilde, we believe mechanistic understanding of models is key to enabling entirely new architectures and capabilities. We’ve put together a position piece on what interpretability means to us. A thread 🧵
0 · 0 · 8
Hope this is helpful to others!
Mixture‑of‑Experts (MoE) powers many frontier models like R1, K2, & Qwen3 ⚡️ To make frontier-scale MoE models accessible to train, we open-source MoMoE, a hyper-performant MoE implementation built for training and inference, outpacing the fastest existing ones by up to: - 70%
0 · 0 · 5
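For anyone new to MoE, here is a bare-bones top-k routed layer just to show the semantics: a router scores experts per token, the top k are run, and their outputs are mixed with renormalized gate weights. This naive double loop is the opposite of hyper-performant and is in no way MoMoE's implementation, only a readable reference.

```python
import torch, torch.nn as nn, torch.nn.functional as F

class TinyMoE(nn.Module):
    """Readable top-k MoE layer: route each token to k experts and mix their outputs."""
    def __init__(self, d_model=64, d_ff=128, n_experts=8, k=2):
        super().__init__()
        self.k = k
        self.router = nn.Linear(d_model, n_experts, bias=False)
        self.experts = nn.ModuleList(
            nn.Sequential(nn.Linear(d_model, d_ff), nn.GELU(), nn.Linear(d_ff, d_model))
            for _ in range(n_experts)
        )

    def forward(self, x):                          # x: (tokens, d_model)
        logits = self.router(x)                    # (tokens, n_experts)
        top_vals, top_idx = logits.topk(self.k, dim=-1)
        gates = F.softmax(top_vals, dim=-1)        # renormalize over the chosen k experts
        out = torch.zeros_like(x)
        for slot in range(self.k):
            for e, expert in enumerate(self.experts):
                mask = top_idx[:, slot] == e       # tokens sent to expert e in this slot
                if mask.any():
                    out[mask] += gates[mask, slot, None] * expert(x[mask])
        return out

y = TinyMoE()(torch.randn(16, 64))                 # (16, 64)
```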
Excited to share soon what we've been working on since :)
We’re excited to announce that Tilde completed an $8M seed round earlier this year, led by Khosla Ventures. Understanding model intelligence is the most important problem in the world, and the key to actualizing the promise that ASI can offer. 🧵 A thread on our approach:
0 · 0 · 13
Some work we did on understanding recent sparse attention mechanisms.
Sparse attention (MoBA/NSA) trains faster & beats full attention in key tasks. But we’ve had no idea how they truly work…until now. 🔍 We reverse-engineered them to uncover: - Novel attention patterns - Hidden "attention sinks" - Better performance - And more A 🧵… ~1/8~
1 · 0 · 6
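A quick note on what an "attention sink" means operationally: a head that dumps most of its probability mass on one position (often the first token) regardless of the query. One crude way to check for this is below; the toy diagnostic is mine, not the methodology used in the thread.

```python
import torch

def sink_mass(attn: torch.Tensor, sink_pos: int = 0) -> torch.Tensor:
    """Average fraction of attention mass each head places on a candidate sink
    position. attn: (heads, q_len, k_len) with rows summing to 1."""
    return attn[..., sink_pos].mean(dim=-1)

# Toy example with random attention; a real check should skip the earliest
# queries, which can only attend to the first tokens under a causal mask.
attn = torch.softmax(torch.randn(4, 10, 10), dim=-1)
print(sink_mass(attn))   # per-head scores; values near 1 suggest a sink head
```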
🎉Excited to announce today that @erikqu_ and I have been accepted into the X25 @ycombinator batch. Erik and I met in 2014 in high school doing coding competitions together. >10 years later, we're building @operative_sh - the world's first web app coding & debugging agent. 🧵
22 · 14 · 112
Hopefully useful contribution to the community!
Today, we open-source Activault, a simple, high-throughput, and cost-effective solution to activation data management for accelerating interpretability research on frontier models. A 🧵… ~1/6~ https://t.co/ZMTbtHRUJD
0 · 0 · 6
I taught an LLM to optimize proteins. It proposed a better carbon capture enzyme. Introducing Pro-1, an 8B-parameter reasoning model trained using GRPO towards a physics-based reward function for protein stability. It takes in a protein sequence + text description + previous
93 · 332 · 3K
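Since the post mentions GRPO, here is the core of the algorithm in a few lines: sample a group of completions per prompt, score each with the reward function, and use group-relative, standardized rewards as advantages, with no learned value model. The toy rewards below are placeholders, not Pro-1's physics-based reward.

```python
import numpy as np

def grpo_advantages(group_rewards, eps=1e-8):
    """Group-relative advantages: standardize each completion's reward
    against the mean and std of its own sampled group."""
    r = np.asarray(group_rewards, dtype=np.float64)
    return (r - r.mean()) / (r.std() + eps)

# One reward per sampled protein edit; stand-ins for a stability score.
rewards = [1.2, -0.3, 0.8, 0.1]
print(grpo_advantages(rewards))   # positive => better than the group average
```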
And if you don’t like graph theory, but do like interpretability, we have plenty of other fun problems, so feel free to email us at join@tilderesearch.com. We are doing a lot of applied interpretability work like this: https://t.co/WP9MmyLwCv, which was the first application of
tilderesearch.com
0 · 1 · 4
It's a fun problem!
Over the past few weeks, we've been using this graph theory problem in interviews and figured we'd open it up to everyone here! https://t.co/GQCg2rk1Q4 If you solve it, we’ll move you directly to the last rounds of our process!
0 · 0 · 3
It was so great to work on this with Adam!
I'm excited about our case study! SAEs can be practical for preventing behavior, especially rare behavior, as we only intervene when needed. Other tools, like finetuning or system prompts, have side effects as they modify every forward pass. Some thoughts below.
0 · 0 · 6
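To make the "only intervene when needed" point concrete, here is a generic sketch of a conditional SAE edit: run the SAE encoder on the residual stream, and only when a chosen feature fires above a threshold, subtract that feature's decoder direction. The ReLU-SAE shapes, threshold, and subtraction rule are illustrative assumptions, not Sieve's exact intervention.

```python
import torch

def conditional_sae_clamp(resid, W_enc, b_enc, W_dec, feat_idx, threshold=0.0):
    """If SAE feature `feat_idx` fires above `threshold`, remove its contribution
    (activation * decoder direction) from the residual stream; else pass through."""
    acts = torch.relu(resid @ W_enc + b_enc)           # (batch, seq, d_sae), ReLU SAE assumed
    feat = acts[..., feat_idx]                         # (batch, seq)
    correction = feat.unsqueeze(-1) * W_dec[feat_idx]  # (batch, seq, d_model)
    mask = (feat > threshold).unsqueeze(-1)
    return torch.where(mask, resid - correction, resid)

# Toy shapes: d_model=16, d_sae=64
resid = torch.randn(2, 5, 16)
W_enc, b_enc, W_dec = torch.randn(16, 64), torch.zeros(64), torch.randn(64, 16)
patched = conditional_sae_clamp(resid, W_enc, b_enc, W_dec, feat_idx=3)
```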
We used SAEs to outperform baselines on a real-world task! Check it out!
Mechanistic interpretability is fascinating - but can it be useful? In particular, can it beat strong baselines like steering and prompting on downstream tasks that people care about? The answer is, resoundingly, yes. Our new blog post with @a_karvonen, Sieve, dives into the
0 · 0 · 5
Unbelievably excited to announce this. We genuinely want to understand the universe, and think building interpretable AGI is the way to get us there :)
We're thrilled to be launching Tilde. We're applying interpretability to unlock deep reasoning and control of models, enabling the next generation of human-AI interaction. By understanding a model's inner mechanisms, we can enhance both its reliability and performance—going
0 · 1 · 11
Warmest birthday wishes to mathematician extraordinaire Pierre Deligne, Fields Medallist and Abel Prize winner, Permanent Professor at IHES from 1970 to 1984.
0 · 11 · 71