tilderesearch Profile Banner
Tilde Profile
Tilde

@tilderesearch

Followers
3K
Following
180
Media
43
Statuses
100

Doing cool things.

Joined July 2024
Don't wanna be here? Send us removal request.
@tilderesearch
Tilde
4 months
We’re excited to announce that Tilde completed an $8M seed round earlier this year, led by Khosla Ventures. Understanding model intelligence is the most important problem in the world, and the key to actualizing the promise that ASI can offer. 🧵 A thread on our approach:
9
12
174
@tilderesearch
Tilde
14 days
Applications closing today! 👻 If you need an extension to finish up, reach out to us directly, and we can give you a few days.
@tilderesearch
Tilde
1 month
Today we're very happy to announce that we’re launching the Tilde Fellowship Program to support research in a mechanistic understanding of pre-training science (arch, optimizers, learning dynamics, etc.). Much of modern ML progress has come from scaling models and empirically
3
0
26
@tilderesearch
Tilde
19 days
5 days left! 🎃
@tilderesearch
Tilde
1 month
Today we're very happy to announce that we’re launching the Tilde Fellowship Program to support research in a mechanistic understanding of pre-training science (arch, optimizers, learning dynamics, etc.). Much of modern ML progress has come from scaling models and empirically
2
1
27
@tilderesearch
Tilde
25 days
:)
@NousResearch
Nous Research
26 days
art, drinks, open source ai w.s.g. tilde research and general reasoning oct. 24th, SF, 6p
0
0
17
@tilderesearch
Tilde
1 month
Today we're very happy to announce that we’re launching the Tilde Fellowship Program to support research in a mechanistic understanding of pre-training science (arch, optimizers, learning dynamics, etc.). Much of modern ML progress has come from scaling models and empirically
2
12
132
@tilderesearch
Tilde
1 month
Apply here https://t.co/RvvrjAgScS with: - Proposed duration and scope - Compute needs (ballpark GPU ask is fine) - Any teammates (or interest in matching) - Background and motivations Applications are reviewed on a rolling basis, and we’ll get back to you quickly.
tilderesearch.typeform.com
Turn data collection into an experience with Typeform. Create beautiful online forms, surveys, quizzes, and so much more. Try it for FREE.
0
1
17
@tilderesearch
Tilde
1 month
We’ll work side-by-side with fellows, providing compute, mentorship, and direct collaboration with our technical team, along with community support - with the outcome of open-source science and release. Whether you are a graduate student/postdoc with an existing moonshot project
2
1
21
@tilderesearch
Tilde
1 month
Today we're very happy to announce that we’re launching the Tilde Fellowship Program to support research in a mechanistic understanding of pre-training science (arch, optimizers, learning dynamics, etc.). Much of modern ML progress has come from scaling models and empirically
2
12
132
@tilderesearch
Tilde
1 month
Read the full post here: https://t.co/9f2qvFlLNu
Tweet card summary image
tilderesearch.com
0
3
30
@tilderesearch
Tilde
1 month
~6/6~ Experiments demonstrate that our relaxed optimizers maintain improved conditioning over Adam, while offering slightly more flexibility than the Stiefel manifold.
1
0
19
@tilderesearch
Tilde
1 month
~5/6~ These variants are special cases of a single, unifying framework. By applying different self-adjoint projectors (P) to the Gram matrix constraint, one can systematically generate families of manifolds—including the Stiefel manifold and our two proposed constraints— and
1
0
21
@tilderesearch
Tilde
1 month
~4/6~ This perspective yields two alternative manifold constraints: DGram-Muon: Enforces orthogonality while allowing column norms to vary. Oblique-Muon: Enforces unit-norm columns (like Stiefel manifold) while allowing non-orthogonality (i.e. off-diagonal entries can be
1
0
22
@tilderesearch
Tilde
1 month
~3/6~ We frame the optimization problem in Gram space. By operating on the Gram matrix (W^TW), which encodes the inner products between weight vectors, we can directly manipulate key geometric properties like vector norms and orthogonality.
1
1
25
@tilderesearch
Tilde
1 month
~2/6~ Our work extends Manifold Muon (by @thinkymachines ), an optimizer that constrains weights to the Stiefel manifold. Stiefel requires perfect orthonormality (W^TW=I) → a condition number of 1. We investigate whether this constraint can be productively relaxed.
1
0
26
@tilderesearch
Tilde
1 month
Modern optimizers can struggle with unstable training. Building off of Manifold Muon, we explore more lenient mechanisms for constraining the geometry of a neural network's weights directly through their Gram matrix 🧠 A 🧵… ~1/6~
3
27
276
@tilderesearch
Tilde
2 months
Check out @nathancgy4's awesome Deltaformer PR and stay tuned for a post on the architecture soon!
@nathancgy4
Nathan Chen
2 months
SEED's paper on associative memory and DeltaFormer is still one of my favorites 🎉so I'm happy share that DeltaFormer is now supported on FLA (flash linear attention)! Learned incredibly much from @yzhang_cs and Mingyu
0
2
21
@tilderesearch
Tilde
2 months
~4/4~ Here’s the post: https://t.co/5nA2IkBmKr If you’re interested in the intersection of TCS and applied ML, reach out. We’re releasing vignettes to share brief-form thoughts from researchers. Stay tuned for the next one → 😶‍🌫️
Tweet card summary image
tilderesearch.com
0
2
9
@tilderesearch
Tilde
2 months
~3/4~ Fundamental lower and upper bounds on transformer expressivity! We derive from first principles that even a restricted transformer (softmax at zero-temp) can solve problems beyond the circuit class AC⁰ (originally from Merrill et al.) More recent work (e.g. on DeltaNet)
1
0
6
@tilderesearch
Tilde
2 months
~2/4~ Circuits, not Turing machines Another lens on computation is Boolean circuits - DAGs of AND/OR/NOT gates. They’re key to understanding parallel vs serial computation (proving lower bounds here→ P?=NP). It turn we can also use them to analyze transformers!
1
0
6
@tilderesearch
Tilde
2 months
~1/4~ P?=NP 👉 P = problems we can solve efficiently. 👉 NP = problems we can check efficiently once given a solution. The (literally) well-known million-dollar question: are these really the same set of problems? We give an introduction to the P ?= NP problem through the
1
0
6
@tilderesearch
Tilde
2 months
Vignette #2 is here! Join @AlecDewulf to: Learn about circuit complexity theory Derive theoretical capabilities and limitations of transformers Discuss the future of theoretical computer science in architecture design A thread 🧵
1
5
22