EyubogluSabri Profile Banner
Sabri Eyuboglu Profile
Sabri Eyuboglu

@EyubogluSabri

Followers
1K
Following
1K
Media
15
Statuses
446

Working on language model memory. CS PhD student @Stanford working with @HazyResearch and @james_y_zou. 🪬

Joined February 2019
Don't wanna be here? Send us removal request.
@EyubogluSabri
Sabri Eyuboglu
3 months
When we put lots of text (eg a code repo) into LLM context, cost soars b/c of the KV cache’s size. What if we trained a smaller KV cache for our documents offline? Using a test-time training recipe we call self-study, we find that this can reduce cache memory on avg 39x
Tweet media one
13
73
303
@EyubogluSabri
Sabri Eyuboglu
2 days
RT @krandiash: Instead of using ChatGPT, I’m increasingly using Claude Code for non code queries as well, including long form writing, anal….
0
3
0
@grok
Grok
5 days
What do you want to know?.
421
259
2K
@EyubogluSabri
Sabri Eyuboglu
2 days
RT @_khaledsaab: After two amazing years @GoogleDeepMind, I’m now joining @OpenAI to accelerate biomedical intelligence with @thekaransingh….
0
35
0
@EyubogluSabri
Sabri Eyuboglu
6 days
RT @cartesia_ai: Introducing Line by Cartesia: the modern voice agent development platform. Line was built to be code-first, because best-i….
0
53
0
@EyubogluSabri
Sabri Eyuboglu
10 days
RT @DimitrisPapail: Thinking about model generalization is quite painful. We observe empirically that models trained with SGD on cross-en….
0
56
0
@EyubogluSabri
Sabri Eyuboglu
10 days
RT @oshaikh13: If you thought referencing past chats was cool, we built an MCP that lets Claude use *anything you see or do on your compute….
0
33
0
@EyubogluSabri
Sabri Eyuboglu
11 days
RT @krandiash: Excited to see this release from @ShreyaR, they’ve really created a magical experience for builders with Snowglobe. Evals an….
0
1
0
@EyubogluSabri
Sabri Eyuboglu
11 days
RT @ShreyaR: Introducing ❄️ @snowglobe_so, the simulation engine for AI chatbots. Magically simulate the behavior of your users to test an….
0
87
0
@EyubogluSabri
Sabri Eyuboglu
11 days
RT @arithmoquine: new post. there's a lot in it. i suggest you check it out
Tweet media one
0
187
0
@EyubogluSabri
Sabri Eyuboglu
17 days
RT @kalomaze: @teortaxesTex i keep thinking who will stop fucking around and productionize ICL context distillation (i.e Cartridges paper)….
0
2
0
@EyubogluSabri
Sabri Eyuboglu
26 days
RT @willccbb:
Tweet media one
0
8
0
@EyubogluSabri
Sabri Eyuboglu
27 days
RT @EricTopol: Who do you call when you need to design novel, potent nanobodies vs a pathogen?.The virtual lab of A.I. agents @Nature @jame….
0
50
0
@EyubogluSabri
Sabri Eyuboglu
27 days
RT @james_y_zou: ⚡️Thrilled that #VirtualLab is published in @Nature! We created a team of AI agents to mirror my….
0
250
0
@EyubogluSabri
Sabri Eyuboglu
1 month
RT @jordanjuravsky: Check out Tokasaurus on Modal to make Llama-1B brrr! This repeated sampling example shows off two engine features that….
0
8
0
@EyubogluSabri
Sabri Eyuboglu
1 month
RT @charles_irl: Tokasaurus, the "little LLM engine that could" by @jordanjuravsky and @EyubogluSabri of @HazyResearch/@ScalingIntelLab, is….
0
9
0
@EyubogluSabri
Sabri Eyuboglu
1 month
RT @dan_biderman: Train the entries of your KV caches . @EyubogluSabri
Tweet media one
0
2
0
@EyubogluSabri
Sabri Eyuboglu
1 month
RT @ryansehrlich: Thank you for the kind words -- we can't either! We're really excited about models learning new things and remembering t….
0
7
0
@EyubogluSabri
Sabri Eyuboglu
1 month
RT @bio_bootloader: Cartridges could be this "missing learning paradigm" Karpathy talks about. 1) agent does tasks, collects memories that….
0
5
0
@EyubogluSabri
Sabri Eyuboglu
1 month
RT @snowclipsed: found my weekend experiment.
0
2
0
@EyubogluSabri
Sabri Eyuboglu
1 month
RT @ESFoMo: Looking forward to seeing everyone for ES-FoMo part three tomorrow! We'll be in East Exhibition Hall A (the big one), and we've….
0
22
0
@EyubogluSabri
Sabri Eyuboglu
1 month
RT @realDanFu: ES-FoMo is back tomorrow! Come join is in East Exhibition Hall A bright and early at 8:30AM for a great slate of invited tal….
0
2
0