estsauver Profile Banner
Earl St Sauver Profile
Earl St Sauver

@estsauver

Followers
240
Following
223
Media
36
Statuses
829

Amsterdam, The Netherlands
Joined January 2009
Don't wanna be here? Send us removal request.
@estsauver
Earl St Sauver
1 day
RT @darrenangle: *sniff* *pulls shirt* You know, this is perfect - *gestures wildly* - this is the ultimate perversity of capitalism at its….
0
515
0
@estsauver
Earl St Sauver
8 days
RT @LemmySmackett: "Okay, so imagine a magic button.". "I'm imagining the button.". "If you press the button—". "What color is it?". "It's….
0
3K
0
@estsauver
Earl St Sauver
1 month
Me tweeting at Delta to not lose my bags
Tweet media one
0
0
2
@estsauver
Earl St Sauver
1 month
Hey @GeminiApp , is the python-genai github being watched? I think grounding/citations is still broken and it doesn't seem like stuff is getting triaged? .
Tweet card summary image
github.com
Hi Y'all! I'm trying to route a gemini grounding issue to the right team--it seems the links to groundings aren't appearing in the python sdk. Got routed here from GoogleCloudPlatform/g...
0
0
0
@estsauver
Earl St Sauver
2 months
RT @JohnHolbein1: Timeless advice
0
477
0
@estsauver
Earl St Sauver
2 months
Today I’ll be releasing my new eval called “naming things.” @StreamOnMax and @OpenAI.
0
0
0
@estsauver
Earl St Sauver
3 months
I think @ESYudkowsky 's AI safety argument gets decently validated when xAI can't rollback an obvious problem and intentional bias in their system prompt quickly before everyone knows it. Whatever you think about what's happening in AI safety generally, it's clear that Grok needs.
0
0
0
@estsauver
Earl St Sauver
3 months
There may have been some signs with Kanye in retrospect….
0
0
0
@estsauver
Earl St Sauver
3 months
OpenAI has this, I just want something like this.
0
0
0
@estsauver
Earl St Sauver
3 months
1
0
0
@estsauver
Earl St Sauver
3 months
First one's up: How can you know how much Gemini costs for tool use?.
@estsauver
Earl St Sauver
3 months
@OfficialLoganK if I recorded a bunch of screen shares of me trying to figure out how to do stuff with Gemini and talking to myself would that be useful? . I *want* and pro/flash are clearly amazing models, but the API layer is a problem.
1
0
0
@estsauver
Earl St Sauver
3 months
It’s weird how Uncle Bob gets a twitter account and then everyone realizes that if you do what he suggests your company dies.
@ryanjfleury
Ryan Fleury
3 months
Tweet media one
0
0
1
@estsauver
Earl St Sauver
3 months
It seems like MLX doesn't actually increase speeds relative to GGUF for decently sized models on macs? > 20B params it definitely is slower at least on LM Studio.
0
0
0
@estsauver
Earl St Sauver
3 months
I put together a small benchmark of LocalLLMs running on an m4 pro-max with a lot of ram to give a qualitative sense of how running local models feels/speeds, let me know if you want any other info.
1
0
0
@estsauver
Earl St Sauver
3 months
Although Github link currently isn't found--I've emailed one of the authors (@ypwang61) and so I hope it'll be up soon!.
0
0
0
@estsauver
Earl St Sauver
3 months
Full details, code, and prompts are in the paper. Worth reading if you are interested in data-efficient RL methods for language models and in understanding how exploration bonuses interact with reasoning ability.
1
0
0
@estsauver
Earl St Sauver
3 months
Key takeaway: large language models already encode much of the required knowledge; a single well-selected, auto-verifiable example can unlock it through reinforcement learning.
1
0
0
@estsauver
Earl St Sauver
3 months
Cross-domain evaluation shows spill-over benefits: training on a single Geometry question also improves Algebra, Number Theory and Combinatorics accuracy.
1
0
0