Stephen

@0xSMW

Followers
1K
Following
17K
Media
2K
Statuses
14K

optimizing LLM-powered apps / co-founder @klu_ai

BKK
Joined May 2008
@0xSMW
Stephen
5 years
“We are choked with news, and starved of history.” — Will Durant https://t.co/04gpLaKfGQ
5
0
29
@0xSMW
Stephen
2 days
well well well
@jasondeanlee
Jason Lee
2 days
Supposedly gpt 5 was based on the 4o pretrain. 4.5 and after pretrain was all tossed.
0
0
0
@0xSMW
Stephen
2 days
grokipedia is now automating cross-site links — some better than others
0
0
0
@0xSMW
Stephen
2 days
le tiret du jour
0
0
0
@0xSMW
Stephen
4 days
gemini nano banana pro is really something unique and special when it comes to text coherence and complexity
0
0
0
@0xSMW
Stephen
4 days
everyone knows you scale on Thursday, so that you can research on Friday
@johncoogan
John Coogan
5 days
If you want to have a peaceful Thanksgiving this year, please make a rule with your family not to discuss whether AI is in an age of scaling or an age of research.
0
0
0
@0xSMW
Stephen
5 days
man the style sucks, but the info is legit
@0xSMW
Stephen
5 days
a strange bug - nano banana pro outputting code instead of an image @GeminiApp
1
0
0
@0xSMW
Stephen
5 days
somehow the best 2025 idea seems like a 2005 idea: a website that curates the best human sites and writers on the web, using rss to keep it updated
1
0
0
@0xSMW
Stephen
5 days
I was skeptical of LLMs providing time estimates because most generated plans provide multi-week plans for education or engineering tasks. But then I took the prompt used in the paper and tried it with a few models using my own conversations, and it's pretty close. I noticed
anthropic.com
Anthropic economic research on productivity gains
0
0
1
@0xSMW
Stephen
6 days
man, I like @gruber and he is very wise, but I wish he had a friend who wasn't so blackpilled on Elon
0
0
0
@0xSMW
Stephen
6 days
codex-5.1 found a unique issue in rss generation where I pulled the global update time instead of the post's. opus-4.5 found 6 issues, all non-issues that were deliberate design decisions
0
0
0
@0xSMW
Stephen
6 days
gemini-3 still has the tool call issues found in gemini-2.5 — however, unlike codex-5.1-max and opus-4.5, it spotted an issue impacting my vercel deployment
1
0
0
@0xSMW
Stephen
6 days
probably good for SF folks but really missed the mark for mom
@OpenAI
OpenAI
6 days
Introducing shopping research, a new experience in ChatGPT that does the research to help you find the right products. It’s everything you like about deep research but with an interactive interface to help you make smarter purchasing decisions.
0
0
0
@0xSMW
Stephen
6 days
the claude family, now including opus 4.5, are still not in this club. opus 4.5 spends 20k tokens and provides the same incorrect answer as previous generations. anthropic's approach to reasoning is unlike the other labs and it shows in math/science problems.
@0xSMW
Stephen
12 days
gemini 3 now joins this club
0
0
0
@0xSMW
Stephen
6 days
then thought for 8 mins to answer "how far is the moon", not sure how this model is usable for local inference
0
0
0
@0xSMW
Stephen
6 days
geez olmo, a simple hi would suffice
1
0
0
@0xSMW
Stephen
7 days
and somehow, no amount of instructions to be concise, minimize chattiness, and not use bullets seem to matter — it's very smart, and also a bad communicator compared to o3.
0
0
0
@0xSMW
Stephen
7 days
same prompt, o3 vs gpt-5.1 — 5.1 is grossly verbose
1
0
0