Stephen @0xSMW X Profile

Stephen

@0xSMW

Followers

1K

Following

17K

Media

2K

Statuses

14K

optimizing LLM-powered apps / co-founder @klu_ai

https://t.co/aigSJ16eGr

BKK

Joined May 2008

Don't wanna be here? Send us removal request.

Stephen

@0xSMW

5 years

“We are choked with news, and starved of history.” — Will Durant https://t.co/04gpLaKfGQ

5

0

29

Stephen

@0xSMW

2 days

well well well

Jason Lee

@jasondeanlee

2 days

Supposedly gpt 5 was based on the 4o pretrain. 4.5 and after preteian was all tossed.

0

Stephen

@0xSMW

2 days

grokipedia is now automating cross-site links — some better than others

0

Stephen

@0xSMW

2 days

le tiret du jour

0

Stephen

@0xSMW

4 days

gemini nano banana pro is really something unique and special when it comes to text coherence and complexity

0

Stephen

@0xSMW

4 days

everyone knows you scale on Thursday, so that you can research on Friday

John Coogan

@johncoogan

5 days

If you want to have a peaceful Thanksgiving this year, please make a rule with your family not to discuss whether AI is in an age of scaling or an age or research.

0

Stephen

@0xSMW

5 days

based on https://t.co/LJZjNvCgbB

smw.ai

An evidence-based forecast for the CCP invasion

0

Stephen

@0xSMW

5 days

man the style sucks, but the info is legit

Stephen

@0xSMW

5 days

a strange bug - nano banana pro outputting code instead of an image @GeminiApp

1

0

Stephen

@0xSMW

5 days

a strange bug - nano banana pro outputting code instead of an image @GeminiApp

1

0

Stephen

@0xSMW

5 days

somehow the best 2025 idea seems like a 2005 idea: a website that curates the best human sites and writers on the web, using rss to keep it updated

1

0

Stephen

@0xSMW

5 days

I was skeptical of LLMs providing time estimates because most generated plans provide multi-week plans for education or engineering tasks. But then I took the prompt used in the paper and tried it with a few models using my own conversations, and it's pretty close. I noticed

anthropic.com

Anthropic economic research on productivity gains

0

1

Stephen

@0xSMW

6 days

man, I like @gruber and he is very wise, but I wish he had a friend who wasn't so blackpilled on Elon

0

Stephen

@0xSMW

6 days

codex-5.1 found a unique issue in rss generation where I pulled the global update time instead of the post's opus-4.5 found 6 issues, all non-issues that were deliberate design decisions

0

Stephen

@0xSMW

6 days

gemini-3 still has the tool call issues found in gemini-2.5 — however, unlike codex-5.1-max and opus-4.5, it spotted an issue impacting my vercel deployment

1

0

Stephen

@0xSMW

6 days

probably good for SF folks but really missed the mark for mom

OpenAI

@OpenAI

6 days

Introducing shopping research, a new experience in ChatGPT that does the research to help you find the right products. It’s everything you like about deep research but with an interactive interface to help you make smarter purchasing decisions.

0

Stephen

@0xSMW

6 days

the claude family, now including opus 4.5, are still not in this club. opus 4.5 spends 20k tokens and provides the same incorrect answer as previous generations. anthropic's approach to reasoning is unlike the other labs and it shows in math/science problems.

Stephen

@0xSMW

12 days

gemini 3 now joins this club

0

Stephen

@0xSMW

6 days

then thought for 8 mins to answer "how far is the moon", not sure how this model is usable for local inference

0

Stephen

@0xSMW

6 days

geeze olmo, a simple hi would suffice

1

0

Stephen

@0xSMW

7 days

and somehow, no amount of instructions to be concise, minimize chattiness, and not use bullets seem to matter — it's very smart, and also a bad communicator compared to o3.

0

Stephen

@0xSMW

7 days

same prompt, o3 vs gpt-5.1 — 5.1 is grossly verbose

1

0