
Lance Martin
@RLanceMartin
Followers
15K
Following
6K
Media
417
Statuses
1K
langchain. past: robots 🚘 🤖, phd @stanford 🧪
San Francisco, CA
Joined May 2009
Common “context engineering” patterns. Loved @dbreunig posts on this. I also wrote up some thoughts:.
As your context bloats, you hit different failure modes. These failures hit agents hardest because they operate in exactly the scenarios where contexts balloon: gathering information, making sequential tool calls, engaging in multi-turn reasoning, & accumulating histories.
3
7
28
Some useful references --.1/ @karpathy on LLMs as OS.2/ @walden_yan on context engineering.3/ @barry_zyj + team multi-agent.4/ @AymericRoucher + team on deep research.5/ @bcherny on.
1
6
34
a few thoughts on the current state of agents based on what I saw at @aiDotEngineer: . rise of "ambient" agents. the bitter lesson & agent UX. RL for non-verifiable tasks. the case for MCP. early days for agent memory .
5
19
115
@kevinhou22 on Windsurf:.> Current dev workflow centric .> Highly opinionated UI / IDE.> Allows for granular data capture .> Lets them train models. @mntruell w/ @benthompson pod similar point; long “messy middle” of devs + AI working together preserves need for IDE.
0
0
6
Most interesting AI product question I took from @aiDotEngineer is Claude Code vs IDEs (Cursor/Windsurf). @bcherny on Claude Code: .> Bitter lesson centric .> General models win.> General things around model win.> Unopionionated / no UI.> Work w fast changing UX / models
2
1
39
Some notes from @aiDotEngineer day 1 -. @simonw on state of AI.> Visual eval for LLMs: asked each LLM to generate code for an SVG image of a pelican riding a bicycle. Ran this across ~30 model releases over the past 6 months. Created a script to select random image pairs, GPT4.1.
5
19
114
Fundamentals -- Tool calling, agents v workflows (h/t @barry_zyj, @ErikSchluntz), persistence/checkpointing. Notebook:.Slides:
1
1
8