
Savannah
@SavannahFeder
Followers
7K
Following
3K
Media
44
Statuses
392
RT @fdotinc: F&#! doing groceries on vacation. Introducing - a supermarket inside your Airbnb.
0
259
0
agent hack night at github 🤝. demos + talks on: .- llm observability & evals by @arizeai .- multi-agent framework from @crewAIInc .- efficient vector search from daxe.- fast and cheap ai inference by @friendliai . ft. @zaw358 🫡
4
3
43
Seems I was wrong on this…. o1 doesn’t perform much better on agentic tasks than other frontier models. Which is pretty unexpected given its supposed IQ of 120.
o1 might cause a paradigm shift for agents. today’s agents suck at many-step workflows - since one poor decision snowballs into a chain of errors. but o1 will lead to way fewer reasoning errors, meaning agents will crack processes they just couldn't before.
2
0
13