
Freddie Vargus
@freddie_v4
Followers
1K
Following
11K
Media
42
Statuses
721
cto & co-founder @quotientai Research @cohere_labs — past: evals @github Copilot, data @quantopian — Tico 🇨🇷🇺🇸
Boston
Joined June 2012
introducing the Quotient MCP Server, our entrypoint for in-the-loop steering of agents. agents can receive information about what kinds of errors they're making in between steps, get feedback from specialized models, and correct themselves
3
9
19
our MCP server is just one component in Limbic, our system which captures and processes agent behavior, helps you understand it, and automatically improves your agents for you. reach out to me or @JuliaANeagu if this is something you're interested in 🙂 quotientai dot co.
0
0
1
and the server is open source and can be found here
github.com
A Model Context Protocol (MCP) server for evaluating tool calls and AI agent interactions. - quotient-ai/quotient-mcp
1
0
2
we have guides for integrating with @cursor_ai @claudeai Code, @AmpCode , and @claudeai Desktop. you can find docs here
1
1
2
our MCP server currently provides a tool for evaluating tool calls, backed by our limbic-tool-use model. more tools are in the works!.
today we're releasing a new small model (0.5B) for detecting problems with tool usage in agents, trained on 50M tokens from publicly available MCP server tools. it's great at picking up on tool accuracy issues and outperforms larger models
1
0
3
RT @SunejaLuv: Tool use hallucinations are real and often ignored. When an agent:. - Invents a function name. - Uses incorrect parameters….
0
2
0
I’ve been moving more and more of my coding off of Cursor and on to Sculptor btw. the vibes are good, and the experience has been pretty nice.
Writing code is just the start. To move beyond prototypes, we need agents that plan, write specs, run tests, follow style guides, and catch bugs before you do. @JoshAlbrecht shared how we're tackling this with Sculptor at @aiDotEngineer World’s Fair:
1
2
15
RT @JuliaANeagu: Our talk with @Tavily is now live — part of the new.@aiDotEngineer Retrieval & Search track. We share a practical frame….
0
7
0
RT @JuliaANeagu: 2⃣ years ago, I convinced @freddie_v4 to take the plunge and start @QuotientAI with me. two years into the crazy ride, we'….
0
2
0
RT @ToolUseAI: 🔥The best AI advice for 2025🔥. Two dozen of the top minds in the AI space share their top advice and lessons learned from th….
0
8
0
RT @JnBrymn: I'm doing AI research - comparing it to how humans think. Think quick, simulate a coin flip in your head. What is the result,….
0
1
0
there's a lot of other ideas we had along the way, and more we're going to do (additional announcements next week) but if you want to chat more about this message us freddie or julia at quotientai dot co or on Discord
blog.quotientai.co
Despite widespread adoption of tool use, there has been no dedicated model for evaluating tool-use accuracy—until now.
0
0
2
we assembled everything back into one dataset (train/val/test, etc), and ended up fine-tuning using @UnslothAI and a single GPU on @modal_labs for 0.5B, 3B, and 7B models. if you want to try out the model, you can find it on hugging face here
huggingface.co
1
0
3