Fulcrum
@fulcrumML
Followers
269
Following
31
Media
3
Statuses
17
Joined August 2025
Today, we’re announcing Fulcrum Research, a startup scaling human oversight. We are building debuggers that tell you why your agents fail, and what your rewards are truly testing for—the first step toward the inference-time infrastructure required to safely deploy agents.
19
29
199
Check it out on GitHub! https://t.co/CRxg4LuwFu
github.com
Contribute to fulcrumresearch/quibbler development by creating an account on GitHub.
0
0
4
Introducing Quibbler MCP! Now you can use Quibbler to critique any coding agent, not just Claude Code. Here's a demo of Quibbler working on Cursor:
1
4
12
.@fulcrumML, one of the highest firepower s25 teams I worked with is open-sourcing some practical tools for coding agents today: orchestration, monitoring, and critique. The best AI tools don't just make models more capable, they make people more capable. Orchestra from the
Today, we’re excited to announce that we’re open-sourcing two tools: Quibbler is a background agent that critiques your coding agent’s actions. Orchestra is a multi-agent coding system: it uses a designer agent that spawns and coordinates your parallel coding agents.
6
7
32
Real-time oversight of agent systems will be critical as agents scale. If you’re interested, we’d love to hear from you. We’re hiring!
0
0
7
We think oversight features, like the ones we built into Quibbler and Orchestra, are what makes it possible for parallelization to be useful, and not destructive.
1
0
8
In the worst case, your attention is spent on the behavior of your agents: preventing them from taking unsafe actions, making sure they're not lying to you, and understanding if their outputs are correct.
1
0
5
In the best case, coding with agents allows your attention to be spent on the “right” parts of your code: its functionality, architecture, and its failure-modes.
1
0
5
Orchestra enables true multi-agent coding: parallel execution, active coordination, and full visibility of your coding agents. A designer spawns executors working in isolated environments. When an executor needs help, it messages the designer (getting your input if needed).
1
0
5
Quibbler uses hooks to review what your agents are doing, making sure they’re running tests, following your coding style, and not fabricating results. In longer running tasks, we found Quibbler useful in enforcing intent, allowing us to check in on our agent less.
1
0
7
Try them out and install on PyPI: Quibbler: https://t.co/7t5BuKqjMb Orchestra: https://t.co/FEwolma1u2 Launch post:
1
0
9
live monitoring of agents in real-world settings allows us to learn lessons about AI control that can later be applied to x-risk mitigation. hoping this open-source launch helps control researchers experiment with more realistic threat models!
Today, we’re excited to announce that we’re open-sourcing two tools: Quibbler is a background agent that critiques your coding agent’s actions. Orchestra is a multi-agent coding system: it uses a designer agent that spawns and coordinates your parallel coding agents.
0
1
6
Today, we’re excited to announce that we’re open-sourcing two tools: Quibbler is a background agent that critiques your coding agent’s actions. Orchestra is a multi-agent coding system: it uses a designer agent that spawns and coordinates your parallel coding agents.
9
14
130
> Potemkin set up painted facades along the riverbank, so that [...] Catherine would see beautiful villages – each just a couple of inches thick. The rise of AI agents makes the Potemkin problem commonplace. Cool-looking scalable oversight & agent observability startup.
We aim to solve the problem of overseeing agents and understanding their effects on the world. Check out our post on why it will be challenging:
1
1
22
If you are building RL environments, evals, or deploying agents, contact us to try our tooling. https://t.co/a6AN2CWSmj
fulcrumresearch.ai
Research lab building tools to empower human decision making in the age of AI
2
0
20
We aim to solve the problem of overseeing agents and understanding their effects on the world. Check out our post on why it will be challenging:
fulcrumresearch.ai
In 1787, Catherine the Great sailed down the Dnieper to inspect its banks. Her trusted advisor, Governor Potemkin, set out to present those war-torn lands to her in the best possible light. Legend...
1
0
19