
Kyle Corbitt
@corbtt
Followers
16K
Following
5K
Media
209
Statuses
2K
Currently building @OpenPipeAI. Formerly @ycombinator, @google. I am always down to go on a quest.
Seattle, SF
Joined September 2012
Meet ART·E, our open-source RL-trained email research agent that searches your inbox and answers questions more accurately, faster, and cheaper than o3. Let's go deeper on how we built it. 🧵
40
123
997
AI-as-arbitrator feels like a very obvious niche and I'm surprised I haven't seen it productized.
Never thought I'd become a 'take your relationship problems to ChatGPT' person but when the 8yo and I have an argument it actually works really well to mutually agree on an account of events for Claude and then ask for its opinion.
2
0
20
Every consumer company gets RLHF'd, but some get more RLHF'd than others.
We’re making GPT-5 warmer and friendlier based on feedback that it felt too formal before. Changes are subtle, but ChatGPT should feel more approachable now. You'll notice small, genuine touches like “Good question” or “Great start,” not flattery. Internal tests show no rise in.
3
0
27
RT @jamievoynow: Agent Design Pattern: Parallel Rollouts. Inspired by Tree-of-Thought [1] and @corbtt's Universal Reward Function [2], late…
0
50
0
RT @mattshumer_: One super important point: for companies using LLMs at scale, the new open source OpenAI models, combined with RL from fol…
0
4
0
RT @mattshumer_: The @OpenPipeAI team continues to push towards making RL easy. You can now make agents that are superhuman at using speci…
0
6
0
RT @mattshumer_: @heyjchu I think you're describing @OpenPipeAI. Check out their work on RULER (, it's essentially…
0
1
0
RT @mattshumer_: @iScienceLuvr ART by @OpenPipeAI. Makes it so easy to do multi-turn agentic rollouts.
0
1
0
I generally don't post about politics; it isn't my area of expertise and isn't what my followers are here for. But if, like me, you've had a hard time understanding the situation in Gaza given the hyper-partisan reporting on both sides, this story may be interesting. I find it.
“When I entered Gaza the Israeli military had a rule: I was only allowed to bring in three kilos of food. As I was weighing out protein bars, trying to get under the limit, I said to my husband: ‘How sinister is this?’ I’m a humanitarian aid worker. Why would there even be a
3
9
90
@dvdcrbt And if you find this useful (or just think it's cool), please ⭐️ the Agent Reinforcement Trainer repo on GitHub! That'll help more people find us!
github.com
Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen2.5, Qwen3, Llama, and more! - OpenPipe/ART
0
3
40