Kyle Corbitt

@corbtt

Followers
16K
Following
5K
Media
209
Statuses
2K

Currently building @OpenPipeAI. Formerly @ycombinator, @google. I am always down to go on a quest.

Seattle, SF
Joined September 2012
@corbtt
Kyle Corbitt
4 months
🚀 Meet ART·E—our open-source RL-trained email research agent that searches your inbox and answers questions more accurately, faster, and cheaper than o3. Let's go deeper on how we built it. 🧵
40
123
997
@corbtt
Kyle Corbitt
24 hours
AI-as-arbitrator feels like a very obvious niche and I'm surprised I haven't seen it productized.
@KelseyTuoc
Kelsey Piper
1 day
Never thought I'd become a 'take your relationship problems to ChatGPT' person but when the 8yo and I have an argument it actually works really well to mutually agree on an account of events for Claude and then ask for its opinion.
2
0
20
@corbtt
Kyle Corbitt
5 days
Every consumer company gets RLHF'd, but some get more RLHF'd than others.
@OpenAI
OpenAI
5 days
We’re making GPT-5 warmer and friendlier based on feedback that it felt too formal before. Changes are subtle, but ChatGPT should feel more approachable now. You'll notice small, genuine touches like “Good question” or “Great start,” not flattery. Internal tests show no rise in.
3
0
27
@corbtt
Kyle Corbitt
5 days
RT @jamievoynow: Agent Design Pattern: Parallel Rollouts. Inspired by Tree-of-Thought [1] and @corbtt's Universal Reward Function [2], late…
0
50
0
@corbtt
Kyle Corbitt
6 days
RT @mattshumer_: One super important point: for companies using LLMs at scale, the new open source OpenAI models, combined with RL from fol…
0
4
0
@corbtt
Kyle Corbitt
6 days
RT @mattshumer_: The @OpenPipeAI team continues to push towards making RL easy. You can now make agents that are superhuman at using speci…
0
6
0
@corbtt
Kyle Corbitt
6 days
RT @mattshumer_: @heyjchu I think you're describing @OpenPipeAI. Check out their work on RULER (, it's essentially…
0
1
0
@corbtt
Kyle Corbitt
6 days
No.
@JacksonAtkinsX
Jackson Atkins
14 days
@corbtt Hey @OpenPipeAI can you stop winning?
0
0
7
@corbtt
Kyle Corbitt
6 days
RT @fgblanch: @eugeneyan Not SFT but I would consider ART by @OpenPipeAI.
0
1
0
@corbtt
Kyle Corbitt
6 days
RT @mattshumer_: @iScienceLuvr ART by @OpenPipeAI. Makes it so easy to do multi-turn agentic rollouts.
0
1
0
@corbtt
Kyle Corbitt
8 days
I generally don't post about politics; it isn't my area of expertise and isn't what my followers are here for. But if, like me, you've had a hard time understanding the situation in Gaza given the hyper-partisan reporting on both sides, this story may be interesting. I find it.
@humansofny
Brandon Stanton
9 days
“When I entered Gaza the Israeli military had a rule: I was only allowed to bring in three kilos of food. As I was weighing out protein bars, trying to get under the limit, I said to my husband: ‘How sinister is this?’ I’m a humanitarian aid worker. Why would there even be a
3
9
90
@corbtt
Kyle Corbitt
13 days
Killer feature for the ChatGPT UI would be a "flag this answer as terrible" button. I want to see a filtered list of all my failed chats so I can immediately retry them with GPT-5.
2
0
17
@corbtt
Kyle Corbitt
14 days
GitHub: Discord:
0
1
15
@corbtt
Kyle Corbitt
14 days
Agent Reinforcement Trainer has taken off like a rocket since we launched RULER a couple weeks ago. Today, we passed 5,000 stars on GitHub! The community is super friendly and active and it has never been easier to get started with RL. Come join us on GitHub/Discord!
16
24
420
@corbtt
Kyle Corbitt
14 days
@dvdcrbt And if you find this useful (or just think it's cool), please ⭐️ the Agent Reinforcement Trainer repo on GitHub! That'll help more people find us!
github.com
Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen2.5, Qwen3, Llama, and more! - OpenPipe/ART
0
3
40
@corbtt
Kyle Corbitt
14 days
all credit to @dvdcrbt, this was his project 🙂
5
1
41
@corbtt
Kyle Corbitt
14 days
MCP•RL is fully open source and is released as part of the Agent Reinforcement Trainer (ART) project. We have an example notebook training Qwen2.5 to use an MCP server here!
4
7
118
@corbtt
Kyle Corbitt
14 days
How does it work? When you connect a server, MCP•RL:
1. Queries the server to get a list of tools.
2. Uses a strong model to brainstorm tasks that the tools might be useful for.
3. Tries to complete the task using the tools.
4. Improves using RULER.
In practice, it trains great!
2
7
115
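The four MCP•RL steps listed in the tweet above can be sketched roughly as the loop below. This is only an illustration under stated assumptions, not the ART or MCP•RL API: `mcp_rl_round` and all four callables are hypothetical names, and `score_attempts` is a placeholder standing in for RULER, whose internals the tweet does not describe. The actual model update is left out.

```python
from typing import Callable, Dict, List, Tuple


def mcp_rl_round(
    list_tools: Callable[[], List[str]],
    brainstorm_tasks: Callable[[List[str]], List[str]],
    attempt_task: Callable[[str, List[str]], str],
    score_attempts: Callable[[str, List[str]], List[float]],
    attempts_per_task: int = 4,
) -> Dict[str, List[Tuple[str, float]]]:
    """One hypothetical data-collection round; every callable is a stand-in."""
    # Step 1: ask the server which tools it exposes.
    tools = list_tools()
    # Step 2: have a strong model propose tasks those tools could help with.
    tasks = brainstorm_tasks(tools)

    scored: Dict[str, List[Tuple[str, float]]] = {}
    for task in tasks:
        # Step 3: the model being trained tries each task several times.
        attempts = [attempt_task(task, tools) for _ in range(attempts_per_task)]
        # Step 4 (placeholder for RULER): score the attempts for this task.
        scores = score_attempts(task, attempts)
        scored[task] = list(zip(attempts, scores))
    # A real system would pass these scored attempts to an RL trainer here.
    return scored
```

Passing plain functions for each step keeps the sketch runnable with toy stubs, for example a `list_tools` that returns a fixed list of tool names.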
@corbtt
Kyle Corbitt
14 days
Announcing MCP•RL: teach your model how to use any MCP server automatically using reinforcement learning! Just connect any MCP server, and your model will start playing with it and (using RL) "learn from experience" how to use its tools most effectively!
55
198
2K
@corbtt
Kyle Corbitt
15 days
@vikhyatk
vik
15 days
Interesting take from the HF comments. Would make sense that it's pretrained primarily on synthetic data vs internet text -- reduces the risk of jailbreaks, accidental harmful content, copyright etc. (I still think it's a useful model though!)
2
0
20
@corbtt
Kyle Corbitt
15 days
symptom:
@bjoern_pl
Björn Plüster
15 days
gpt-oss 120B is very blatantly incapable of producing linguistically correct German text. 🧵
1
1
20