bradthilton Profile Banner
Brad Hilton Profile
Brad Hilton

@bradthilton

Followers
868
Following
90K
Media
312
Statuses
3K

Reinforcement Learning Research Engineer • Sometimes Political Commentator • Husband and Father • Believer in Jesus Christ

Orem, UT
Joined February 2013
Don't wanna be here? Send us removal request.
@bradthilton
Brad Hilton
1 day
looks like there's a sweet spot for planning.
@arankomatsuzaki
Aran Komatsuzaki
2 days
Learning When to Plan. LLM agents trained with dynamic planning learn when to spend test-time compute, balancing cost & performance. This is the first work to explore training LLM agents for dynamic test-time compute allocation in sequential decision-making tasks.
Tweet media one
0
0
2
@bradthilton
Brad Hilton
1 day
this would be really entertaining.
@ZohranKMamdani
Zohran Kwame Mamdani
2 days
Enough with the backroom scheming. If @realDonaldTrump is serious about intervening in the mayoral race, he should come to New York City and debate me directly.
0
0
1
@bradthilton
Brad Hilton
1 day
is this mostly due to distillation of claude models by chinese labs?.
@Yuchenj_UW
Yuchen Jin
1 day
Anthropic banned Claude in certain regions, explicitly labeling China as an “adversarial nation” in yesterday’s blog. Many Chinese people say they’re unsubscribing and switching from Claude Code to OpenAI Codex. What did Dario see during his 1 year at Baidu?
Tweet media one
0
0
1
@bradthilton
Brad Hilton
1 day
lord ruler is one of the best villains. i felt the ending to mistborn was a bit anti-climatic, but the payoff for the whole trilogy is huge. definitely worth reading.
@dannypostmaa
Danny Postma
2 days
10/10. Absolutely page turner once you get out of the “introduction part”.
Tweet media one
0
0
2
@bradthilton
Brad Hilton
2 days
RT @Yampeleg: 𝗦𝘁𝗲𝗽 𝟭: Buy a sh*tload of GPUs.𝗦𝘁𝗲𝗽 𝟮:. 𝚠𝚑𝚒𝚕𝚎 𝚃𝚛𝚞𝚎:. θᵢ ← θᵢ − lr · (∂L / ∂θᵢ).𝗦𝘁𝗲𝗽 𝟯: Profit.
0
3
0
@bradthilton
Brad Hilton
3 days
Tweet media one
1
0
2
@bradthilton
Brad Hilton
3 days
RT @dvdcrbt: @altryne @OpenPipeAI This is going to be epic!.
0
1
0
@bradthilton
Brad Hilton
3 days
RT @corbtt: Ok, some big news that I've been sitting on for a minute: @openpipeai is getting acquired by @coreweave!.
0
15
0
@bradthilton
Brad Hilton
3 days
RT @l2k: CoreWeave is buying @OpenPipeAI - I'm a big fan of the @corbtt and the ART RL library and excited to work together!.
0
6
0
@bradthilton
Brad Hilton
4 days
just 30 hours 🤯 insane.
@corbtt
Kyle Corbitt
4 days
🚨 We’ve just published a recipe to train a frontier-level deep research agent using RL. With just 30 hours on an H200, any developer can now beat Sonnet-4 on DeepResearch Bench using open-source tools. (Thread 🧵)
Tweet media one
0
0
4
@bradthilton
Brad Hilton
4 days
💯.
@Suhail
Suhail
4 days
You cannot force highly skilled people to work a set number of hours. They will just pretend to which is easy to do. They’ll read Reddit, make easy tasks stretch in time, have a longer lunch, longer meetings, etc. The only way I’ve seen or experienced seeing people work a lot of.
0
0
4
@bradthilton
Brad Hilton
5 days
@tokenbender
tokenbender
6 days
burnouts are caused by ego depletion. the deeper your conviction about "it might look so over but we would be so back again", the longer you last. and if that depth of conviction is a bottomless hole, you NEVER EVER burn out.
0
0
2
@bradthilton
Brad Hilton
5 days
burnout comes from working on something you don’t believe in.
@snowmaker
Jared Friedman
6 days
If you tell your friends you're burned out, they'll always prescribe the same thing: "You need to take a vacation". But it never works. Burnout is completely misunderstood. This new article by @bscholl is the best explanation I've seen.
1
0
6
@bradthilton
Brad Hilton
9 days
agree with the shorthand observation. the writing is a bit too dense. o3 was easier to read.
@paulnovosad
Paul Novosad
10 days
I find GPT5 "thinking" to be much less clear than o3 was. Responses are full of jargon, shorthand, much harder to use and interpret. It's like talking to someone who wants to show off his expertise but doesn't really want to help. Is it just me?.
0
0
1
@bradthilton
Brad Hilton
10 days
every startup should do this from series a on.
@karrisaarinen
Karri Saarinen
10 days
In our Series C round at @linear, we gave all current and former teammates the opportunity to sell a portion of their vested options. From the start, we’ve aimed to make Linear’s equity program as employee-friendly as possible. Now including path to liquidity.
2
0
3
@bradthilton
Brad Hilton
10 days
RT @dvdcrbt: Small update to MCP•RL: you can now automatically generate training scenarios through the art.mcp package.
0
2
0
@bradthilton
Brad Hilton
10 days
RT @corbtt: Really cool post by a community member sharing some of their results running RL with ART!.
0
6
0
@bradthilton
Brad Hilton
12 days
RT @mattshumer_: OpenPipe is the singular player bringing RL to regular devs. They make it as simple as hell to build products that learn….
0
5
0
@bradthilton
Brad Hilton
12 days
RT @corbtt: Super excited to announce the official integration between ART and LangGraph!. You can now easily train your LangGraph agents w….
0
40
0
@bradthilton
Brad Hilton
12 days
making the agent reinforcement trainer more accessible to langgraph users with this latest integration. amazing work by andie jones from openpipe!.
@corbtt
Kyle Corbitt
12 days
Super excited to announce the official integration between ART and LangGraph!. You can now easily train your LangGraph agents with reinforcement learning — automatically improving reasoning, tool use, and adaptability. More info below:
Tweet media one
0
0
4