Devin White

@DevinWhiteAI

Followers
19
Following
179
Media
14
Statuses
63

ML Researcher @USAEOP. Pushing RLHF forward & using LLMs to master gameplay.

Joined February 2024
@DevinWhiteAI
Devin White
1 month
Thank you to everyone who stopped by our presentations yesterday! I had a great time sharing our work and chatting with so many of you.
0
0
0
@DevinWhiteAI
Devin White
1 month
If you're at #ICML2025 🇨🇦, join us today for our "Too Big to Think" oral presentation at 9:30 AM (Room 215-216) and "Multi-Task Reward Learning from Human Ratings" poster at 12 PM (Ballroom A)! See you there! #TinyTitans #RLHF
0
0
0
@DevinWhiteAI
Devin White
2 months
Big news! 🎉 Our paper “Multi-Task Reward Learning from Human Ratings” was accepted to the Models of Human Feedback for AI Alignment workshop at #ICML2025! In this paper we treat ratings not just as class labels, but as rich reward signals with underlying structure and scale.
0
1
3
@DevinWhiteAI
Devin White
3 months
🚨Exciting news!🚨 Our paper, “Too Big to Think: Capacity, Memorization, and Generalization in Pre-Trained Transformers”, was accepted for an oral presentation at the Tiny Titans: The next wave of On-Device Learning for Foundational Models workshop (@tinytitans_icml) at.
0
1
3
@DevinWhiteAI
Devin White
3 months
🚨 Big news! Simple RbRL now has a sleek, user-friendly interface! 🖥️✨ You can now:
✅ Rate trajectories directly in the UI
✅ Train RL agents with human feedback
✅ Explore Rating-based RL hands-on with ease
This lightweight, open-source tool makes RbRL accessible to.
0
0
3
@DevinWhiteAI
Devin White
4 months
RT @rowancheung: OpenAI just dropped a GitHub connector for ChatGPT’s Deep Research. Now you can plug into GitHub repos to search code, sca…
0
22
0
@DevinWhiteAI
Devin White
5 months
🚀 Big update to Atari-GPT!
✨ Progress bar during testing (steps & reward)
✨ Cleaner function definitions for ease of use
✨ Easy game/model selection via CLI
✨ New analysis file to visualize results
Perfect for Atari AI fans! Try it out and share your results!
0
0
2
@DevinWhiteAI
Devin White
6 months
🎉I’m excited to say that I have reached a small but personal milestone of 20 citations! I want to say a huge thank you to everyone who I have had the honor of collaborating with and I'm excited for what's next!
1
0
3
@DevinWhiteAI
Devin White
6 months
Learning from Negative Feedback, Positive Feedback, or Both: RbRL2.0: Integrated Reward and Policy Learning for Rating-based Reinforcement Learning:
arxiv.org
Reinforcement learning (RL), a common tool in decision making, learns policies from various experiences based on the associated cumulative return/rewards without treating them differently. On the...
0
0
1
@DevinWhiteAI
Devin White
6 months
Had a blast presenting Atari-GPT at the Toward Knowledgeable Foundation Models Workshop @RealAAAI! Check out the full paper here: #AI #AAAI2025 #LLMs
0
0
3
@DevinWhiteAI
Devin White
6 months
RT @ManlingLi_: Today is the day! Welcome to our 2nd workshop on Knowledgeable Foundation Models in Room 112. Come and talk with these won…
0
19
0
@DevinWhiteAI
Devin White
6 months
Curious how Atari-GPT blends Atari's retro feel with advanced LLMs? Discover the magic here:
0
1
4
@DevinWhiteAI
Devin White
6 months
RT @MdSunbeam: @DevinWhiteAI Paper here:
0
1
0
@DevinWhiteAI
Devin White
6 months
At #AAAI2025? Curious if #LLMs (#Gemini, #ChatGPT, #Claude) can game? 🕹️ Join me tomorrow at 5pm EST for 'Atari-GPT' at the Toward Knowledgeable Foundation Models Workshop! Work done alongside @MdSunbeam.
1
2
8
@DevinWhiteAI
Devin White
6 months
Enhanced collision detection in my ASCII Breakout game to test GPT-4o. With this change, it got better results than ever! See the code and try it out: #AI #LLM
0
0
3
@DevinWhiteAI
Devin White
6 months
Exciting update on my Simple RLHF codebase! New results replicate the original RbRL paper, but cut training time by ~33% and run smoothly on modern hardware (like M-series Apple Silicon). Curious? Dive into the details here: #RLHF #AI #MachineLearning
1
0
5