MatternJustus Profile Banner
Justus Mattern Profile
Justus Mattern

@MatternJustus

Followers
6K
Following
3K
Media
134
Statuses
954

RL for code | prev. research @PrimeIntellect, @MPI_IS and built revideo

San Francisco, CA
Joined March 2021
Don't wanna be here? Send us removal request.
@Designarena
Design Arena
6 days
GPT-Image-1.5 has taken #1 on Image Arena This marks a sizeable 9-place improvement since GPT-Image-1, establishing the new frontier for Image Generation Congratulations to the team at @OpenAI for this significant contribution
31
65
521
@MatternJustus
Justus Mattern
8 days
Interesting case of GPT-5.1 remembering its training harness when only given a bash tool ("the editing helper I usually use isn’t available in this environment")
1
3
48
@MatternJustus
Justus Mattern
11 days
the fact that a company that has raised over 60M dollars and is able to recruit top AI researchers uses Tinker rather than in-house infra to train frontier models is an incredibly positive sign that Tinker can be used for serious large scale training runs
@thinkymachines
Thinking Machines
11 days
Congratulations to @axiommathai on their achievement! AxiomProver, a mathematics model fine-tuned with Tinker, got top scores on the Putnam Math Competition.
9
12
544
@MatternJustus
Justus Mattern
16 days
update: I did not go to Neurips due to urgent work needs :(
@MatternJustus
Justus Mattern
19 days
I'll be at Neurips starting Thursday - would love to chat about post-training for code and SWE evals / RL environments!
5
0
54
@MatternJustus
Justus Mattern
19 days
I'll be at Neurips starting Thursday - would love to chat about post-training for code and SWE evals / RL environments!
5
2
54
@MatternJustus
Justus Mattern
25 days
Excited about these results! Was a lot of fun building the RL stack with the team ❤️
@PrimeIntellect
Prime Intellect
25 days
Introducing INTELLECT-3: Scaling RL to a 100B+ MoE model on our end-to-end stack Achieving state-of-the-art performance for its size across math, code and reasoning Built using the same tools we put in your hands, from environments & evals, RL frameworks, sandboxes & more
4
0
94
@MatternJustus
Justus Mattern
1 month
TIL that „web applications“ are just a docker container that log events to observability tools
@nlarusstone
Nicholas Larus-Stone
1 month
TIL that “RL envs” are just a docker container that log events to JSON files…
0
0
19
@MatternJustus
Justus Mattern
2 months
if you're an engineer looking for new opportunities, reach out :)
4
1
43
@MatternJustus
Justus Mattern
2 months
so excited about what I'm working on rn
5
1
87
@latkins
Lucas Atkins
2 months
Posted without comment.
@Presidentlin
Lincoln 🇿🇦
2 months
I made this. Jokes aside, devs want big and small models. Trinity is coming soon.
10
9
116
@silasalberti
Silas Alberti
2 months
a long-term investment that is paying off is investing heavily into evals (which we can use RL environments) we aim for a hyper-realistic task distribution and use a variety of different grading techniques: - classical tests (e.g. unit tests, integration tests) for reliably
1
2
31
@MatternJustus
Justus Mattern
2 months
Update: After an incredible year at @PrimeIntellect, I have decided to take my next step in August. Grateful that I got to work with such a talented team and build the best open-source RL infra! For now, I'm continuing to work on RL for coding agents. Will share updates :)
31
8
399
@niloofar_mire
Niloofar
2 months
I'm recruiting students for fall 2026 thru @LTIatCMU & @CMU_EPP, in: 1. Privacy & security of LLMs, coding, long horizon & embodied agents (robotics) 2. Tiny local llms 3. AI for scientific reasoning, esp. chemistry 4. Latent reasoning 5. anything YOU are passionate about!
26
188
1K
@MatternJustus
Justus Mattern
2 months
Next up solving competitive programming challenges from the Waymo entertainment system
@bearlyai
Bearly AI
2 months
Uber will give its drivers in the US an option to make money by doing “digital tasks”. These short minute-long tasks can be done anytime including while idling for passengers: ▫️data-labelling (for AI training) ▫️uploading restaurant menus ▫️recording audio samples of
1
0
17
@MatternJustus
Justus Mattern
2 months
many such cases
@rauchg
Guillermo Rauch
2 months
I had my first BJJ “fight” the other day. I see what the hype is all about now. All my boys are insta-enrolled. Brazil cooked.
4
0
25
@MatternJustus
Justus Mattern
2 months
start date is ASAP btw
2
0
14
@MatternJustus
Justus Mattern
2 months
my turn: if you are interested in working on coding agent research and being a core contributor to what will be an impactful paper, some Stanford friends are working on an industry collab and are looking for motivated researchers to join! Can be paid internship if full-time, DM
@MatternJustus
Justus Mattern
1 year
The only piece of advice I give to undergrads that want to get into research is to cold email PhD students with a good track record. Most undergrads are bottlenecked by research ideas whereas good PhD students have way too many ideas that they cannot execute. If you can code
16
24
377
@MatternJustus
Justus Mattern
2 months
there's nothing like the feeling of looking up from your computer and going for a walk after an incredibly stressful 8h lock-in without food or breaks to meet a deadline
2
0
34