Max Bartolo
@max_nlp
Followers: 3K
Following: 3K
Media: 65
Statuses: 823
Researcher @GoogleDeepMind & co-chair @DynabenchAI @MLCommons. Previously @Cohere, @MetaAI/FAIR & @BloomsburyAI.
Joined November 2016
Just arrived at #EMNLP2025 in Suzhou. Looking forward to meeting with everyone! Will be giving an oral presentation of our paper "No Need For Explanations: LLMs Can Implicitly Learn from Mistakes In Context" this Friday 7th November at 11.30 am in Hall A108 🎤
1
8
81
Cohere Labs x EMNLP 2025: "No Need for Explanations: LLMs can implicitly learn from mistakes in-context" This work demonstrates that eliminating explicit corrective rationales from incorrect answers improves Large Language Models' performance in math reasoning tasks,
0
3
12
Thanks for having me! Fantastic to see such innovative research happening at @imperialcollege and inspiring to meet so many brilliant students shaping the future of AI research!
0
3
25
A new season of the ICARL seminars begins🥳🥳 This year we are kicking off with @max_nlp from @GoogleDeepMind! Max will be sharing his insights through various works with LLMs and RL. 🗓️Oct 17th, 5 pm 📍Huxley Building, Imperial College London, room 311 @ICComputing This is
eventbrite.com
Context: From Tokens to Capabilities
1
3
7
Atla automatically detects errors in your AI agents. Today we're live on Product Hunt 😺 We'd love your support and feedback! https://t.co/VvaXeStiz8 PS - to celebrate, we’re releasing the real launch video.
6
3
25
One internship. Two top-tier papers: an #EMNLP2025 oral AND a #NeurIPS2025 spotlight 🤯 While publications are far from everything, this is still a pretty remarkable achievement. HUGE congrats to @LisaAlazraki and team! 👏
✨ Accepted as a Spotlight at #NeurIPS2025! Huge thanks to my coauthors and everyone who supported us. Check out the details below 👇
1
3
82
This will be an oral! 🎤 See you at #EMNLP25
1
4
28
Today we’re launching Atla — the improvement engine for AI agents. Atla helps agent builders find and fix recurring failures. Instead of just surfacing traces, Atla automatically identifies your agent’s most critical failure patterns and suggests targeted fixes.
12
17
61
Super proud of @LisaAlazraki and team (@maximilianmozes @jaa_campos @yichern_tan) on the first of her two @cohere internship papers getting accepted to #EMNLP2025! 👏
0
0
26
🚨Life update:🚨 After 3 wonderful years, I’ve decided it’s time for me to move on from Cohere. I'm incredibly grateful to have been trusted with building out Cohere's post-training capabilities -- from our first Command Nightly models that topped the HELM leaderboard, to Command
18
3
188
I’m building a new team at @GoogleDeepMind to work on Open-Ended Discovery! We’re looking for strong Research Scientists and Research Engineers to help us push the frontier of autonomously discovering novel artifacts such as new knowledge, capabilities, or algorithms, in an
job-boards.greenhouse.io
91
262
3K
Some of the real-world challenges of building for representation
This is one of my favorite sections in the Aya dataset paper. It is towards the end of the paper, so probably isn't read often. It speaks to how the end breakthrough was completely intertwined with the geo-reality experienced by independent researchers around the world.
0
0
8
NeurIPS is pleased to officially endorse EurIPS, an independently-organized meeting taking place in Copenhagen this year, which will offer researchers an opportunity to additionally present their accepted NeurIPS work in Europe, concurrently with NeurIPS. Read more in our blog
10
122
805
🎤 Meet our expert panelists! Join Albert Gu, Alisa Liu, Kris Cao, Sander Land, and Yuval Pinter as they discuss the Future of Tokenization on July 18 at 3:30 PM at TokShop at #ICML2025.
0
10
38
Really enjoyed discussing the state of AI benchmarking alongside Prof Mark Bishop, @IAmTimNguyen, Enzo Blindow & @ecsquendor at @MLStreetTalk's first in-person event in London yesterday. Looking forward to many more!
1
3
19
LLMs can be programmed by backprop 🔎 In our new preprint, we show they can act as fuzzy program interpreters and databases. After being ‘programmed’ with next-token prediction, they can retrieve, evaluate, and even *compose* programs at test time, without seeing I/O examples.
4
57
315
We’re looking for a Research Engineer / Scientist with a focus on Data Analysis and Evaluation to join the post-training team at Cohere! More details and application here: https://t.co/cwlKrvZfrH Feel free to reach out if you'd like to know more!
jobs.ashbyhq.com
Play a pivotal role in ensuring the quality, reliability, and performance of our large language models (LLMs).
4
19
114
Looking forward to sharing some of our recent research contributions at @MLStreetTalk's first London AI meetup 🤩
We are running our first physical event in London on 14th July! We have Tim Nguyen @IAmTimNguyen from DeepMind and Max Bartolo @max_nlp from Cohere and Enzo Blindow (VP of Data, Research & Analytics) at @Prolific joining us. Not many seats for the first one.
0
3
21
Kudos to @cohere for releasing 6 proper research papers in May alone, while publications of other western labs increasingly read like advertisements! I recently read the Command A technical report and it contains much more detail than other model reports. Looking at recent
2
14
150
the command-a paper is one of my top 5 papers of the year for sure https://t.co/faBHnOOqUx
5
18
312