Michael C. Mozer Profile
Michael C. Mozer

@mc_mozer

Followers
725
Following
13
Media
3
Statuses
19

Research Scientist, Google Brain now DeepMind where cognitive science and machine learning meet

San Francisco, CA
Joined January 2022
Don't wanna be here? Send us removal request.
@mc_mozer
Michael C. Mozer
20 days
[3/4] To train the model to calibrate its uncertainty and use <don't know> outputs judiciously, we frame the selection of each output token as a sequential-decision problem with a time penalty. We refer to the class of methods as โ€œCatch Your Breathโ€ losses.
0
0
5
@mc_mozer
Michael C. Mozer
20 days
[2/4] The model can request additional compute steps for any token by emitting a <don't know> output. If the model is granted a delay, a <pause> token is inserted at the next input step, providing the model with additional compute resources to generate an output.
0
0
4
@mc_mozer
Michael C. Mozer
20 days
[1/4] As you read words in this text, your brain adjusts fixation durations to facilitate comprehension. Inspired by human reading behavior, we propose a supervised objective that trains an LLM to dynamically determine the number of compute steps for each input token.
4
10
25
@dannypsawyer
Danny Sawyer
27 days
Happy to announce that our work has been accepted to workshops on Multi-turn Interactions and Embodied World Models at #NeurIPS2025! Frontier foundation models are incredible, but how well can they explore in interactive environments? Paper๐Ÿ‘‡ https://t.co/8Q9j1VMTYv ๐Ÿงต1/13
1
5
23
@_EffieLi_
Effie Li
1 month
๐ŸŒŸTo appear in the MechInterp Workshop @ #NeurIPS2025 ๐ŸŒŸ Paper: https://t.co/fJS0eripxX How do language models (LMs) form representation of new tasks, during in-context learning? We study different types of task representations, and find that they evolve in distinct ways. ๐Ÿงต1/7
1
15
105
@ShoaibASiddiqui
Shoaib Ahmed Siddiqui
5 months
[๐Ÿ“œ1/9] Does machine unlearning truly erase data influence? Our new paper reveals a critical insight: 'forgotten' information often isn't goneโ€”it's merely dormant, and easily recovered by fine-tuning on just the retain set.
2
11
51
@KarchitK
Archit Karandikar
11 months
We are announcing the launch of Airial Travelโ€™s open-to-all beta version for desktop today. Airial is your personal travel agent with AI superpowers which makes planning and booking trips as easy as dreaming them up. https://t.co/KKO8D5XnEn Me and Sanjeev co-founded Airial
9
12
45
@agopal42
Anand Gopalakrishnan
1 year
Excited to present "Recurrent Complex-Weighted Autoencoders for Unsupervised Object Discovery" at #NeurIPS2024! TL;DR: Our model, SynCx, greatly simplifies the inductive biases and training procedures of current state-of-the-art synchrony models. Thread ๐Ÿ‘‡ 1/x.
2
41
165
@Michael_Lepori
Michael Lepori
1 year
The ability to properly contextualize is a core competency of LLMs, yet even the best models sometimes struggle. In a new preprint, we use #MechanisticInterpretability techniques to propose an explanation for contextualization errors: the LLM Race Conditions Hypothesis. [1/9]
5
15
103
@mengyer
Mengye Ren
2 years
๐Ÿ” New LLM Research ๐Ÿ” Conventional wisdom says that deep neural networks suffer from catastrophic forgetting as we train them on a sequence of data points with distribution shifts. But conventions are meant to be challenged! In our recent paper led by @YanlaiYang, we discovered
3
40
217
@gamaleldinfe
Gamaleldin Elsayed
2 years
Nature Comms paper: Subtle adversarial image manipulations influence both human and machine perception! We show that adversarial attacks against computer vision models also transfer (weakly) to humans, even when the attack magnitude is small. https://t.co/O7skDZe6zU
12
89
386
@doomie
Dumitru Erhan
3 years
1/ Today we are excited to introduce Phenaki: https://t.co/7xkcoeuXwB, short-link-to-paper, a model for generating videos from text, with prompts that can change over time, and that is able to generate videos that can be as long as multiple minutes!
36
393
2K
@sundarpichai
Sundar Pichai
3 years
Two important breakthroughs from @GoogleAI this week - Imagen Video, a new text-conditioned video diffusion model that generates 1280x768 24fps HD video. And Phenaki, a model which generates long coherent videos for a sequence of text prompts. https://t.co/nTs67r21Sf
56
283
2K
@tkipf
Thomas Kipf
3 years
We are excited to make the jump to complex real-world data with this class of models โ€” and about the potential that slot-based models have for reducing the need for detailed human supervision when learning about the physical world. 6/7
1
1
6
@tkipf
Thomas Kipf
3 years
Excited to share our work on self-supervised video object representation learning: We introduce SAVi++, a slot-based video model that โ€” for the first time โ€” scales to Waymo Open driving scenes w/o direct supervision. ๐Ÿ–ฅ๏ธ https://t.co/eBAW2ijs6c ๐Ÿ“œ https://t.co/tbjZWgdQEK 1/7
3
33
174
@mc_mozer
Michael C. Mozer
4 years
Overcoming temptation: Incentive design for intertemporal choice https://t.co/SalyyRQHpd We use AI models to help individuals adhere to long-term goals (e.g., retirement savings, weight loss) and avoid giving in to temptation.
1
5
14
@mc_mozer
Michael C. Mozer
4 years
My first tweet
20
1
88