Daniel Winter
@_daniel_winter_
Followers
102
Following
55
Media
10
Statuses
33
We introduce ObjectDrop, our recent @GoogleAI project, aimed at achieving photorealistic object removal and insertion. Explore our project page: https://t.co/GOj5uAIF3v Arxiv: https://t.co/0tNxic4mUI
8
34
131
Excited to share this has now been accepted at #NeurIPS2025 as a position paper (<6% acceptance)!π We advocate for systematically studying entire model populations via weight-space learning, and argue that this requires charting them in a Model Atlas. @NeurIPSConf #NeurIPS π§΅π
π¨ New paper alert! π¨ Millions of neural networks now populate public repositories like Hugging Face π€, but most lack documentation. So, we decided to build an Atlas πΊοΈ Project: https://t.co/1JpsC6dCeg Demo: https://t.co/4Xy7yLdIZY π§΅ππ» Here's what we found:
0
21
64
We are releasing a paper I'm very excited about. We know test-time scaling is a path to greatly improved results, and achieves reasoning in the case of LLMs. We present a new and promising way to amortize it into training using HyperNetworks for image generation models.
8
45
314
@karpathy Thanks for the inspiring talk (as always!). I'm the author of the Model Atlas. I'm delighted you liked our work, seeing the figure in your slides felt like an "achievement unlocked"πWould really appreciate a link to our work in your slides/tweet https://t.co/rJjBhMmRjf
Nice - my AI startup school talk is now up! Chapters: 0:00 Imo fair to say that software is changing quite fundamentally again. LLMs are a new kind of computer, and you program them *in English*. Hence I think they are well deserving of a major version upgrade in terms of
1
9
48
When we "erase" a concept from a diffusion model, is that knowledge truly gone? π€ We investigated, and the answer is often 'no'! Using simple probing techniques, the knowledge traces of the erased concept can be easily resurfaced π Here is what we learned π§΅π
1
8
33
A quick look on what we were working on in the past months π more exciting things to come ππ»
Add and remove objects π« insert or remove items or characters in your videos all while matching the consistency and style of your scene. π We can remove a spaceship from the backdrop. π¦ And add a rubber duck to a panning shot.
0
1
4
Google presents LightLab Controlling Light Sources in Images with Diffusion Models
14
154
867
In our #ICLR2025 paper, we introduce WIND π¬οΈ A method that embeds a distortion watermark directly in the diffusion noise! Our method ensures that the watermark in one image does not reveal information about the watermark in other images π€« π https://t.co/jmEw4g3scN (1/5)
1
21
64
π¨ New paper alert! π¨ Millions of neural networks now populate public repositories like Hugging Face π€, but most lack documentation. So, we decided to build an Atlas πΊοΈ Project: https://t.co/1JpsC6dCeg Demo: https://t.co/4Xy7yLdIZY π§΅ππ» Here's what we found:
5
18
86
Crazy work by colleagues in my team!
π Excited to share ObjectMate, our latest @GoogleAI project! A new approach to zero-shot subject-driven generation and object insertion. π Explore our project page: https://t.co/8ymD8QiiIT π Arxiv: https://t.co/Mvdr4lY5Wk
1
2
15
π¨ Excited to share ObjectMate our latest from @GoogleAI for zero-shot subject-driven generation & insertion π¨ π project page:Β https://t.co/uBeJgccH3W πArxiv:Β https://t.co/J2Ya8TLLeP
arxiv.org
This paper introduces a tuning-free method for both object insertion and subject-driven generation. The task involves composing an object, given multiple views, into a scene specified by either an...
π Excited to share ObjectMate, our latest @GoogleAI project! A new approach to zero-shot subject-driven generation and object insertion. π Explore our project page: https://t.co/8ymD8QiiIT π Arxiv: https://t.co/Mvdr4lY5Wk
0
1
7
Many thanks to my great collaborators @ShulAsaf Matan Cohen @DanaBerman9 @Yxp52492 Alex Rav-Acha @YHoshen
0
0
1
5/5 For object insertion, we leveraged an ObjectDrop ( https://t.co/qqmKcCLsZP) data synthesis stage, where we remove objects with their shadows/reflections. This creates high-quality background images for supervised training, leading to SOTA results with either 1 or 3 references.
1
0
0
4/5 Our dataset powers supervised subject-driven generation. Given 3 reference images of an object and a text prompt, our model generates the object in a new context with remarkable identity preservation - all without test-time fine-tuning.
1
0
0
3/5 We also find that larger datasets have higher rates of repeating objects. From a web-scale dataset with 55M detected objects, we extracted 4.5M objects, each with at least 3 distinct views with diverse poses and scenes.
1
0
0
2/5 But finding these repetitions requires specialized similarity features. While others have used semantic features like CLIP or DINO, we show that itβs crucial to use tailored features for instance retrieval.
1
0
1
1/5 Our analysis reveals that large datasets like WebLI contain objects that reappear in different poses and scenes (e.g., car models, laptops, IKEA furniture). We call this the Object Recurrence Prior. We used it to create a dataset of 4.5M objects, each with multiple views.
1
0
1
π Excited to share ObjectMate, our latest @GoogleAI project! A new approach to zero-shot subject-driven generation and object insertion. π Explore our project page: https://t.co/8ymD8QiiIT π Arxiv: https://t.co/Mvdr4lY5Wk
2
9
58
I'm sharing something unique we've been making at Google (w/ UNC). We are releasing our work on a new class of interactive experiences that we call generative infinite games, essentially video games where the game mechanics and graphics are fully subsumed by generative models π§΅
122
280
2K
Giving a talk about common neurons in vision models and emergent representations in diffusion model weights today at @eccvconf βΊοΈ
We are organizing a new workshop on "Knowledge in Generative Models" at #ECCV2024 to explore how generative models learn representations of the visual world and how we can use them for downstream applications. For the schedule and more details, visit our website: πWebsite:
1
3
49
With friends at @Google we announce π Magic Insert π - a generative AI method that allows you to drag-and-drop a subject into an image with a vastly different style achieving a style-harmonized and realistic insertion of the subject (Thread π§΅) web: https://t.co/32gsziKge1
28
107
551