Daniel Watson Profile
Daniel Watson

@watson_nn

Followers
1K
Following
742
Media
10
Statuses
95

Research Scientist @GoogleDeepMind. Opinions my own. 🇵🇦

Toronto, Ontario
Joined October 2017
@watson_nn
Daniel Watson
1 month
A much-needed step in world modeling. By generating 360° panos, world models will truly track everything happening around them, serving as a better form of memory. Huge shoutout to @Dazitu_616 for spearheading this awesome work. It was amazing to work with you!
@Dazitu_616
Ziyi Wu
1 month
📢Introducing 360Anything, our method for lifting any perspective image or video to gravity-aligned 360° panoramas without using any camera or 3D information. This enables consistent novel view synthesis and 3D scene reconstruction. Project page: https://t.co/qTOEip0Jw2 🧵
0
0
6
@DBahdanau
🇺🇦 Dzmitry Bahdanau
10 months
Adam deserves the award, but in Singapore everyone still uses SGD
23
63
788
@watson_nn
Daniel Watson
2 years
All results are unedited outputs from a single diffusion model. There is no NeRF or any postprocessing steps involved. Please check out our website for many more samples! https://t.co/yojdRhgNEn
0
1
8
@watson_nn
Daniel Watson
2 years
While still nowhere near perfect, 4DiM excels at very difficult cases like 360° from a single image:
1
1
6
@watson_nn
Daniel Watson
2 years
Or create videos in between images (without any camera poses):
1
0
3
@watson_nn
Daniel Watson
2 years
4DiM can also manipulate videos, e.g. we can re-create them with different cameras:
1
0
3
@watson_nn
Daniel Watson
2 years
Scenes are difficult because there is little training data with camera poses, and poses are usually noisy / lack meaningful scale. To get around this, we leverage large-scale (unposed) video data. This lets us also control time:
1
0
2
@watson_nn
Daniel Watson
2 years
[[THREAD]] Happy to announce 4DiM, our diffusion model for novel view synthesis of scenes! 4DiM allows camera+time control with as few as one input image. Joint work with @srbhsxn* @lala_yi_li* @taiyasaki @fleet_dj *equal contribution
2
17
57
@AziziShekoofeh
Shek Azizi
2 years
Excited to share latest ✨Med-Gemini✨ additions - our new research unlocks possibilities in medical data analysis with 3 new models built upon Gemini 1.5 that can handle 2D medical images, and for the first time genomic risk score & 3D radiology scans. https://t.co/QKz0NZqpDH
6
59
311
@Sichinain
Siddhant Jain
2 years
Excited to share our work on Video Interpolation with Diffusion Models. https://t.co/Ckt7MtiTnf VIDIM generates plausible short videos given a start and end frame. Joint work with @watson_nn, @erictabellion, @holynski_, @poolio and @jannekontkanen
5
18
102
@watson_nn
Daniel Watson
2 years
Very excited to share VIDIM, our diffusion model for video interpolation. Cheap and high-quality video diffusion will become a reality. I have also believed for a while now that using pixels as inputs is severely underrated: there is so much more signal in there vs. text.
@jannekontkanen
Janne Kontkanen
2 years
VIDIM: Video Frame Interpolation Using Diffusion Models. Watch out for our generative frame interpolation magic in CVPR 2024. With Siddhant Jain, Daniel Watson, Eric Tabellion, Aleksander Holynski, and Ben Poole https://t.co/3DUlBMdQ6Y
2
2
21
@JonathanHeek
Jonathan Heek
2 years
Fast sampling with 'Multistep Consistency Models': We get 1.6 FID on ImageNet64 in 4 steps and scale text-to-image models, generating 256x256 images with 16 steps. Guess which row is distilled? With @emiel_hoogeboom @TimSalimans arXiv: https://t.co/BH7HzIGsgI
2
18
84
@karpathy
Andrej Karpathy
2 years
# on shortification of "learning" There are a lot of videos on YouTube/TikTok etc. that give the appearance of education, but if you look closely they are really just entertainment. This is very convenient for everyone involved: the people watching enjoy thinking they are
686
3K
17K
@AziziShekoofeh
Shek Azizi
2 years
Hiring Research Scientists within @GoogleDeepMind - Toronto to join our team & advance the next generation of medical AI, develop cutting-edge LLMs & Multi-modal models to tackle real-world healthcare challenges. Please submit your interest through: https://t.co/FJBs3h7Nvr
docs.google.com
Hiring Research Scientists within Google DeepMind (GDM) - Toronto to join our team & advance the next generation of medical AI, develop cutting-edge foundation models including LLMs and Multi-modal...
4
50
255
@gabrielsilva8_7
Gabriel Silva
2 years
Words are carried off by the wind, Mr. President. Education: we keep failing the PISA tests. Transparency: no improvement since 2019 (Transparency International). Poverty: rising since 2019 (World Bank). Homicide: rising since 2019 (Attorney General's Office).
13
138
372
@priyankjaini
Priyank Jaini
2 years
We have a student researcher opportunity in our team @GoogleDeepMind in Toronto 🍁 If you’re excited about research on diffusion models and generative video models, please fill out the form: https://t.co/3svxJfm8nO and apply here: https://t.co/82FhJvhV4B
deepmind.google
Collaborate with leading thinkers at Google DeepMind. Build AI that benefits humanity.
3
27
110
@holynski_
Aleksander Holynski
2 years
Excited to share ReconFusion! 3D reconstruction of real-world scenes from only a few photos, powered by diffusion priors: https://t.co/BjOBi7bIth w/ amazing team @ChrisWu6080 @BenMildenhall @philipphenzler @KeunhongP @RuiqiGao @watson_nn @_pratul_ @dorverbin @jon_barron @poolio
9
59
341
@poolio
Ben Poole
2 years
Logging into Twitter after the CVPR deadline...
0
6
145
@RicardoLombanaG
Ricardo Lombana
2 years
The Procurador General de la Administración has found SEVEN violations of the Political Constitution of the Republic of Panama in Law 406 (the mining contract) and masterfully closes his brief with this quote from the distinguished Carlos Iván Zúñiga: "there are moments that are supreme and of conscience
53
444
984