Daniel Watson @watson_nn X Profile

Daniel Watson

@watson_nn

Followers

1K

Following

742

Media

10

Statuses

95

Research Scientist @GoogleDeepMind. Opinions my own. 🇵🇦

Toronto, Ontario

Joined October 2017

Don't wanna be here? Send us removal request.

Daniel Watson

@watson_nn

1 month

A much needed step in world modeling. By generating 360° panos, world models will truly track everything happening around them, serving as a better form of memory. Huge shoutout to @Dazitu_616 for spearheding this awesome work. It was amazing to work with you!

Ziyi Wu

@Dazitu_616

1 month

📢Introducing 360Anything, our method for lifting any perspective image or video to gravity-aligned 360° panoramas without using any camera or 3D information. This enables consistent novel view synthesis and 3D scene reconstruction. Project page: https://t.co/qTOEip0Jw2 🧵

0

6

🇺🇦 Dzmitry Bahdanau

@DBahdanau

10 months

Adam deserves the award, but in Singapore everyone still uses SGD

23

63

788

Daniel Watson

@watson_nn

2 years

All results are unedited outputs from a single diffusion model. There is no NeRF or any postprocessing steps involved. Please check out our website for many more samples! https://t.co/yojdRhgNEn

0

1

8

Daniel Watson

@watson_nn

2 years

While still nowhere near perfect, 4DiM excels at very difficult cases like 360º from a single image:

1

6

Daniel Watson

@watson_nn

2 years

Or create videos in between images (without any camera poses):

1

0

3

Daniel Watson

@watson_nn

2 years

4DiM can also manipulate videos, e.g. we can re-create them with different cameras:

1

0

3

Daniel Watson

@watson_nn

2 years

Scenes are difficult because there is little training data with camera poses, and poses are usually noisy / lack meaningful scale. To get around this, we leverage large-scale (unposed) video data. This lets us also control time:

1

0

2

Daniel Watson

@watson_nn

2 years

[[THREAD]] Happy to announce 4DiM, our diffusion model for novel view synthesis of scenes! 4DiM allows camera+time control with as few as one input image. Joint work with @srbhsxn* @lala_yi_li* @taiyasaki @fleet_dj *equal contribution

2

17

57

Shek Azizi

@AziziShekoofeh

2 years

Excited to share latest ✨Med-Gemini✨ additions - our new research unlocks possibilities in medical data analysis with 3 new models built upon Gemini 1.5 that can handle 2D medical images, and for the first time genomic risk score & 3D radiology scans. https://t.co/QKz0NZqpDH

6

59

311

Siddhant Jain

@Sichinain

2 years

Excited to share our work on Video Interpolation with Diffusion Models. https://t.co/Ckt7MtiTnf VIDIM generates plausible short videos given a start and end frame. Joint work with @watson_nn, @erictabellion , @holynski_ , @poolio and @jannekontkanen

5

18

102

Daniel Watson

@watson_nn

2 years

Very excited to share VIDIM, our diffusion model for video interpolation. Cheap and high quality video diffusion will become a reality. I have also believed for a while now that using pixels as inputs is severely underrated-- there is so much more signal in there v.s. text.

Janne Kontkanen

@jannekontkanen

2 years

VIDIM: Video Frame Interpolation Using Diffusion Models. Watch out for our generative frame interpolation magic in CVPR 2024. With Siddhant Jain, Daniel Watson, Eric Tabellion, Aleksander Holynski, and Ben Poole https://t.co/3DUlBMdQ6Y

2

21

Jonathan Heek

@JonathanHeek

2 years

Fast sampling with 'Multistep Consistency Models': We get 1.6 FID on Imagenet64 in 4 steps and scale text-to-image models, generating 256x256 images with 16 steps. Guess which row is distilled? With @emiel_hoogeboom @TimSalimans Arxiv: https://t.co/BH7HzIGsgI

2

18

84

Andrej Karpathy

@karpathy

2 years

# on shortification of "learning" There are a lot of videos on YouTube/TikTok etc. that give the appearance of education, but if you look closely they are really just entertainment. This is very convenient for everyone involved : the people watching enjoy thinking they are

686

3K

17K

Shek Azizi

@AziziShekoofeh

2 years

Hiring Research Scientists within @GoogleDeepMind - Toronto to join our team & advance the next generation of medical AI, develop cutting-edge LLMs & Multi-modal models to tackle real-world healthcare challenges. Please submit your interest through: https://t.co/FJBs3h7Nvr

docs.google.com

Hiring Research Scientists within Google DeepMind (GDM) - Toronto to join our team & advance the next generation of medical AI, develop cutting-edge foundation models including LLMs and Multi-modal...

4

50

255

Gabriel Silva

@gabrielsilva8_7

2 years

Las palabras se las lleva el viento, Presidente En educación: seguimos fracasando las pruebas PISA Transparencia: No hay mejoras desde el 2019 (Transparencia Internacional) Pobreza: incrementa desde 2019 (Banco Mundial) Homicidio: incrementa desde 2019 (Procuraduría)

13

138

372

Priyank Jaini

@priyankjaini

2 years

We have a student researcher opportunity in our team @GoogleDeepMind in Toronto 🍁 If you’re excited about research on diffusion models, and generative video models, please fill the form : https://t.co/3svxJfm8nO and apply here: https://t.co/82FhJvhV4B

deepmind.google

Collaborate with leading thinkers at Google DeepMind. Build AI that benefits humanity.

3

27

110

Aleksander Holynski

@holynski_

2 years

Excited to share ReconFusion! 3D reconstruction of real-world scenes from only a few photos, powered by diffusion priors: https://t.co/BjOBi7bIth w/ amazing team @ChrisWu6080 @BenMildenhall @philipphenzler @KeunhongP @RuiqiGao @watson_nn @_pratul_ @dorverbin @jon_barron @poolio

9

59

341

Ben Poole

@poolio

2 years

Logging into Twitter after the CVPR deadline...

0

6

145

Ricardo Lombana

@RicardoLombanaG

2 years

El Procurador General de la Administración ha concluido SIETE violaciones a la Constitución Política de la República de Panamá en la Ley 406 (contrato minero) y cierra magistralmente su escrito con esta cita del ilustre Carlos Iván Zúñiga: "hay momentos supremos y de conciencia

53

444

984