Meera Hahn Profile
Meera Hahn

@MeeraHahn

Followers
288
Following
140
Media
2
Statuses
34

Research Scientist @GoogleAI PhD in Computer Science @GeorgiaTech Undergrad @EmoryUniversity

Atlanta, GA
Joined November 2019
Don't wanna be here? Send us removal request.
@MeeraHahn
Meera Hahn
8 months
Exciting new work from @sihyun_yu and our team at Google Deep Mind! Memory-Augmented Latent Transformers (MALT) Diffusion, a new diffusion model specialized for long video generation! https://t.co/gcDZr5mVbf
Tweet card summary image
arxiv.org
Diffusion models are successful for synthesizing high-quality videos but are limited to generating short clips (e.g., 2-10 seconds). Synthesizing sustained footage (e.g. over minutes) still...
3
17
111
@NithishKannen
Nithish Kannen
9 months
Check out our tech report on proactive T2I agents that ask clarification questions to reduce uncertainty! With this agent, we obtain 2 times higher VQAScore in just 5 turns!🤯 We've open-sourced our agent code powered by #Gemini @GoogleDeepMind! 🚀 Code: https://t.co/67dTGDQpZI
@ziwphd
Zi Wang, Ph.D.
10 months
Tired of endless prompt tweaking? We've released a tech report on proactive text-to-image agents powered by #Gemini @GoogleDeepMind! Our agents ask clarifying questions and use belief graphs to understand what you really want. https://t.co/jRwjxtqALx https://t.co/XFVvva7r96
2
22
67
@natanielruizg
Nataniel Ruiz
10 months
legit treflip by Veo 2 with just one error (wheel inversion in the middle). legs move realistically for a treflip 🤯. skateboarding videos are notoriously hard to generate
@venturetwins
Justine Moore
10 months
I cannot believe this is AI video - we've come insanely far in the last year (Veo 2)
4
6
70
@ziwphd
Zi Wang, Ph.D.
10 months
Many thanks to @MeeraHahn , Wenjun Zeng, Nithish Kannen, Rich Galt, Kartikeya Badola, @_beenkim for making this happen! Stay tuned for code release!
0
1
5
@ziwphd
Zi Wang, Ph.D.
10 months
Tired of endless prompt tweaking? We've released a tech report on proactive text-to-image agents powered by #Gemini @GoogleDeepMind! Our agents ask clarifying questions and use belief graphs to understand what you really want. https://t.co/jRwjxtqALx https://t.co/XFVvva7r96
1
5
17
@agrimgupta92
Agrim Gupta
2 years
6/ Finally, our model can be used to generate videos with consistent 3D camera motion.
3
17
93
@agrimgupta92
Agrim Gupta
2 years
2/ website: https://t.co/atH5wzRudu Our approach has two key design decisions. First, we use a causal encoder to compress images and videos in a shared latent space.
2
15
71
@agrimgupta92
Agrim Gupta
2 years
We introduce W.A.L.T, a diffusion model for photorealistic video generation. Our model is a transformer trained on image and video generation in a shared latent space. 🧵👇
51
250
1K
@ptsi
Philipp Tsipman
3 years
If you’re having trouble keeping up with Video AI😅, there have been 5 state-of-the-art generative video models released *in last 7 days*: 🤯😎🧵
61
667
7K
@MeeraHahn
Meera Hahn
3 years
Have you ever wondered about emergent intelligence in robotic agents? This work shows interesting emergent intelligence and behaviors in blind navigation agents! Blind agents learn maps as they navigate. This allows them to navigate as successfully as an agent with vision
@erikwijmans
erikwijmans
3 years
How do 'map-less' agents navigate? They learn to build implicit maps of their environment in their hidden state! We study 'blind' AI navigation agents and find the following 🧵
0
1
15
@maxxu05
Max Xu
3 years
How can we fill in missing pulsative sensor data? Prior state-of-the-art fails in our novel setting, despite its well-defined temporal structure. Checkout our #NeurIPS2022 paper, PulseImpute, @ 4 pm CST! arxiv: https://t.co/Hbv2x7ZvkP github: https://t.co/bTManuLyEH
0
11
17
@michellehuang42
michelle huang
3 years
i trained an ai chatbot on my childhood journal entries - so that i could engage in real-time dialogue with my "inner child" some reflections below:
567
6K
46K
@sstj389
Stefan Stojanov
3 years
Dense self-supervised learning from multiple 3D viewpoints → dense feature representations that generalize both to novel object instances and to novel categories of instances. Checkout our #NeurIPS2022 paper! arxiv: https://t.co/rxzdrScII1 github: https://t.co/5n98W7Wykt
2
20
68
@MeeraHahn
Meera Hahn
3 years
We model indoor environments using FPV panoramic navigation graphs and introduce a visiolinguistic transformer model, LED-Bert, which scores the alignment between navigation graph nodes and dialogs and achieves SOTA performance on the LED task!
0
0
1
@MeeraHahn
Meera Hahn
3 years
✨Transformer-based Localization from Embodied Dialog with Large-scale Pre-training✨ has been accepted as an oral at @aaclmeeting! https://t.co/L7SSGfqk0w w/ @RehgJim
1
5
19
@natanielruizg
Nataniel Ruiz
3 years
Today, along with my collaborators at @GoogleAI, we announce DreamBooth! It allows a user to generate a subject of choice (pet, object, etc.) in myriad contexts and with text-guided semantic variations! The options are endless. (Thread 👇) webpage: https://t.co/EDpIyalqiK 1/N
42
403
2K
@sstj389
Stefan Stojanov
4 years
A new year, a new shameless twitter plug: Check out our Toys4K 3D object dataset 4K instances, 105 categories, 15+ instances per category https://t.co/K85J007iru
0
12
62
@_akhaliq
AK
4 years
No RL, No Simulation: Learning to Navigate without Navigating abs: https://t.co/xWFQwed1ES
0
12
75
@MeeraHahn
Meera Hahn
4 years
Check out our video on NRNS here:
0
3
11
@MeeraHahn
Meera Hahn
4 years
Excited to present our #neurips work NRNS!
@dchaplot
Devendra Chaplot
4 years
#NeurIPS21 paper on No RL No Simulation (NRNS): Learning to Navigate without Navigating! NRNS not only beats RL/IL algorithms in simulation but when it comes to real-world…there is no sim2real required! Webpage: https://t.co/MuDOM91LJP Code: https://t.co/WBXxyrr8uY (1/2)
0
3
9