joao carreira Profile
joao carreira

@joaocarreira

Followers
1K
Following
282
Media
10
Statuses
145

Research Scientist at Google DeepMind

London, England
Joined February 2009
Don't wanna be here? Send us removal request.
@joaocarreira
joao carreira
28 days
Scaling 4D Representations – new preprint and models now available
Tweet card summary image
github.com
Contribute to google-deepmind/representations4d development by creating an account on GitHub.
3
42
203
@joaocarreira
joao carreira
23 days
3rd edition of the challenge with new exciting tasks and guest tracks; back during covid when we had the first workshop about the perception test ( some of us were afraid the benchmark was too difficult; now we just made it harder.
@nikparth1
Nikhil Parthasarathy
23 days
The 3rd Perception Test challenge is now accepting submissions ! Prizes of up to 50k EUR across Perception Test tracks are available. The winners will be announced at the Perception Test workshop at #ICCV2025. Submission deadline: October 6, 2025.
0
0
4
@joaocarreira
joao carreira
28 days
Most human knowledge was derived from our senses, part of it is passed around via language. Current AI excels on, but also depends on language. A next frontier of AI is developing the ability to create new knowledge from vision – large vision models may have a role to play there.
1
0
7
@joaocarreira
joao carreira
28 days
We found that larger models eventually surpass those trained with more sophisticated objectives, such as VJEPA, on non-semantic tasks (e.g. depth and camera pose estimation) while just matching them on higher-level tasks such as action recognition
Tweet media one
1
0
10
@joaocarreira
joao carreira
28 days
There's been little work on video models beyond 1B parameters with self-supervision (e.g. no text or labels). In this work we used the simplest, lightest masked auto-encoding learning method and explored what happens when we scale models all the way to 22B parameters
Tweet media one
1
0
8
@joaocarreira
joao carreira
30 days
RT @yanahasson: Thrilled to share our latest work on SciVid, to appear at #ICCV2025! 🎉.SciVid offers cross-domain evaluation of video model….
0
9
0
@joaocarreira
joao carreira
2 months
RT @sangwoomo: Can scaling data and models alone solve computer vision? 🤔.Join us at the SP4V Workshop at #ICCV2025 in Hawaii to explore th….
0
17
0
@joaocarreira
joao carreira
3 months
Individual frames out of generative video models tend to look reasonable; capturing actions happening over time realistically . that is way harder. TRAJAN is a new evaluation procedure to better guide progress in this (hot) area.
@KelseyRAllen
Kelsey Allen
3 months
Humans can tell the difference between a realistic generated video and an unrealistic one – can models?. Excited to share TRAJAN: the world’s first point TRAJectory AutoeNcoder for evaluating motion realism in generated and corrupted videos. 🌐 🧵
2
8
45
@joaocarreira
joao carreira
5 months
RT @TengdaHan: We are looking for a student researcher to work on video understanding plus 3D, in Google DeepMind London. DM/Email me or pa….
0
33
0
@joaocarreira
joao carreira
9 months
RT @vansteenkiste_s: Excited to announce MooG for learning video representations. MooG allows tokens to move “off-the-grid” enabling better….
0
14
0
@joaocarreira
joao carreira
1 year
RT @dimadamen: Time to challenge VLMs?.Fed up of benchmarks claiming long-video reasoning but only need few seconds?. Try out Hour-Long VQA….
0
6
0
@joaocarreira
joao carreira
1 year
RT @skandakoppula: We're excited to release TAPVid-3D: an evaluation benchmark of 4,000+ real world videos and 2.1 million metric 3D point….
0
58
0
@joaocarreira
joao carreira
1 year
RT @shiryginosar: Join us next week at our second (high-level) intelligence workshop @SimonsInstitute!. Schedule: ..
0
9
0
@joaocarreira
joao carreira
1 year
The 2nd Perception Test Challenge is now on -- with a workshop happening in ECCV Milano later in the year. See all about it here and try out your top general perception models on it. Besides the original 6 tasks we'll have a new hour-long videoQA track.
0
1
8
@joaocarreira
joao carreira
1 year
RT @CarlDoersch: We present a new SOTA on point tracking, via self-supervised training on real, unlabeled videos! BootsTAPIR achieves 67.4%….
0
65
0
@joaocarreira
joao carreira
1 year
RT @shawshank_v: Delighted to host the 1st edition of our tutorial "Time is precious: Self-Supervised Learning Beyond Images" at @eccvconf….
0
10
0
@joaocarreira
joao carreira
1 year
RT @ShaneLegg: Our research project SIMA is creating a general, natural language instructable, multi 3D game-playing AI agent. The agent c….
0
82
0
@joaocarreira
joao carreira
2 years
Videos have a wealth of learning signal that is still underappreciated -- in fact, looks like a single long video can be as valuable as a large curated internet image dataset. Cool work from @shawshank_v et al with a new self-sup formulation where multi-object tracking emerges.
@shawshank_v
Shashank
2 years
Really happy to share that DoRA is accepted as an Oral to @iclr_conf #ICLR2024. Using just “1 video” from our new egocentric dataset - Walking Tours, we develop a new method that outperforms DINO pretrained on ImageNet on image and video downstream tasks. More details in 🧵👇.
0
5
24