
joao carreira
@joaocarreira
Followers: 1K · Following: 282 · Media: 10 · Statuses: 145
Research Scientist at Google DeepMind
London, England
Joined February 2009
Scaling 4D Representations – new preprint and models now available
github.com/google-deepmind/representations4d
3 replies · 42 reposts · 203 likes
3rd edition of the challenge, with exciting new tasks and guest tracks. Back during covid, when we held the first workshop about the Perception Test, some of us were afraid the benchmark was too difficult; now we've just made it harder.
The 3rd Perception Test challenge is now accepting submissions! Prizes of up to 50k EUR are available across the Perception Test tracks. The winners will be announced at the Perception Test workshop at #ICCV2025. Submission deadline: October 6, 2025.
0 replies · 0 reposts · 4 likes
RT @yanahasson: Thrilled to share our latest work on SciVid, to appear at #ICCV2025! 🎉 SciVid offers cross-domain evaluation of video model…
0 replies · 9 reposts · 0 likes
RT @sangwoomo: Can scaling data and models alone solve computer vision? 🤔 Join us at the SP4V Workshop at #ICCV2025 in Hawaii to explore th…
0 replies · 17 reposts · 0 likes
Individual frames out of generative video models tend to look reasonable; capturing actions happening realistically over time is way harder. TRAJAN is a new evaluation procedure to better guide progress in this (hot) area (a rough sketch of the idea is below).
Humans can tell the difference between a realistic generated video and an unrealistic one – can models? Excited to share TRAJAN: the world’s first point TRAJectory AutoeNcoder for evaluating motion realism in generated and corrupted videos. 🌐 🧵
2 replies · 8 reposts · 45 likes
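A minimal sketch of the TRAJAN idea as described above, assuming point tracks have already been extracted from videos with an off-the-shelf tracker. The architecture, sizes, and scoring rule here are illustrative assumptions, not the paper's actual model: an autoencoder is trained on trajectories from real videos, and its reconstruction error on trajectories from generated videos serves as a motion-realism signal.

```python
# Hedged sketch of a TRAJAN-style evaluation (not the official implementation).
import torch
import torch.nn as nn

class TrajAutoencoder(nn.Module):
    """Autoencodes (x, y) point trajectories of fixed length T."""
    def __init__(self, T: int = 32, latent: int = 16):
        super().__init__()
        self.encoder = nn.Sequential(
            nn.Flatten(),                 # (B, T, 2) -> (B, 2T)
            nn.Linear(2 * T, 64), nn.ReLU(),
            nn.Linear(64, latent),
        )
        self.decoder = nn.Sequential(
            nn.Linear(latent, 64), nn.ReLU(),
            nn.Linear(64, 2 * T),
            nn.Unflatten(1, (T, 2)),      # (B, 2T) -> (B, T, 2)
        )

    def forward(self, tracks):
        return self.decoder(self.encoder(tracks))

def motion_realism_score(model, tracks):
    """Higher score = motion looks more like the real trajectories the
    autoencoder was trained on (scoring rule is an assumption)."""
    with torch.no_grad():
        err = ((model(tracks) - tracks) ** 2).mean()
    return -err.item()

# Train on trajectories from real videos only, then score generated ones.
model = TrajAutoencoder()
opt = torch.optim.Adam(model.parameters(), lr=1e-3)
real_tracks = torch.randn(256, 32, 2)   # placeholder for real-video tracks
for _ in range(100):
    opt.zero_grad()
    loss = ((model(real_tracks) - real_tracks) ** 2).mean()
    loss.backward()
    opt.step()
gen_tracks = torch.randn(64, 32, 2)     # placeholder for generated-video tracks
print(motion_realism_score(model, gen_tracks))
```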
RT @TengdaHan: Check out our CVPR 2025 paper: Work with Dilara Gokay, Joseph Heyward, @ChuhanZhang5, @DanielZoran…
arxiv.org
We address the challenge of representation learning from a continuous stream of video as input, in a self-supervised manner. This differs from the standard approaches to video learning where...
0 replies · 2 reposts · 0 likes
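The streaming setting described in the abstract differs from standard video self-supervised learning mainly in how data arrives: clips are seen once, in temporal order, with no shuffled i.i.d. minibatches. A toy sketch of that setting follows; the encoder and objective are placeholders, not the paper's method.

```python
# Hedged sketch of the streaming-video SSL setting (not the paper's method).
import torch
import torch.nn as nn

# Toy encoder: flattens a clip half of shape (B, 3, 8, 32, 32).
encoder = nn.Sequential(nn.Flatten(), nn.Linear(3 * 8 * 32 * 32, 128))
opt = torch.optim.SGD(encoder.parameters(), lr=1e-3)

def clips_from_stream(num_clips=100):
    """Placeholder generator for consecutive clips of one long video stream."""
    for _ in range(num_clips):
        yield torch.randn(1, 3, 16, 32, 32)  # (B, C, T, H, W)

def ssl_loss(z1, z2):
    """Toy self-supervised objective: pull embeddings of the two halves
    of the same clip together (cosine similarity)."""
    return -nn.functional.cosine_similarity(z1, z2).mean()

for clip in clips_from_stream():
    # Split each clip in time; treat the two halves as positive views.
    a, b = clip[:, :, :8], clip[:, :, 8:]
    loss = ssl_loss(encoder(a), encoder(b))
    opt.zero_grad()
    loss.backward()
    opt.step()  # one online update per clip; data is seen once, in order
```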
RT @TengdaHan: We are looking for a student researcher to work on video understanding plus 3D, in Google DeepMind London. DM/Email me or pa…
0 replies · 33 reposts · 0 likes
RT @EEMLcommunity: Apply here: Confirmed speakers: @AaronCourville @AldenHung @dianaborsa @09Emmar @joaocarreira @…
eeml.eu
Participation at the event is subject to selection. Please see below the details on how to apply. Deadline for applications: 31 March 2025, 7 April 2025, 23:59 Anywhere on Earth. Application period...
0 replies · 2 reposts · 0 likes
RT @vansteenkiste_s: Excited to announce MooG for learning video representations. MooG allows tokens to move “off-the-grid”, enabling better…
0 replies · 14 reposts · 0 likes
RT @dimadamen: Time to challenge VLMs? Fed up with benchmarks that claim long-video reasoning but only need a few seconds? Try out Hour-Long VQA…
0 replies · 6 reposts · 0 likes
RT @skandakoppula: We're excited to release TAPVid-3D: an evaluation benchmark of 4,000+ real-world videos and 2.1 million metric 3D point…
0 replies · 58 reposts · 0 likes
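For intuition, here is one hedged way a position-accuracy metric over metric 3D point trajectories could be computed: the fraction of point-frame pairs whose predicted 3D position lies within a distance threshold of ground truth, averaged over several thresholds. The thresholds and averaging below are illustrative assumptions, not the benchmark's official metrics.

```python
# Hedged sketch of a "< delta" 3D position-accuracy metric (illustrative only;
# see the TAPVid-3D paper/repo for the official evaluation).
import numpy as np

def position_accuracy_3d(pred, gt, thresholds=(0.05, 0.1, 0.2, 0.4, 0.8)):
    """pred, gt: (num_points, num_frames, 3) trajectories in metres.
    Returns the fraction of point-frame pairs within each threshold,
    averaged over thresholds."""
    dist = np.linalg.norm(pred - gt, axis=-1)  # (points, frames)
    return float(np.mean([(dist < t).mean() for t in thresholds]))

pred = np.random.randn(100, 50, 3) * 0.1
gt = pred + np.random.randn(100, 50, 3) * 0.05  # placeholder data
print(position_accuracy_3d(pred, gt))
```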
RT @shiryginosar: Join us next week at our second (high-level) intelligence workshop @SimonsInstitute! Schedule: …
0 replies · 9 reposts · 0 likes
RT @CarlDoersch: We present a new SOTA on point tracking, via self-supervised training on real, unlabeled videos! BootsTAPIR achieves 67.4%…
0 replies · 65 reposts · 0 likes
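A hedged sketch of the self-training recipe the tweet alludes to: a frozen teacher produces pseudo-tracks on an unlabeled clip, and a student learns to reproduce them on an augmented copy of the same clip. The tiny tracker below is a stand-in, not the TAPIR architecture, and the noise augmentation is a toy; geometric augmentations would also require transforming the pseudo-labels accordingly.

```python
# Hedged sketch of self-supervised bootstrapping for point tracking,
# in the spirit of BootsTAPIR (illustrative; not the actual method).
import copy
import torch
import torch.nn as nn

class TinyTracker(nn.Module):
    """Toy stand-in for a point tracker: maps a clip plus a query point
    to per-frame (x, y) positions."""
    def __init__(self):
        super().__init__()
        self.net = nn.Linear(3 * 8 * 32 * 32 + 2, 8 * 2)

    def forward(self, clip, queries):
        feat = clip.flatten(1)                  # (B, C*T*H*W)
        x = torch.cat([feat, queries], dim=-1)
        return self.net(x).view(-1, 8, 2)       # (B, T, 2)

student = TinyTracker()
teacher = copy.deepcopy(student).eval()         # frozen teacher copy
opt = torch.optim.Adam(student.parameters(), lr=1e-4)

clip = torch.randn(4, 3, 8, 32, 32)             # unlabeled clip
queries = torch.rand(4, 2)                      # query points
with torch.no_grad():
    pseudo = teacher(clip, queries)             # teacher pseudo-tracks
aug = clip + 0.1 * torch.randn_like(clip)       # toy photometric augmentation
opt.zero_grad()
loss = ((student(aug, queries) - pseudo) ** 2).mean()
loss.backward()
opt.step()
```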
RT @shawshank_v: Delighted to host the 1st edition of our tutorial "Time is precious: Self-Supervised Learning Beyond Images" at @eccvconf…
0 replies · 10 reposts · 0 likes
RT @ShaneLegg: Our research project SIMA is creating a general, natural-language-instructable, multi-3D-game-playing AI agent. The agent c…
0 replies · 82 reposts · 0 likes
Videos have a wealth of learning signal that is still underappreciated; in fact, it looks like a single long video can be as valuable as a large curated internet image dataset. Cool work from @shawshank_v et al., with a new self-supervised formulation where multi-object tracking emerges (data-setup sketch below).
Really happy to share that DoRA has been accepted as an Oral at @iclr_conf #ICLR2024. Using just “1 video” from our new egocentric dataset, Walking Tours, we develop a new method that outperforms DINO pretrained on ImageNet on image and video downstream tasks. More details in 🧵👇
0 replies · 5 reposts · 24 likes
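A minimal sketch of the single-long-video data setup mentioned above: the entire "dataset" is random temporal windows cut from one video. This illustrates only the sampling; DoRA's actual DINO-style self-distillation and emergent tracking are not implemented here.

```python
# Hedged sketch of pretraining data drawn from a single long video
# (illustrative only; not DoRA's training method).
import torch

def sample_clips(video, num_clips, T=8):
    """Draw random temporal windows from one long video tensor of shape
    (C, T_total, H, W); the whole 'dataset' is this single video."""
    C, T_total, H, W = video.shape
    starts = torch.randint(0, T_total - T, (num_clips,))
    return torch.stack([video[:, int(s):int(s) + T] for s in starts])

long_video = torch.randn(3, 10_000, 32, 32)  # stand-in for a walking-tour video
batch = sample_clips(long_video, num_clips=16)
print(batch.shape)                            # (16, 3, 8, 32, 32)
```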