Yiming Dou @_YimingDou X Profile

Yiming Dou

@_YimingDou

Followers: 753

Following: 277

Media: 11

Statuses: 78

Ph.D. student at UMich | B.Eng. from SJTU | Computer Vision, Multimodal, Robotics

Shanghai ↔️ Ann Arbor

Joined March 2022


Yiming Dou

@_YimingDou

28 days

Ever wondered how a scene sounds👂 when you interact👋 with it? Introducing our #CVPR2025 work "Hearing Hands: Generating Sounds from Physical Interactions in 3D Scenes" -- we make 3D scene reconstructions audibly interactive!

2

33

95

Yiming Dou

@_YimingDou

22 days

RT @pliang279: Despite much progress in AI, the ability for AI to 'smell' like humans remains elusive. Smell AIs 🤖👃can be used for allergen…

0

17

0

Yiming Dou

@_YimingDou

27 days

RT @jin_linyi: Hello! If you are interested in dynamic 3D or 4D, don't miss the oral session 3A at 9 am on Saturday: @zhengqi_li will be…

0

6

0

Yiming Dou

@_YimingDou

28 days

RT @ayshrv: Excited to share our CVPR 2025 paper on cross-modal space-time correspondence! We present a method to match pixels across diff…

0

28

0

Yiming Dou

@_YimingDou

28 days

RT @jespark0: Can AI image detectors keep up with new fakes? Mostly, no. Existing detectors are trained using a handful of models. But the…

0

9

0

Yiming Dou

@_YimingDou

28 days

Wonderful collaboration with Wonseok Oh, Yuqing Luo, @antoniloq, @andrewhowens!!!

0

Yiming Dou

@_YimingDou

28 days

Combined with our previous #CVPR2024 work TaRF, we create an immersive 3D scene reconstruction that allows users to interact with it using sight👀, touch👆 and sound👂.

1

0

Yiming Dou

@_YimingDou

28 days

Next, we train a rectified flow model to generate the interaction sound conditioned on a 3D hand trajectory and video frames rendered from a 3D reconstruction of a scene. Generally, the predictions are accurate in both motion synchronization and material properties.

1

0
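For readers unfamiliar with rectified flow, the training objective mentioned in the post above can be sketched roughly as follows. This is a generic, minimal illustration and not the authors' implementation: the function names, the NumPy setup, and the `cond` argument (standing in for the hand-trajectory and rendered-video conditioning) are all assumptions made for the example. The idea is to interpolate linearly between a noise sample and a data sample, and to regress a velocity model onto the constant direction between them.

```python
import numpy as np

def rectified_flow_pair(x0, x1, t):
    """Return the interpolated point x_t and the target velocity.

    x0: noise samples, x1: data samples (same shape, batch first).
    t:  one time value in [0, 1] per batch element.
    """
    # Reshape t so it broadcasts over all non-batch dimensions.
    t = np.asarray(t, dtype=float).reshape(-1, *([1] * (x0.ndim - 1)))
    x_t = (1.0 - t) * x0 + t * x1   # straight-line interpolation
    v_target = x1 - x0              # constant velocity along that line
    return x_t, v_target

def rectified_flow_loss(velocity_fn, x0, x1, t, cond):
    """Mean squared error between predicted and target velocity.

    velocity_fn(x_t, t, cond) is a hypothetical model; cond stands in
    for whatever conditioning signal is used.
    """
    x_t, v_target = rectified_flow_pair(x0, x1, t)
    v_pred = velocity_fn(x_t, t, cond)
    return float(np.mean((v_pred - v_target) ** 2))

def euler_sample(velocity_fn, x0, cond, num_steps=10):
    """Generate a sample by integrating the velocity field from t=0 to t=1."""
    x = x0.copy()
    dt = 1.0 / num_steps
    for i in range(num_steps):
        t = np.full(x.shape[0], i * dt)
        x = x + dt * velocity_fn(x, t, cond)
    return x
```

In a real system `velocity_fn` would be a neural network and `x1` an audio representation; here any callable with the same signature works, which keeps the sketch easy to check.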

Yiming Dou

@_YimingDou

28 days

To capture the interactions, the human collector interacts with the scene by performing various actions with their hands. We lift the annotator's hands to the scene reconstruction, and render a video of the interaction by projecting 3D hands on the scene.

1

0

Yiming Dou

@_YimingDou

28 days

Traditional 3D reconstruction captures realistic visuals, but what about the sounds of interactions? Our work bridges this gap by predicting realistic audio from hand-object interactions in 3D scenes.

1

0

1

Yiming Dou

@_YimingDou

28 days

RT @dangengdg: Hello! If you like pretty images and videos and want a rec for CVPR oral session, you should def go to Image/Video Gen, Frid…

0

16

0

Yiming Dou

@_YimingDou

3 months

RT @_crockwell: Ever wish YouTube had 3D labels? 🚀Introducing🎥DynPose-100K🎥, an Internet-scale collection of diverse videos annotated with…

0

39

0

Yiming Dou

@_YimingDou

3 months

RT @ju_yuanchen: 🧩#CVPR2025🌷Introducing Two By Two✌️: The First Large-Scale Daily Pairwise Assembly Dataset with SE(3)-Equivariant Pose Est…

0

22

0

Yiming Dou

@_YimingDou

4 months

Thanks to @OpenAI, got a chance to grow up again in Ghibli anime🤗

0

15

Yiming Dou

@_YimingDou

6 months

RT @SarahJabbour_: I’m on the PhD internship market for Spr/Summer 2025! I have experience in multimodal AI (EHR, X-ray, text), explainabil…

0

12

0

Yiming Dou

@_YimingDou

7 months

RT @ju_yuanchen: 🍌We present DenseMatcher! 🤖️DenseMatcher enables robots to acquire generalizable skills across diverse object categories b…

0

29

0

Yiming Dou

@_YimingDou

7 months

RT @dangengdg: What happens when you train a video generation model to be conditioned on motion? Turns out you can perform "motion prompti…

0

148

0

Yiming Dou

@_YimingDou

9 months

RT @junyi42: Excited to share MonST3R! -- a simple way to estimate geometry from unposed video of dynamic scene. We achieve competitive res…

0

140

0

Yiming Dou

@_YimingDou

9 months

RT @Zichen2501: Differentiable rendering made SIMPLE❗️ Differentiating physically based renderers is hard: Dirac-delta discontinuities ari…

0

55

0

Yiming Dou

@_YimingDou

9 months

RT @ayshrv: We present Global Matching Random Walks, a simple self-supervised approach to the Tracking Any Point (TAP) problem, accepted to…

0

23

0