Xingyi He @XingyiHe1 X Profile

Xingyi He

@XingyiHe1

Followers

288

Following

123

Media

2

Statuses

18

Ph.D student in Zhejiang University, interest on computer vision

Joined April 2021

Don't wanna be here? Send us removal request.

Xingyi He

@XingyiHe1

10 months

Excited to share our work MatchAnything: We pre-train strong universal image matching models that exhibit remarkable generalizability on unseen multi-modality matching and registration tasks. Project page: https://t.co/o5GisUJ7RT Huggingface Demo: https://t.co/qbz33QBulI

19

160

819

Xingyi He

@XingyiHe1

26 days

Excited to share our new work — BoxDreamer 🚀 A generalizable object pose estimation framework that needs only a single video clip — no object scanning, no reconstruction required. Achieves real-time (~60 FPS) performance within minutes. HuggingFaceDemo: https://t.co/WeODruaWrf

0

8

Yuanhong Yu

@Yuanhongyu929

27 days

Introducing BoxDreamer! 🚀Our latest generalizable object pose estimation method. Make a single video clip 📹 ready for real-time (~60 FPS) object pose estimation — all in just minutes. Code: https://t.co/xnEmYY27v4 Video: https://t.co/tanRUtDqfR #ICCV2025

2

3

13

Chuanruo Ning

@TritiumAc

5 months

How can robots solve tasks that demand both semantic and physical reasoning, like playing real-world Angry Birds, without tons of data? We introduce Prompting with the Future: an MPC framework that fuses a pretrained VLM with an interactive digital twin for grounded, open-world

7

34

151

Yuxin Chen

@ThomasYuxinChen

5 months

💡Can we let an arm-mounted quadrupedal robot to perform task with both arms and legs? Introducing ReLIC: Reinforcement Learning for Interlimb Coordination for versatile loco-manipulation in unstructured environments. [1/6] https://t.co/cOyPC5ZOvp

18

50

255

Jianyuan

@jianyuan_wang

8 months

Introducing VGGT (CVPR'25), a feedforward Transformer that directly infers all key 3D attributes from one, a few, or hundreds of images, in seconds! No expensive optimization needed, yet delivers SOTA results for: ✅ Camera Pose Estimation ✅ Multi-view Depth Estimation ✅ Dense

21

198

1K