Xingyi He
@XingyiHe1
Followers
288
Following
123
Media
2
Statuses
18
Ph.D student in Zhejiang University, interest on computer vision
Joined April 2021
Excited to share our work MatchAnything: We pre-train strong universal image matching models that exhibit remarkable generalizability on unseen multi-modality matching and registration tasks. Project page: https://t.co/o5GisUJ7RT Huggingface Demo: https://t.co/qbz33QBulI
19
160
819
Excited to share our new work β BoxDreamer π A generalizable object pose estimation framework that needs only a single video clip β no object scanning, no reconstruction required. Achieves real-time (~60 FPS) performance within minutes. HuggingFaceDemo: https://t.co/WeODruaWrf
0
0
8
Introducing BoxDreamer! πOur latest generalizable object pose estimation method. Make a single video clip πΉ ready for real-time (~60β―FPS) object pose estimation β all in just minutes. Code: https://t.co/xnEmYY27v4 Video: https://t.co/tanRUtDqfR
#ICCV2025
2
3
13
How can robots solve tasks that demand both semantic and physical reasoning, like playing real-world Angry Birds, without tons of data? We introduce Prompting with the Future: an MPC framework that fuses a pretrained VLM with an interactive digital twin for grounded, open-world
7
34
151
π‘Can we let an arm-mounted quadrupedal robot to perform task with both arms and legs? Introducing ReLIC: Reinforcement Learning for Interlimb Coordination for versatile loco-manipulation in unstructured environments. [1/6] https://t.co/cOyPC5ZOvp
18
50
255
Introducing VGGT (CVPR'25), a feedforward Transformer that directly infers all key 3D attributes from one, a few, or hundreds of images, in seconds! No expensive optimization needed, yet delivers SOTA results for: β
Camera Pose Estimation β
Multi-view Depth Estimation β
Dense
21
198
1K