Xingyi He Profile
Xingyi He

@XingyiHe1

Followers
288
Following
123
Media
2
Statuses
18

Ph.D. student at Zhejiang University, interested in computer vision

Joined April 2021
@XingyiHe1
Xingyi He
10 months
Excited to share our work MatchAnything: We pre-train strong universal image matching models that exhibit remarkable generalizability on unseen multi-modality matching and registration tasks. Project page: https://t.co/o5GisUJ7RT Huggingface Demo: https://t.co/qbz33QBulI
19
160
819
@XingyiHe1
Xingyi He
26 days
Excited to share our new work, BoxDreamer 🚀 A generalizable object pose estimation framework that needs only a single video clip: no object scanning, no reconstruction required. Achieves real-time (~60 FPS) performance within minutes. HuggingFaceDemo: https://t.co/WeODruaWrf
0
0
8
@Yuanhongyu929
Yuanhong Yu
27 days
Introducing BoxDreamer! 🚀 Our latest generalizable object pose estimation method. Make a single video clip 📹 ready for real-time (~60 FPS) object pose estimation, all in just minutes. Code: https://t.co/xnEmYY27v4 Video: https://t.co/tanRUtDqfR #ICCV2025
2
3
13
@TritiumAc
Chuanruo Ning
5 months
How can robots solve tasks that demand both semantic and physical reasoning, like playing real-world Angry Birds, without tons of data? We introduce Prompting with the Future: an MPC framework that fuses a pretrained VLM with an interactive digital twin for grounded, open-world
7
34
151
@ThomasYuxinChen
Yuxin Chen
5 months
💡 Can we let an arm-mounted quadrupedal robot perform tasks with both arms and legs? Introducing ReLIC: Reinforcement Learning for Interlimb Coordination for versatile loco-manipulation in unstructured environments. [1/6] https://t.co/cOyPC5ZOvp
18
50
255
@jianyuan_wang
Jianyuan
8 months
Introducing VGGT (CVPR'25), a feedforward Transformer that directly infers all key 3D attributes from one, a few, or hundreds of images, in seconds! No expensive optimization needed, yet delivers SOTA results for: ✅ Camera Pose Estimation ✅ Multi-view Depth Estimation ✅ Dense
21
198
1K