SifeiL
@Sifei30488L
Followers
134
Following
11
Media
3
Statuses
9
Joined October 2023
🚀 Excited to share the release of GSPN! 🎉 #CVPR2025 #nvidia Our release includes a custom, highly optimized CUDA kernel for the core GSPN operation. 👉 Check out the code and try it for your vision foundational models: https://t.co/1IMW88SNGQ 📑Paper:
lnkd.in
This link will take you to a page that’s not on LinkedIn
The code of GSPN #CVPR2025 is released! We proposed a new sqrt(N) complexity attention mechanism, which enables efficient high resolution image generation. We can generate 8k images with 42x speed up compared to self-attention in StableDiffusionXL! Code: https://t.co/iiTjRemcl9
1
7
30
📢 Unposed few-view 3D reconstruction has never been so easy, and SOTA pose estimation as a byproduct! Check out our #ICLR2025 ORAL paper (top 1.8%): NoPoSplat! Catch the amazing @Botao_Ye at: Oral: Thu 4:18 pm Poster: Thu 10 am (#204) Website: https://t.co/uBhK0tvoVU
1
14
157
🚨Paper Alert 🚨 ➡️Paper Title: Describe Anything: Detailed Localized Image and Video Captioning 🌟Few pointers from the paper 🎯Generating detailed and accurate descriptions for specific regions in images and videos remains a fundamental challenge for vision-language models.
0
1
1
@nvidia @wonmin_byeon @kaihan_vis @xiaolonw @JinweiGu98 @jankautz @Jerry_XU_Jiarui Core contributors: Hongjun Wang, @wonmin_byeon , @Jerry_XU_Jiarui , @xiaolonw , @kaihan_vis , @JinweiGu98 , Charles Cheung, and @jankautz Key Highlights: 1. GSPN excels across various vision tasks, maintaining superior spatial coherence and fidelity. 2. Remarkably, it
0
3
6
Introducing GSPN: A Leap Forward in Vision Attention Mechanisms Paper: https://t.co/kQxemuWRNr Project: https://t.co/Ly2bd1KFcv We present GSPN (Generalized Spatial Propagation Network), a novel attention mechanism developed at @nvidia. Unlike pixel-to-pixel scans like mamba,
3
47
159
Let’s think about humanoid robots outside carrying the box. How about having the humanoid come out the door, interact with humans, and even dance? Introducing Expressive Whole-Body Control for Humanoid Robots: https://t.co/BI0Hvt7I7O See how our robot performs rich, diverse,
89
185
1K
3D Gaussian Splatting is great, but can it work without the pre-computed camera poses? Introducing: COLMAP-Free 3D Gaussian Splatting Our recent work shows not only it can, but 3D Gaussians make camera pose estimation easy (compared to NeRF) along with reconstruction. 👇🧵
4
59
299