SifeiL @Sifei30488L X Profile

Followers: 134

Following: 11

Media: 3

Statuses: 9

Joined October 2023

SifeiL

@Sifei30488L

6 months

🚀 Excited to share the release of GSPN! 🎉 #CVPR2025 #nvidia Our release includes a custom, highly optimized CUDA kernel for the core GSPN operation. 👉 Check out the code and try it for your vision foundation models: https://t.co/1IMW88SNGQ 📑 Paper:

lnkd.in

Xiaolong Wang

@xiaolonw

6 months

The code of GSPN #CVPR2025 is released! We proposed a new sqrt(N) complexity attention mechanism, which enables efficient high-resolution image generation. We can generate 8K images with a 42x speedup compared to self-attention in StableDiffusionXL! Code: https://t.co/iiTjRemcl9

1

7

30

Songyou Peng

@songyoupeng

7 months

📢 Unposed few-view 3D reconstruction has never been so easy, and SOTA pose estimation as a byproduct! Check out our #ICLR2025 ORAL paper (top 1.8%): NoPoSplat! Catch the amazing @Botao_Ye at: Oral: Thu 4:18 pm Poster: Thu 10 am (#204) Website: https://t.co/uBhK0tvoVU

1

14

157

naveen manwani

@NaveenManwani17

7 months

🚨Paper Alert 🚨 ➡️Paper Title: Describe Anything: Detailed Localized Image and Video Captioning 🌟Few pointers from the paper 🎯Generating detailed and accurate descriptions for specific regions in images and videos remains a fundamental challenge for vision-language models.

0

1

SifeiL

@Sifei30488L

10 months

@nvidia @wonmin_byeon @kaihan_vis @xiaolonw @JinweiGu98 @jankautz @Jerry_XU_Jiarui Core contributors: Hongjun Wang, @wonmin_byeon, @Jerry_XU_Jiarui, @xiaolonw, @kaihan_vis, @JinweiGu98, Charles Cheung, and @jankautz Key Highlights: 1. GSPN excels across various vision tasks, maintaining superior spatial coherence and fidelity. 2. Remarkably, it

0

3

6

SifeiL

@Sifei30488L

10 months

Introducing GSPN: A Leap Forward in Vision Attention Mechanisms Paper: https://t.co/kQxemuWRNr Project: https://t.co/Ly2bd1KFcv We present GSPN (Generalized Spatial Propagation Network), a novel attention mechanism developed at @nvidia. Unlike pixel-to-pixel scans like Mamba,

3

47

159

Xiaolong Wang

@xiaolonw

2 years

Let’s think about humanoid robots outside carrying the box. How about having the humanoid come out the door, interact with humans, and even dance? Introducing Expressive Whole-Body Control for Humanoid Robots: https://t.co/BI0Hvt7I7O See how our robot performs rich, diverse,

89

185

1K

Xiaolong Wang

@xiaolonw

2 years

3D Gaussian Splatting is great, but can it work without pre-computed camera poses? Introducing: COLMAP-Free 3D Gaussian Splatting. Our recent work shows not only that it can, but also that 3D Gaussians make camera pose estimation easy (compared to NeRF) along with reconstruction. 👇🧵

4

59

299