
Xingang Pan
@XingangP
Followers
3K
Following
322
Media
21
Statuses
75
Assistant Professor at Nanyang Technological University @NTUsg @MMLabNTU - Computer Vision, Deep Learning, Computer Graphics
Singapore
Joined May 2018
Cool work that connects the idea of volume rendering with image diffusion!.
Our paper LaRender received full marks at ICCV 2025 and was selected as oral! This paper enables control of occlusion relationships among objects and visual effects in a training-free manner for diffusion-based image generation. Project page:
0
0
0
Introducing ๐ฆ๐ง๐ฟ๐ฒ๐ฎ๐บ๐ฏ๐ฅ, a new 3D geometric foundation model for efficient 3D reconstruction from streaming input. Similar to LLMs, STream3R uses casual attention during training and KVCache at inference. No need to worry about post-alignment or reconstructing from scratch.
๐ฅStreaming-based 3D/4D Foundation Model๐ฅ. We present STream3R, which reformulates dense 3D/4D reconstruction into a sequential registration task with **causal attention**. - Projects: - Code: - Model:
4
16
92
Directly training Video Diffusion Models on long videos faces huge memory and learning challenges. How do we model long-range temporal distribution then?. Our ICCV 2025 work, ๐๏ธ๐ง๐ผ๐ธ๐ฒ๐ป๐๐๐ฒ๐ป, offers a solution. We compress videos into a highly condensed token space, enabling
0
24
102
๐ช๐ผ๐ฟ๐น๐ฑ๐ ๐ฒ๐บ is mainly created by @zeqi_xiao .Project page: ArXiv: Github: Demo:
1
0
4
Synthesizing worlds with video diffusion models is often inconsistent โ moving the camera back and forth leads to different scenes. We propose ๐๐ช๐ผ๐ฟ๐น๐ฑ๐ ๐ฒ๐บ, a memory-based approach that ensures consistent world simulation without relying on explicit 3D reconstruction.
While recent works like Genie 2, The Matrix, and Navigation World Models explore video generative models as world simulators, world consistency remains underexplored. In this work, we propose ๐WorldMem๐, introducing a memory mechanism for long-term consistent world simulation.
2
26
148
RT @TheYihangLuo: ๐ฅ Consistent Multi-View Diffusion for 3D Enhancement ๐ฅ. Introducing our work #3DEnhancer @CVPR: a multi-view diffusion moโฆ.
0
9
0
RT @zeqi_xiao: Introducing ๐กTrajectory Attention for Fine-grained Video Motion Control๐ก. By augmenting attention along predefined trajectorโฆ.
0
10
0