mct1224 Profile Banner
shengze wang Profile
shengze wang

@mct1224

Followers
47
Following
245
Media
11
Statuses
53

PhD student @UNC Chapel Hill, 3D vision & telepresence. MSCV @ CMU, ECE @ UIUC. ex-intern @ Intel, NVIDIA, Uber-ATG

Joined October 2020
Don't wanna be here? Send us removal request.
@mct1224
shengze wang
2 days
RT @YasutakaFuruka1: MapTracker: Online Consistent Vector HD Mapping by my students (Jiacheng Chen, Yuefan Wu, and Jiaqi Tan) and colleague….
0
14
0
@mct1224
shengze wang
2 days
RT @YasutakaFuruka1: Thank you for posting @_akhaliq .🚀 Excited to share Compressive Light-Field Tokens (CLiFT) — a new scene representatio….
0
12
0
@mct1224
shengze wang
5 days
RT @ShivamDuggal4: Compression is the heart of intelligence.From Occam to Kolmogorov—shorter programs=smarter representations. Meet KARL: K….
0
62
0
@mct1224
shengze wang
7 days
RT @MattNiessner: Stunning voice model by @synthesiaIO:. *𝐄𝐗𝐏𝐑𝐄𝐒𝐒-𝐕𝐨𝐢𝐜𝐞*. -> new SotA that perseveres identity, accent, expressiveness w/o….
0
10
0
@mct1224
shengze wang
8 days
RT @n_karaev: ⚡️Today we’re releasing SpatialTrackerV2, the first feedforward model for dynamic 3D reconstruction and point tracking in the….
0
23
0
@mct1224
shengze wang
11 days
RT @emmanuel_2m: All of these 3D objects were generated entirely via AI (each from a single image) and the results are reaching exceptional….
0
89
0
@mct1224
shengze wang
17 days
RT @gabriberton: I can't stress enough how useful this trick has been for me in all these years. It reduces GPU memory by N equal the numbe….
0
199
0
@mct1224
shengze wang
1 month
More results: We adapt to dynamic light changes (Img 1) and difficult expressions (Img 2) whereas portrait animation methods might find it difficult. Comparing to single image recon, the extra reference view helps improve identity coherence and reduce artifacts (Img 3) (7/7)
Tweet media one
Tweet media two
Tweet media three
0
0
1
@mct1224
shengze wang
1 month
We remove such distortions and improve occluded regions by leveraging an additional frontal reference image of the person. We use this reference to produce an warping that undistorts the raw reconstruction, and we use this reference image to help recover occluded regions (6/n)
Tweet media one
1
0
1
@mct1224
shengze wang
1 month
The core challenge is achieving temporally consistent reconstructions despite the varying head pose. As shown here, capturing the user from the sides can produce stretching/distortion and artifacts, making the lady’s face rounder. (5/n)
Tweet media one
1
0
1
@mct1224
shengze wang
1 month
2. Coherent3D improves temporal consistency for single-view 3D portrait reconstruction, while retaining more authentic expression changes and dynamic lighting/shoulder poses than mesh-driven portrait animation methods. The core challenge is … (3/n)
Tweet media one
1
0
1
@mct1224
shengze wang
1 month
More results
Tweet media one
Tweet media two
Tweet media three
1
0
1
@mct1224
shengze wang
1 month
Insight: prior works found it difficult to solve focal length+3D translation+3D human from a single image. We realize that it’s solvable by first accurately estimating metric pelvis depth, then 3D mesh, and finally 3D translation and focal.
Tweet media one
1
0
1
@mct1224
shengze wang
1 month
1. BLADE solves for both 3D human pose and camera, which is often neglected in prior works but very important for close range images with severe perspective distortion. BLADE achieves both accurate 2D alignment and 3D pose (2/n)
Tweet media one
1
0
1
@mct1224
shengze wang
1 month
At CVPR2025, I presented 2 of our papers:. 1. BLADE: Single-view Body Mesh Learning through Accurate Depth Estimation. 2. Coherent 3D Portrait Video Reconstruction via Triplane Fusion. 1/n
Tweet media one
Tweet media two
1
4
8
@mct1224
shengze wang
2 months
RT @iam_NCJ: What if a Transformer could render?.Not text → image. But mesh → image — with global illumination. No rasterizers. No ray-tra….
0
86
0
@mct1224
shengze wang
2 months
RT @taiyasaki: This paper is awesome. 1) it fine tunes a video model on just 2 GPUs, opening academic research on the topic.2) the trainin….
0
9
0
@mct1224
shengze wang
2 months
RT @youngjoongkwon: I’ll be recruiting Ph.D. and Master’s students for Fall 2026. I’ll also be at CVPR this year-happy to connect there!.
0
3
0
@mct1224
shengze wang
3 months
RT @y0b1byte:
Tweet media one
Tweet media two
Tweet media three
0
85
0
@mct1224
shengze wang
3 months
RT @BrianRoemmele: BOOM!. FREE Text to multi-guest podcast AI. Open Source Nari Labs with Dia-1.6b just beat podcast-style clips on Google….
0
30
0