Junchen Liu @JunchenLiu77 X Profile

Junchen Liu

@JunchenLiu77

Followers

295

Following

176

Media

4

Statuses

46

PhD student @UofT @VectorInst.

https://t.co/qlBH5wu1ms

Toronto

Joined September 2022

Don't wanna be here? Send us removal request.

Huan Ling

@HuanLing6

15 days

1/ #NVIDIAGTC We’re excited to share that ChronoEdit-14B model and 8-step Distillation LoRA (4s/image on H100) are released today. 🤗 Model https://t.co/X3diGAY42p 🤗 Demo https://t.co/2xfiRo6wij 💡ChronoEdit brings temporal reasoning to image editing task. It achieves STOA

5

35

106

Ruilong Li

@ruilong_li

28 days

Checkout our latest work on Gaussian Splatting for LiDAR with 3DGUT!

Haithem Turki

@Haithem_Turki

28 days

[1/N] Excited to introduce "SimULi: Real-Time LiDAR and Camera Simulation with Unscented Transforms." We extend 3DGUT with LiDAR support and render a wide range of sensors 10-20x faster than ray tracing and 1.5-10x faster than prior rasterization work. https://t.co/Q9J2T5cjLj

2

30

381

Kai He

@Kai__He

28 days

Please check out this tool! It enables easy generation of dynamic G-buffers and data under various lighting conditions. It’s used for data generation in DiffusionRenderer, UniRelight, and LuxDiT. 🚀

Ruofan Liang

@RfLiang

28 days

Just dropped a Blender-based data generation tool that can be used to render randomly composed synthetic scenes with all G-Buffer attributes. 😋

1

9

76

Hang Gao

@hangg70

1 month

Excited to share what I’ve been working on since joining @xai — Grok Imagine v0.9. You can try it at https://t.co/lhQN6rSpAr . Looking back on the past two months, a few lessons really stuck with me: (1) have faith in scaling (cautiously); (2) solid, well-tracked

xAI

@xai

1 month

Introducing Imagine v0.9, our new video generation model with massive upgrades from v0.1 in visual quality, motion, audio generation, and more. Now available for free on all our products: https://t.co/2DPEzEZ03e

25

24

307

Xuanchi Ren

@xuanchi13

1 month

Zero-shot video reasoning (chain-of-frames) isn’t just for Veo3 — open-source models can understand and edit too! 🕹️ ChronoEdit brings temporal reasoning to image editing. 🔗 https://t.co/6pyTDfzmGH

Huan Ling

@HuanLing6

1 month

🕹️We are excited to introduce "ChronoEdit: Towards Temporal Reasoning for Image Editing and World Simulation" ChronoEdit reframes image editing as a video generation task to encourage temporal consistency. It leverages a temporal reasoning stage that denoises with “video

1

18

125

Huan Ling

@HuanLing6

1 month

🕹️We are excited to introduce "ChronoEdit: Towards Temporal Reasoning for Image Editing and World Simulation" ChronoEdit reframes image editing as a video generation task to encourage temporal consistency. It leverages a temporal reasoning stage that denoises with “video

6

37

140

HU Wenbo

@wbhu_cuhk

2 months

🎉 Excited to share our latest work in streaming long video generation—say hi to #RollingForcing! This tool lets you create multi-minute videos in real-time, with minimal error accumulation. We’re fired up to think it could be a fundamental component of interactive #WorldModel

2

3

21

Ethan Weber

@ethanjohnweber

1 month

📢 SceneComp @ ICCV 2025 🏝️ 🌎 Generative Scene Completion for Immersive Worlds 🛠️ Reconstruct what you know AND 🪄 Generate what you don’t! 🙌 Meet our speakers @angelaqdai, @holynski_, @jampani_varun, @ZGojcic @taiyasaki, Peter Kontschieder https://t.co/LvONYIK3dz #ICCV2025

2

17

53

Sherwin Bahmani

@sherwinbahmani

2 months

📢 Lyra: Generative 3D Scene Reconstruction via Video Diffusion Model Self-Distillation Got only one or a few images and wondering if recovering the 3D environment is a reconstruction or generation problem? Why not do it with a generative reconstruction model! We show that a

19

71

251

Justin Kerr

@justkerrding

2 months

Should robots have eyeballs? Human eyes move constantly and use variable resolution to actively gather visual details. In EyeRobot ( https://t.co/iSL7ZLZcHu) we train a robot eyeball entirely with RL: eye movements emerge from experience driven by task-driven rewards.

8

56

272

AK

@_akhaliq

2 months

Nvidia just released Lyra on Hugging Face Generative 3D Scene Reconstruction via Video Diffusion Model Self-Distillation TL;DR: Feed-forward 3D and 4D scene generation from a single image/video trained with synthetic data generated by a camera-controlled video diffusion model

17

96

547

Ethan Weber

@ethanjohnweber

2 months

It’s live! 🎉 🗺️ It was very fun working with @Nik__V__ and our team @Meta for this release. I’m excited to see how the community uses it. 😃

Nikhil Keetha

@Nik__V__

2 months

Meet MapAnything – a transformer that directly regresses factored metric 3D scene geometry (from images, calibration, poses, or depth) in an end-to-end way. No pipelines, no extra stages. Just 3D geometry & cameras, straight from any type of input, delivering new state-of-the-art

0

3

31

Ruofan Liang

@RfLiang

2 months

💡 Introducing LuxDiT: a diffusion transformer (DiT) that estimates realistic scene lighting from a single image or video. It produces accurate HDR environment maps, addressing a long-standing challenge in computer vision. 🔗Paper: https://t.co/6cW6WlREBl

3

58

275

Esther Lin

@estheroate

2 months

Every lens leaves a blur signature—a hidden fingerprint in every photo. In our new #TPAMI paper, we show how to learn it fast (5 mins of capture!) with Lens Blur Fields ✨ With it, we can tell apart ‘identical’ phones by their optics, deblur images, and render realistic blurs.

157

711

7K

Junchen Liu

@JunchenLiu77

3 months

3D annotation has never been easier!

Jiahui Huang

@huangjh_hjh

3 months

[1/N] 🎥 We've made available a powerful spatial AI tool named ViPE: Video Pose Engine, to recover camera motion, intrinsics, and dense metric depth from casual videos! Running at 3–5 FPS, ViPE handles cinematic shots, dashcams, and even 360° panoramas. 🔗 https://t.co/1mGDxwgYJt

0

4

Ruilong Li

@ruilong_li

3 months

Genie3 is like magic! Curious the best way to add viewpoint conditioning signal into transformer? Check this out 👉

liruilong.cn

We introduce PRoPE, a method for conditioning image tokens based on corresponding camera parameters in transformers for multiview vision tasks.

Aleksander Holynski

@holynski_

3 months

Another one. Already a powerful painting, but moving around it yourself gives a totally different feeling. Jacques Louis David's "The Death of Socrates" => #Genie3

1

4

66

Junchen Liu

@JunchenLiu77

3 months

A real-time interactive model with almost perfect 3D consistency and long-term memory!!

Aleksander Holynski

@holynski_

3 months

Another one. Already a powerful painting, but moving around it yourself gives a totally different feeling. Jacques Louis David's "The Death of Socrates" => #Genie3

0

1

Alexandre Morgand

@Almorgand

4 months

"Cameras as Relative Positional Encoding" TLDR: comparison for conditioning transformers on cameras: token-level raymap, attention-level relative pose encodings, a (new) relative encoding Projective Positional Encoding -> camera frustums, (int|ext)insics for relative pos encoding

2

51

465

Junchen Liu

@JunchenLiu77

3 months

Viser is a very practical open-source tool for visualizing your 3D data in realtime, interactively and easily shareable!

Brent Yi

@brenthyi

4 months

July has been a big month for Viser! - Released v1.0.0😊 - We did some writing Some demos👇

0

8

David McAllister

@davidrmcall

4 months

Excited to share Flow Matching Policy Gradients: expressive RL policies trained from rewards using flow matching. It’s an easy, drop-in replacement for Gaussian PPO on control tasks.

8

206

1K