Junsong_Chen
@lawrence_cjs
Followers 203 · Following 79 · Media 9 · Statuses 53
HKU Ph.D., NVIDIA Research Internship
Hong Kong
Joined February 2022
We (@lawrence_cjs, @yuyangzhao_, @shanasaimoe) from the SANA team just posted a blog on the core of Linear Attention: how it achieves infinite context lengths with global awareness but constant memory usage! We explore state accumulation mechanics, the evolution from Softmax to
4
34
179
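A minimal NumPy sketch of the state-accumulation mechanic described above, in the spirit of Katharopoulos et al.'s linear attention; the feature map and function names are illustrative assumptions, not SANA's actual kernel:

```python
import numpy as np

def linear_attention_stream(qs, ks, vs):
    """Causal linear attention processed one token at a time.
    The running state (S, z) has a fixed size, so memory stays
    constant no matter how long the sequence grows."""
    phi = lambda x: np.maximum(x, 0.0) + 1e-6   # simple positive feature map
    d_k, d_v = ks.shape[1], vs.shape[1]
    S = np.zeros((d_k, d_v))   # cumulative sum of phi(k) v^T
    z = np.zeros(d_k)          # cumulative sum of phi(k), for normalization
    outs = []
    for q, k, v in zip(qs, ks, vs):
        S += np.outer(phi(k), v)                         # accumulate state
        z += phi(k)
        outs.append((phi(q) @ S) / (phi(q) @ z + 1e-9))  # query the state
    return np.stack(outs)

# Usage: only S (d_k x d_v) and z (d_k) persist between tokens.
T, d = 1000, 16
out = linear_attention_stream(*(np.random.randn(T, d) for _ in range(3)))
```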
How do Linear Attention and Softmax Attention differ in compute and KV-cache cost for LLMs and long-video generation? Let's start with this blog. https://t.co/Ja5El08muf
0
1
1
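To make the KV-cache contrast concrete, a back-of-the-envelope comparison (model dimensions are illustrative assumptions, not any specific LLM):

```python
def softmax_kv_bytes(seq_len, n_layers, n_heads, head_dim, bytes_per_val=2):
    # Softmax attention caches K and V for every past token: grows with seq_len.
    return 2 * seq_len * n_layers * n_heads * head_dim * bytes_per_val

def linear_state_bytes(n_layers, n_heads, head_dim, bytes_per_val=2):
    # Linear attention keeps one (d x d) state and one d-vector per head: constant.
    return n_layers * n_heads * (head_dim * head_dim + head_dim) * bytes_per_val

print(softmax_kv_bytes(100_000, 32, 32, 128) / 2**30)  # ~48.8 GiB at 100k tokens
print(linear_state_bytes(32, 32, 128) / 2**20)         # ~32.3 MiB at any length
```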
Sora 2 is amazing! But AI video generation inference is still too slow. Try our Deep Compression Autoencoder + Linear Attention! https://t.co/ooNowz8HH7
https://t.co/PU8oUI2hsU
github.com
DC-VideoGen: Efficient Video Generation with Deep Compression Video Autoencoder - dc-ai-projects/DC-VideoGen
1
8
71
Thanks so much @_akhaliq for sharing our recent work. Our homepage is here:
0
0
1
Changing the autoencoder in latent diffusion models is easier than you think. Introducing DC-Gen, a post-training acceleration framework that works with any pre-trained diffusion model, boosting efficiency by transferring it into a deeply compressed latent space with
5
38
222
We release DC-VideoGen, a new post-training framework for accelerating video diffusion models. Key features: • Supports video generation up to 2160×3840 (4K) resolution on a single H100 GPU • Delivers 14.8× faster inference than the base model while achieving comparable or
2
28
145
SANA-Video: Linear Attention + Constant-Memory KV Cache = Fast Long Videos. Key features: • Linear DiT everywhere → O(N) complexity on video-scale tokens • Constant-memory block KV cache → store cumulative states only (no growing KV) • Temporal Mix-FFN + 3D RoPE
3
20
126
Finally: 36 s for a 5 s 720p clip on H100 (4× speedup vs. vanilla attention at 720p); 29 s on RTX 5090 with NVFP4 (2.4× faster). Fixed VRAM regardless of sequence length; strong text-video alignment.
0
0
0
3. Temporal Mix-FFN + 3D RoPE → local fidelity + temporal coherence. 4. AR block training with self-rollout → minute-length generation.
1
0
0
2. Constant-memory block KV cache → cumulative states only (no growing KV); see the sketch below.
1
0
0
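A toy sketch of the constant-memory block update from point 2 above; the feature map and shapes are assumptions for illustration, and SANA-Video's exact formulation may differ:

```python
import numpy as np

def update_block_state(S, z, K_block, V_block):
    """Fold an entire block of keys/values into the fixed-size cumulative
    state instead of appending it to a growing KV cache."""
    phi = lambda x: np.maximum(x, 0.0) + 1e-6
    S = S + phi(K_block).T @ V_block    # (d_k, d_v): independent of video length
    z = z + phi(K_block).sum(axis=0)    # (d_k,)
    return S, z

# After each block, only (S, z) is kept; the block's K/V can be discarded.
d_k, d_v, block_len = 64, 64, 256
S, z = np.zeros((d_k, d_v)), np.zeros(d_k)
S, z = update_block_state(S, z,
                          np.random.randn(block_len, d_k),
                          np.random.randn(block_len, d_v))
```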
Keys: 1. Linear DiT everywhere → O(N) complexity on video-scale tokens.
1
0
0
SANA-Video: Linear Attention + Constant-Memory KV Cache = Fast Long Videos. It's time for a new SANA family member! Links: Paper: https://t.co/snV4bF8jUM | Project Page: https://t.co/9WZIp7ryX6
1
1
3
Explore recent work from our team: LongLive generates minute-length videos and interacts with you in real time at high speed! Very cool project.
We open-sourced LongLive: interactive, real-time long-video generation. • Generates video in real time as users enter text prompts. • 20.7 FPS on a single H100, up to 240 s per clip. • Fine-tunes SOTA short-video models (e.g., Wan) into long-video generators. • One step
0
0
1
Explore Deep Compression Autoencoder (DC-AE) 1.5, with a higher token compression ratio (64×) for faster visual generation:
Excited to announce DC-AE 1.5! With a spatial compression ratio boosted to f64, it accelerates high-res diffusion models while preserving text-to-image quality. Key innovation: a channel-wise latent structure for faster convergence with many latent channels. Catch us at
1
2
24
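Why f64 matters, in rough numbers (a 1024×1024 image, one latent token per spatial cell; illustrative arithmetic, not figures from the paper):

```python
h = w = 1024
for f in (8, 32, 64):   # spatial downsampling factor of the autoencoder
    side = h // f
    print(f"f{f}: {side}x{side} = {side * side} latent tokens")
# f8:  128x128 = 16384 tokens
# f32: 32x32   = 1024 tokens
# f64: 16x16   = 256 tokens -> 64x fewer tokens than f8, and softmax
#       attention cost shrinks roughly quadratically on top of that
```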
The best few-step sampling model across the speed-memory frontier? Introducing SANA-Sprint in collaboration with the great SANA team! Beyond the results, perhaps more importantly, the work is about the recipe behind SANA-Sprint. Code & model will be open. Let's go!
12
26
162
SANA-Sprint: One-Step Diffusion with Continuous-Time Consistency Distillation
10
66
425
Still think consistency models are bad at scale? In fact, sCM can be stably scaled to modern text-to-image diffusion models, greatly improving generation speed and 1-step generation quality!
3
4
55
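The one-step idea in miniature: a consistency model maps pure noise at t = T straight to data. A hedged PyTorch sketch, where `model` is a hypothetical callable f(x_t, t) -> x_0, not SANA-Sprint's actual API:

```python
import torch

@torch.no_grad()
def one_step_sample(model, shape, sigma_max=80.0, device="cpu"):
    # Draw pure noise at the terminal time t = T (EDM-style sigma_max assumed).
    x_T = torch.randn(shape, device=device) * sigma_max
    t = torch.full((shape[0],), sigma_max, device=device)
    # The consistency function maps (x_T, T) directly to a clean sample.
    return model(x_T, t)
```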
Excited for SANA-Sprint. Code and weights will be released very soon along with diffusers. Stay tuned!
0
0
3
Introducing SANA 1.5: model scaling up, then scaling down. Inference-time scaling also works as an automatic end-to-end pipeline.
SANA 1.5: A linear Diffusion Transformer pushes SOTA in text-to-image generation! Key innovations: • Depth-growth training: 1.6B → 4.8B params • Memory-efficient 8-bit optimizer • Flexible model pruning • Inference scaling for better quality. Achieves 0.80 on GenEval!
0
0
2
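A toy sketch of what depth-growth training could look like mechanically; the duplication scheme below is an assumption for illustration, and SANA 1.5's actual initialization may differ:

```python
import copy
import torch.nn as nn

def grow_depth(blocks: nn.ModuleList) -> nn.ModuleList:
    """Deepen a pre-trained transformer by interleaving copies of its
    blocks, so the grown model starts close to the original function
    and is then fine-tuned."""
    grown = []
    for blk in blocks:
        grown.append(blk)                  # keep the pre-trained block
        grown.append(copy.deepcopy(blk))   # insert a copy to be trained
    return nn.ModuleList(grown)
```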