Sihyun Yu Profile
Sihyun Yu

@sihyun_yu

Followers
1K
Following
1K
Media
13
Statuses
159

Ph.D. student @ KAIST | Ex-intern @NVIDIAAI and @GoogleAI | Generative models | https://t.co/wTvMmsks3e

Daejeon
Joined July 2020
Don't wanna be here? Send us removal request.
@sihyun_yu
Sihyun Yu
10 months
Introducing REPA! We show that learning high-quality representations in diffusion transformers is crucial for boosting generation performance. With REPA, we speed up SiT training by 17.5x (without CFG) and achieve state-of-the-art FID = 1.42 using CFG with the guidance interval.
Tweet media one
6
46
286
@sihyun_yu
Sihyun Yu
21 days
RT @2prime_PKU: Anyone knows adam?
Tweet media one
0
463
0
@sihyun_yu
Sihyun Yu
29 days
RT @_akhaliq: Enhancing Motion Dynamics of Image-to-Video Models via Adaptive Low-Pass Guidance
0
21
0
@sihyun_yu
Sihyun Yu
30 days
RT @SoojungYang2: 🚀 Come check our poster at ICML @genbio_workshop!.We show that pretrained MLIPs can accelerate training of Boltzmann emul….
0
18
0
@sihyun_yu
Sihyun Yu
1 month
I’ve wondered why I2V models tend to generate more static videos compared to their T2V counterparts. This project, led by @june_suk_choi, provides an analysis of this phenomenon and introduces a very simple (yet effective) fix to address it! Excited to have been part of this.
@june_suk_choi
June Suk Choi
1 month
Excited to share Adaptive Low-Pass Guidance (ALG): a simple training-free, drop-in fix that brings dynamic motion back to Image-to-Video models! Demo videos, paper, & code below! .(🧵 1/7)
0
2
29
@sihyun_yu
Sihyun Yu
1 month
RT @sainingxie: @joserf28323 @CVPR @ICCVConference @nyuniversity Thanks for bringing this to my attention. I honestly wasn’t aware of the s….
0
29
0
@sihyun_yu
Sihyun Yu
2 months
Excited to share MDMs for molecule generation led by @bellaseo72 and @taewonKKK!.
@bellaseo72
Hyunjin Seo
2 months
Meet MELD: a masked diffusion model (MDMs) designed for de novo molecule generation. MELD assigns per-element learnable noise schedule that tailors noise at the atom & bond level to avoid state-clashing problem. With MELD we achieve state-of-the-art property alignment in
Tweet media one
0
1
11
@sihyun_yu
Sihyun Yu
2 months
RT @wenhaocha1: We introduce LiveCodeBench Pro. Models like o3-high, o4-mini, and Gemini 2.5 Pro score 0% on hard competitive programming p….
0
27
0
@sihyun_yu
Sihyun Yu
2 months
RT @CVPR: #CVPR2025 PAMI-TC awards
Tweet media one
Tweet media two
Tweet media three
Tweet media four
0
15
0
@sihyun_yu
Sihyun Yu
2 months
RT @RickyTQChen: Padding in our non-AR sequence models? Yuck. 🙅. 👉 Instead of unmasking, our new work *Edit Flows* perform iterative refine….
0
79
0
@sihyun_yu
Sihyun Yu
2 months
RT @sainingxie: Had a great time at this CVPR community-building workshop---lots of fun discussions and some really important insights for….
0
66
0
@sihyun_yu
Sihyun Yu
2 months
RT @ma_nanye: Join us for a full-day tutorial on Scalable Generative Models in Computer Vision at @CVPR in Nashville, on Wednesday, June 11….
0
21
0
@sihyun_yu
Sihyun Yu
3 months
RT @younggyoseo: Excited to present FastTD3: a simple, fast, and capable off-policy RL algorithm for humanoid control -- with an open-sourc….
0
114
0
@sihyun_yu
Sihyun Yu
3 months
RT @sainingxie: Indeed. For text-to-image, @xichen_pan had a great summary supporting this decoupled design philosophy: "Render unto diffus….
0
36
0
@sihyun_yu
Sihyun Yu
3 months
RT @DBahdanau: Adam deserves the award, but in Singapore everyone still uses SGD.
0
64
0
@sihyun_yu
Sihyun Yu
3 months
1. Controllable human generation: led by @cpis9898 .2. Long video tokenization: led by @huiwon0516 and @younggyoseo .3. Long video generation: an internship project at Google Research in collaboration with.
Tweet card summary image
arxiv.org
Diffusion models are successful for synthesizing high-quality videos but are limited to generating short clips (e.g., 2-10 seconds). Synthesizing sustained footage (e.g. over minutes) still...
0
0
5
@sihyun_yu
Sihyun Yu
3 months
I'll be at #CVPR2025 to present three papers on controllable human generation, efficient long video tokenization, and long video generation with memory modules. Would love to catch up — feel free to DM me if you're around and up for coffee!
Tweet media one
Tweet media two
Tweet media three
1
0
30
@sihyun_yu
Sihyun Yu
3 months
RT @ZhengyangGeng: Excited to share our work with my amazing collaborators, @Goodeat258, @SimulatedAnneal, @zicokolter, and Kaiming. In a….
0
39
0
@sihyun_yu
Sihyun Yu
3 months
RT @iScienceLuvr: Mean Flows for One-step Generative Modeling. "We introduce the notion of average velocity to characterize flow fields, i….
0
62
0