
Haven (Haiwen) Feng
@HavenFeng
Followers
1K
Following
2K
Media
12
Statuses
155
PhD student @MPI_IS, visiting @berkeley_ai now. Interested in machine learning, computer vision, computer graphic, and how to understand the physical world.
Germany
Joined October 2021
RT @xiuyu_l: Sparsity can make your LoRA fine-tuning go brrr 💨. Announcing SparseLoRA (ICML 2025): up to 1.6-1.9x faster LLM fine-tuning (2….
0
57
0
RT @seohong_park: Q-learning is not yet scalable. I wrote a blog post about my thoughts on scalable RL algorithms.….
0
186
0
🚨 Come see InterDyn at #CVPR2025!. We're showing how video generative models can simulate physical interactions without explicit simulator! 🌍🎬.📌 Poster #173. 🗓️ Saturday, June 14 | 🕥 10:30–12:30. 📍 ExHall D.🎤 Also catch Rick’s spotlight talk at Agents-in-Interaction.
🚀 Introducing InterDyn — our newly accepted CVPR work that explores controllable synthesis of interactive dynamics! Building upon powerful video diffusion models, InterDyn infers future motion and interactions directly from an input image and a dynamic control signal (e.g., a
0
2
33
RT @graceluo_: ✨New preprint: Dual-Process Image Generation! We distill *feedback from a VLM* into *feed-forward image generation*, at infe….
0
176
0
RT @sainingxie: Indeed. For text-to-image, @xichen_pan had a great summary supporting this decoupled design philosophy: "Render unto diffus….
0
35
0
So now we can collect robotics data without teleop??!.
Tired of teleoperating your robots?.We built a way to scale robot datasets without teleop, dynamic simulation, or even robot hardware. Just one smartphone scan + one human hand demo video → thousands of diverse robot trajectories. Trainable by diffusion policy and VLA models
0
0
6
RT @ChungMinKim: Excited to introduce PyRoki ("Python Robot Kinematics"): easier IK, trajectory optimization, motion retargeting. with an….
0
165
0
RT @arthurallshire: our new system trains humanoid robots using data from cell phone videos, enabling skills such as climbing stairs and si….
0
112
0
I will be presenting our SGP-Bench with @ItsTheZhen at #ICLR2025 🚀.Sat, 26th, 3:00–5:30 PM Singapore Time.Hall 3 + Hall 2B Poster #569. Can LLMs 'see' images directly via graphics code?!🧠🖼️ Come by our poster and let's chat!.
🚀 Excited to introduce our new work: SGP-Bench!. Can Large Language Models (LLMs) understand symbolic graphics programs? 🖥️ Imagine giving a model a symbolic graphics program like SVG or CAD and asking it to answer questions about the visual content without actually seeing the
0
5
18
RT @RickyTQChen: This ICLR is the best conference ever. Attendees are extremely friendly and cuddly. What do you mean this is the wrong….
0
27
0
Kudos to the amazing team with @junyi42(co-lead), @QianqianWang5, @yufei_ye, @pengcheng_147, @Michael_J_Black, @trevordarrell, and @akanazawa🖖.
0
0
4