
Fan-Yun Sun
@sunfanyun
Followers
1K
Following
167
Media
43
Statuses
138
cs phd @StanfordAILab @stanfordsvl @NVIDIAAI embodied AI, code generation, 3D
Stanford, CA
Joined October 2018
Spatial reasoning is a major challenge for the foundation models today, even in simple tasks like arranging objects in 3D space. #CVPR2025 .Introducing LayoutVLM, a differentiable optimization framework that uses VLM to spatially reason about diverse scene layouts from unlabeled
4
59
240
RT @chrmanning: I’ve joined @aixventureshq as a General Partner, working on investing in deep AI startups. Looking forward to working with….
wsj.com
Christopher Manning, one of the most cited researchers in the field of natural language processing and a former director of the Stanford AI Lab, has taken a leave of absence from Stanford University...
0
35
0
RT @sanjana__z: 🤖 Household robots are becoming physically viable. But interacting with people in the home requires handling unseen, uncons….
0
32
0
RT @wenlong_huang: How to scale visual affordance learning that is fine-grained, task-conditioned, works in-the-wild, in dynamic envs?. Int….
0
108
0
RT @silasalberti: we trained Kevin-32B = K(ernel D)evin using GRPO on KernelBench. it's to our knowledge the first open model trained using….
0
28
0
RT @sidahuj: 🧑🎨 The future of creative tools will look very different. 🧠 Imagine an AI control-centre for orchestrating complex tasks usi….
0
39
0
RT @sidahuj: 🧩 Built an MCP that lets Claude talk directly to Blender. It helps you create beautiful 3D scenes using just prompts!. Here’s….
0
1K
0
Huge thanks to the amazing team: @Weiyu_Liu_ (co-lead), Siyi Gu, @dill_pkl , Goutam Bhat, @fedassa , @ManlingLi_ , @nickhaber , @jiajunwu_cs . 🌐Project site: 💻 Code (we plan to open-source everything):. n/n.
github.com
Official code for "LayoutVLM: Differentiable Optimization of 3D Layout via Vision-Language Models" (CVPR 2025) - sunfanyunn/LayoutVLM
2
2
10
Automated 3D layout generation unlocks richer simulation environments for robotics and embodied AI, enabling:.🔹 More realistic scenes and layouts during training .🔹 Improved generalization for real-world deployment. Consider scene_synthesizer by @clembow, which shares a similar
1
1
5
RT @YueYangAI: We share Code-Guided Synthetic Data Generation: using LLM-generated code to create multimodal datasets for text-rich images,….
0
47
0