YinCuiCV Profile Banner
Yin Cui Profile
Yin Cui

@YinCuiCV

Followers
6K
Following
1K
Media
46
Statuses
400

Research Scientist @NVIDIA | Formerly @Google, @Cornell | Views are my own

Mountain View, CA
Joined October 2012
Don't wanna be here? Send us removal request.
@YinCuiCV
Yin Cui
2 months
Introducing the Describe Anything Model (DAM), a powerful Multimodal LLM that generates detailed descriptions for user-specified regions in images or videos using points, boxes, scribbles, or masks. Open-source code, models, demo, data, and benchmark at:
8
78
407
@YinCuiCV
Yin Cui
4 days
RT @LongTonyLian: Excited to share that Describe Anything has been accepted at ICCV 2025! 🎉. Describe Anything Model (DAM) is a powerful Mu….
0
24
0
@YinCuiCV
Yin Cui
8 days
RT @hanna_mao: We build Cosmos-Predict2 as a world foundation model for Physical AI builders — fully open and adaptable. Post-train it for….
0
70
0
@YinCuiCV
Yin Cui
11 days
RT @googleaidevs: Introducing Gemini CLI, a light and powerful open-source AI agent that brings Gemini directly into your terminal. >_. Wri….
0
2K
0
@YinCuiCV
Yin Cui
19 days
RT @ComfyUI: 🎉 ComfyUI now natively supports NVIDIA’s Cosmos-Predict2 model family!. Cosmos-Predict2 brings high-fidelity, physics-aware Im….
0
42
0
@YinCuiCV
Yin Cui
20 days
RT @hanna_mao: 🚀 We're releasing Cosmos-Predict2 — our developer-first, top-performing world foundation models for Physical AI!.🔗 https://t….
0
39
0
@YinCuiCV
Yin Cui
22 days
RT @ashawkey3: Happy to share our work PartPacker:.We enable one-shot image-to-3D generation with any number of parts!. Project page: https….
0
16
0
@YinCuiCV
Yin Cui
24 days
RT @chenhsuanlin: Cosmos-Predict2 is our latest open video foundation model for Physical AI!. If you’re at #cvpr202….
0
5
0
@YinCuiCV
Yin Cui
25 days
RT @TsungYiLinCV: The physics meets vision workshop just started! Come joining us!
Tweet media one
0
4
0
@YinCuiCV
Yin Cui
25 days
RT @prithvijitch: The WorldModelBench workshop is happening tomorrow (June 12th) at #CVPR2025! We have an exciting series of talks, do atte….
0
9
0
@YinCuiCV
Yin Cui
25 days
If you are attending #CVPR2025 tomorrow, please visit two highly relevant workshops organized by our team members:.- Vision Meets Physics: - Benchmarking World Models:
@qsh_zh
Qinsheng Zhang
25 days
many core-contributors are attending #CVPR2025 . Let’s discuss the future of world models!.
0
3
28
@YinCuiCV
Yin Cui
25 days
RT @qsh_zh: 🚀 Introducing Cosmos-Predict2!. Our most powerful open video foundation model for Physical AI. Cosmos-Predict2 significantly im….
0
60
0
@YinCuiCV
Yin Cui
25 days
RT @FangyinWei: Join us on the 1st workshop on Vision Meets Physics: Synergizing Physical Simulation and Computer Vision at #CVPR2025 tomor….
0
5
0
@YinCuiCV
Yin Cui
25 days
RT @mli0603: Cosmos-Reason1 has exciting updates 💡.Now it understands physical reality — judging videos as real or fake! Check out the reso….
0
32
0
@YinCuiCV
Yin Cui
1 month
This work is led by our amazing intern @chenhuay17 and his host @Haoxiang__Wang!. Other collaborators: @zkwthu @qsh_zh @charlesfornlp @haotian_yeee @TsungYiLinCV @liu_mingyu Jun Zhu. The code and model will be released soon!.
0
0
4
@YinCuiCV
Yin Cui
1 month
As a supervised learning method, NFT outperforms leading RL algorithms like GRPO and DAPO in 7B model experiments and performs similarly to DAPO in 32B settings. We conduct 3-4 independent experiments in 7B training to reduce random noise.
Tweet media one
1
0
3
@YinCuiCV
Yin Cui
1 month
Why is it even possible to optimize your LLMs on negative data via supervision? The key is a tight coupling between positive & negative answer distributions generated by the same LLM in online training.
Tweet media one
1
0
2
@YinCuiCV
Yin Cui
1 month
We proved that NFT is equivalent to GRPO in strict on-policy training, despite their entirely different theoretical foundations. The normalized advantage design of GRPO is implicitly reflected in NFT's objective. NFT bridges the existing RL and SL theoretical framework.
Tweet media one
1
0
2
@YinCuiCV
Yin Cui
1 month
Traditional supervised learning methods like rejection sampling fine-tuning cannot leverage negative feedback. The key to NFT’s success lies in its effective use of negative feedback, by constructing an implicit negative policy parameterized by the target LLM. Training this
Tweet media one
1
0
4
@YinCuiCV
Yin Cui
1 month
Is self-improvement exclusive to RL?. Can we use supervised learning to match LLMs trained with SOTA RL algorithms?. In Negative-aware Fine-Tuning (NFT), we introduce a purely supervised learning method to enhance LLMs' math reasoning with no external teachers. NFT matches or
Tweet media one
Tweet media two
Tweet media three
3
39
185
@YinCuiCV
Yin Cui
2 months
RT @OriolVinyalsML: Today we introduced Gemini Diffusion⚡️ (& DeepThink, Veo3, Imagen4, 2.5 updates. ). It's been a dream of mine to remo….
0
102
0