Oleksii Kuchaiev
@kuchaev
Followers: 3K
Following: 22K
Media: 58
Statuses: 777
Director of AI model post-training @NVIDIA
in the cloud
Joined February 2010
We are excited to release the Nvidia-Nemotron-Nano-V2 model! This is a 9B hybrid SSM model with an open base model and open training data. The model also supports runtime "thinking" budget control. HF collection with base and post-trained models: https://t.co/n3M01d8lSm
10
61
299
If someone approaches you to talk about "agents", always ask them for their definition of what it is. Often a good signal on whether to continue or avoid the conversation.
0
0
12
Another mass attack against a completely innocent country. The only fair resolution to this is for russia to be completely destroyed as a nation. Isolated, cut off, Balkanised. The Kremlin should be levelled and Moscow should be without power forever.
2K
2K
8K
When you run AI on your device, it is more efficient and less big brother and free! So it's very cool to see the new llama.cpp UI, a chatgpt-like app that fully runs on your laptop without needing wifi or sending any data external to any API. It supports: - 150,000+ GGUF models
52
178
2K
@huggingface Papers: https://t.co/YKnALzvtIg
https://t.co/qLy5J9S4uf Models in this HF collection: https://t.co/OZNWBkDtua
0
0
0
New reward modeling research and models from our team!
1. "RLBFF: Binary Flexible Feedback to bridge between Human Feedback & Verifiable Rewards"
2. "Think Twice: Branch-and-Rethink Reasoning Reward Model"
As usual, models are on @huggingface Hub. Links in the reply.
1
1
12
NVIDIA is really committed to this open models thing - like you wouldn’t believe! New models dropped during the keynote:
- NVIDIA Nemotron Nano 2 VL
- NVIDIA Nemotron Parse 1.1
- Llama 3.1 Nemotron Safety Guard
With a bunch of other models being moved from closed to open on
developer.nvidia.com
Agentic AI is an ecosystem where specialized language and vision models work together. They handle planning, reasoning, retrieval, and safety guardrailing. Developers need specialized AI agents for…
4
10
73
Nemotron Nano v2 can now do multi-image reasoning and video understanding, along with strong document intelligence, visual Q&A and summarization capabilities.
1
0
6
👀 What happens when AI goes open source? NVIDIA leaders share how Nemotron, our family of open models, is changing how the world builds, customizes, and trusts AI. 🎧 Listen now: https://t.co/hfwD2tpwCA
6
24
98
This picture of Kharkiv firefighters evacuating children after russian strike on kindergarten must be on every front page!
403
5K
12K
🧠 At Open Source AI Week we can’t wait to learn how the community is using #opensource projects to redefine how AI is developed, scaled, and shared across text, image, audio, video, and multimodal tasks. To help accelerate innovation, we are now a top contributor on
2
9
76
@NVIDIAAIDev is building open models, with fairly open data, and open recipes. But we don't want to do it in a vacuum, so we made a thing! You can add and vote on ideas submitted by the community to help make our models more useful, and better, and faster (and stronger).
1
1
6
✈️ to COLM2025. I am looking for exceptional RL and post-training engineers who are excited to push the frontiers of open-source post-training and open models such as Nemotron.
• At the conference? Message me on Whova.
• Not attending? DMs are open. Send your CV & a short note.
2
10
190
Last night, Ukraine once again came under a combined Russian attack – more than 50 missiles and about 500 attack drones. The Russians struck with cruise missiles, “shaheds” and Kinzhals among other things. The Lviv, Ivano-Frankivsk, Zaporizhzhia, Chernihiv, Sumy, Kharkiv,
1K
5K
15K
We are alarmed by reports that Germany is on the verge of a catastrophic about-face, reversing its longstanding and principled opposition to the EU’s Chat Control proposal which, if passed, could spell the end of the right to privacy in Europe. https://t.co/015qmQnIS2
743
9K
31K
Are you ready for web-scale pre-training with RL? 🚀 🔥 New paper: RLP: Reinforcement Learning Pre‑training. We flip the usual recipe for reasoning LLMs: instead of saving RL for post‑training, we bring exploration into pretraining. Core idea: treat chain‑of‑thought as an
22
113
707
A great video by @ctnzr on what Nemotron is and why it is open: https://t.co/PYxyHUJxPp
0
3
11
"At least 10 gigawatts of AI data centers with NVIDIA systems representing millions of GPUs for OpenAI’s next-generation AI infrastructure"
NEWS: OpenAI and NVIDIA announce a landmark strategic partnership that will see OpenAI deploy millions of NVIDIA GPUs. 🔗 https://t.co/hSZJ3cfXQn This agreement helps OpenAI meet surging demand for more powerful, efficient, and cost-effective AI training and inference at scale
0
0
2
My heart goes out to all the families and individuals anxious over their futures following the abrupt and chaotic announcement of H-1B visa changes. America should be working to attract more skilled talent, not create uncertainty that turns them away. To all legal immigrants
590
555
7K