kuchaev Profile Banner
Oleksii Kuchaiev Profile
Oleksii Kuchaiev

@kuchaev

Followers
2K
Following
18K
Media
51
Statuses
709

Director, AI model post-training @NVIDIA

in the cloud
Joined February 2010
Don't wanna be here? Send us removal request.
@kuchaev
Oleksii Kuchaiev
3 months
We are excited to release Llama-Nemotron-Ultra! This is a reasoning ON/OFF, dense 253B model. Open weights and post-training data. We started with llama-405B, changed it via NAS pruning then followed by reasoning-focused post-training: SFT + RL in FP8.
Tweet media one
Tweet media two
24
124
698
@kuchaev
Oleksii Kuchaiev
8 hours
RT @igtmn: We've released a series of OpenReasoning-Nemotron models (1.5B, 7B, 14B and 32B) that set new SOTA on a wide range of reasoning….
0
29
0
@kuchaev
Oleksii Kuchaiev
10 hours
RT @NVIDIAAIDev: 📣 Announcing the release of OpenReasoning-Nemotron: a suite of reasoning-capable LLMs which have been distilled from the D….
0
73
0
@kuchaev
Oleksii Kuchaiev
2 days
✈️ to ICML workshops to talk about the first open-weight model that outsmarted original DS-R1 on AA index. Happy to chat all things post-training and AI in general. (The poster is EXAIT workshop this Saturday)
Tweet media one
0
1
24
@kuchaev
Oleksii Kuchaiev
4 days
If you are a researcher working on LLM post-training, RL and reasoning, you should really give NeMo-RL a try. Works with hugginface and megatron-core (when you need scale). Here is great blogpost by @AlexanderBukha1 and team on how to get started:
0
25
196
@kuchaev
Oleksii Kuchaiev
10 days
RT @BanghuaZ: Really excited to work with @AndrewYNg and @DeepLearningAI on this new course on post-training of LLMs—one of the most creat….
0
45
0
@kuchaev
Oleksii Kuchaiev
15 days
RT @United24media: Stop Russian nightly terror. Help Ukraine protect its skies. DONATE 👇 .
0
346
0
@kuchaev
Oleksii Kuchaiev
15 days
RT @RayDalio: Now that the budget bill has passed Congress, we can see what the projections look like for deficits, government debt, and de….
0
5K
0
@kuchaev
Oleksii Kuchaiev
16 days
Post-training of LLMs is increasingly important and RLHF remains a necessary step for an overall great model. Today we are releasing 6 new reward models, including GenRMs and multilingual. These models are used to post-train next *-nemotron models.
3
64
217
@kuchaev
Oleksii Kuchaiev
1 month
RT @ctnzr: NVIDIA benefits greatly from the open-source community, and we're excited to be able to contribute back. It's great to see so mu….
0
15
0
@kuchaev
Oleksii Kuchaiev
1 month
RT @BohuslavskaKate: Kyiv this morning ‼️
0
1K
0
@kuchaev
Oleksii Kuchaiev
1 month
RT @razomforukraine: ‼️ As G7 leaders meet in Canada, Russia sends a clear message by bombing Kyiv. Homes destroyed, kindergarten hit, civi….
0
175
0
@kuchaev
Oleksii Kuchaiev
1 month
RT @CAgovernor: Californians: If you’re protesting today, protect one another and hold the line for peace. There’s no place for violence i….
0
3K
0
@kuchaev
Oleksii Kuchaiev
1 month
AI model post training is rapidly improving. The plot below (starting from the same base model) illustrates about 10 months of progress in the *open* post-training research. I’m not convinced that closed research can move as fast.
Tweet media one
1
4
22
@kuchaev
Oleksii Kuchaiev
1 month
New reasoning Nemotron-H models are now publicly available. These models are based on hybrid architecture! .47B and 8B in BF16 and FP8. Blogpost: Weights:
@rendu_a
Adi Renduchintala
1 month
Transformers are still dominating the LLM scene but we show that higher throughput alternatives exist which are just as strong! . Grateful to have a part in Nemotron-H Reasoning effort. 🙏 Technical report will be out soon, stay tuned!.
1
25
123
@kuchaev
Oleksii Kuchaiev
2 months
RT @AndrewYNg: I am alarmed by the proposed cuts to U.S. funding for basic research, and the impact this would have for U.S. competitivenes….
0
478
0
@kuchaev
Oleksii Kuchaiev
2 months
NVIDIA Blackwell: The Journey From Die to Data Center via @YouTube.
0
1
11
@kuchaev
Oleksii Kuchaiev
2 months
RT @nvidianewsroom: NVIDIA today reported record revenue for Q1 FY26 of $44.1 billion, up 12% from the previous quarter and up 69% from a y….
0
179
0
@kuchaev
Oleksii Kuchaiev
2 months
RT @ctnzr: Nemotron-CORTEXA just reached the top of the SWEBench leaderboard for using LLMs to solve software engineering problems, solving….
0
36
0
@kuchaev
Oleksii Kuchaiev
2 months
RT @rshereme: Russia killed them. This morning, while you were having breakfast and coffee, russians murdered three Ukrainian children.….
0
2K
0
@kuchaev
Oleksii Kuchaiev
2 months
RT @maria_avdv: Brutal night for Kyiv and all of Ukraine. Russia launched massive combined attack on civilians: ballistic missiles, Shahed….
0
1K
0