
Torsten Scholak
@tscholak
Followers
2K
Following
51K
Media
175
Statuses
4K
Lead Research Scientist, Foundation Models Lab @ServiceNowRSRCH. Opinions are not that of my employer.
Montréal
Joined February 2010
🚨🤯 Today Jensen Huang announced SLAM Lab's newest model on the @HelloKnowledge stage: Apriel‑Nemotron‑15B‑Thinker 🚨.A lean, mean reasoning machine punching way above its weight class 👊.Built by SLAM × NVIDIA. Smaller models, bigger impact. 🧵👇
2
22
47
RT @alex_lacoste_: 🚨 Is #WorkArena on the verge of being solved? Or did GPT-5 just get trained on it?. 🔥While some benchmarks show modest g….
0
24
0
RT @GabrielHuang9: As #ICML2025 kicks off in Vancouver, our AI talent is being quietly pushed out. 🇨🇦. We've been waiting 28 months for per….
0
10
0
Nice release! Worth noting the MoE x Mamba gives coverage, not multiplicative speed-ups:.* small batch: expert sparsity keeps latency low.* medium-large batch: Mamba's KV-free scan scales while attention would choke.Net: below dense latency across the board, but no compounding.
Crazy that we now have an open source model with 13B params that’s competitive w o1. And Mamba layers help bring much higher inference throughput.
0
0
5
RT @joanrod_ai: Thanks @_akhaliq for sharing our work! Excited to present our next generation of SVG models, now using Reinforcement Learni….
0
41
0
RT @PShravannayak: 🚀 Excited to share that UI-Vision has been accepted at ICML 2025! 🎉. We have also released the UI-Vision grounding datas….
huggingface.co
0
15
0
RT @NVIDIAAI: 🚀 Announced at #Knowledge25: @ServiceNow & @nvidia introduce Apriel Nemotron 15B. Apriel Nemotron 15B is a compact, cost-eff….
0
54
0
RT @ServiceNowNews: Together with @NVIDIA, we're launching a new class of intelligent AI agents. Our Apriel Nemotron 15B model, co-develope….
0
19
0
Try it, tune it, test it out!.Huge thanks to the entire SLAM Lab, @carnaticfiddle, @nvidia, and everyone who contributed. 🙌.#Apriel #Nemotron #FastLLM #ServiceNow #NVIDIA #LLM #AI.
0
0
5
🏗️ Built in a 3-stage pipeline:.1️⃣ CPT: 100B+ tokens (math, science, logic, coding).2️⃣ SFT: 200K curated instructions.3️⃣ RL (GRPO): sharp instruction-following & coding.(+ periodic snapshot merges to prevent forgetting).Made possible by Fast-LLM,
github.com
Accelerating your LLM training to full speed! Made with ❤️ by ServiceNow Research - ServiceNow/Fast-LLM
1
0
5
RT @DBahdanau: I am excited to open-source PipelineRL - a scalable async RL implementation with in-flight weight updates. Why wait until yo….
0
115
0
RT @DBahdanau: AI folks in ServiceNow have been cooking. And they cooked a very delicious small 5B parameter cookie!.
0
2
0
RT @Dorialexander: There isn’t that many newcomers in the SLM space and this one looks very interesting. MIT base models, new open source p….
0
5
0
RT @RajeswarSai: Showing off Apriel-5B 🚀, an efficient and effective compact model yet. Congrats to the whole SLAM team led by @tscholak @….
0
1
0
RT @ostap__alex: Exciting release from ServiceNow Research — introducing Apriel-5B, a compact and efficient open-source language model that….
0
2
0
RT @sebpaquet: This new, speedy and efficient language model arose from a fruitful collaboration between two teams at ServiceNow! Pretrain….
github.com
Accelerating your LLM training to full speed! Made with ❤️ by ServiceNow Research - ServiceNow/Fast-LLM
0
1
0