Sathwik Tejaswi
@SathwikTejaswi
Followers
66
Following
34
Media
0
Statuses
17
Technical Co-Lead of Apriel Mid-Training and Post-Training https://t.co/7CEEfb5Fib
SF Bay Area
Joined April 2016
Thank you @mervenoyann for the shout-out!
the new Apriel-1.5 reasoning vision language model by @ServiceNowRSRCH is so good! here's a small vibe test across languages: > ask it to identify drug interactions on a French label, in English > it compares the minerals > finally it comes up with a look-up table with the correct list!
0
0
2
ServiceNow-AI/Apriel-1.5-15b-Thinker running on a single GPU using `transformers serve`. Great to have some very nice reasoning models that can run locally! next step, trying it on MPS
0
9
55
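The post above mentions serving the model locally with `transformers serve`. Below is a minimal client-side sketch for querying such a server, assuming it is already running (for example via `transformers serve`) and exposes an OpenAI-compatible chat completions endpoint on localhost port 8000; the port, endpoint path, and generation parameters are illustrative assumptions, not details from the post.

```python
# Minimal sketch: query a model served locally with `transformers serve`.
# Assumes the server is already running and exposes an OpenAI-compatible
# /v1/chat/completions endpoint; the port, endpoint path, and request
# parameters below are assumptions for illustration.
import requests

payload = {
    "model": "ServiceNow-AI/Apriel-1.5-15b-Thinker",
    "messages": [
        {"role": "user", "content": "Explain chain-of-thought reasoning in two sentences."}
    ],
    "max_tokens": 512,
}

response = requests.post(
    "http://localhost:8000/v1/chat/completions", json=payload, timeout=300
)
response.raise_for_status()
print(response.json()["choices"][0]["message"]["content"])
```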
Congratulations to @ServiceNowRSRCH on introducing Apriel-1.5-15B-Thinker, a powerful new AI model that delivers frontier-level reasoning with a fraction of the compute. We're proud that our Nemotron collection helped power its training.
Congratulations to @ServiceNowRSRCH on introducing Apriel-1.5-15B-Thinker, their 15B-parameter model that matches DeepSeek-R1-0528, Mistral-medium-1.2 and Gemini Flash 2.5 on the Artificial Analysis Index (AAI 52), delivering comparable results at a fraction of the size (at
5
9
78
@ServiceNow released a 15B parameter AI model today. The model is the product of a partnership with Turing, which provided the training data. Breakdown below.
5
20
142
ServiceNow has released Apriel-v1.5-15B-Thinker, a 15B open weights reasoning model that leads our Small Models category (<40B parameters). Overview: Apriel-v1.5-15B-Thinker is a dense, 15B parameter open weights reasoning model. This is not the first model ServiceNow has
19
61
502
Congratulations to @ServiceNowRSRCH on introducing Apriel-1.5-15B-Thinker, their 15B-parameter model that matches DeepSeek-R1-0528, Mistral-medium-1.2 and Gemini Flash 2.5 on the Artificial Analysis Index (AAI 52), delivering comparable results at a fraction of the size (at
12
23
180
SLAM Labs presents Apriel-1.5-15B-Thinker: an open-weights multimodal reasoning model that hits frontier-level performance with just a fraction of the compute.
15
77
337
This is an interesting technical LLM report. This 15B model beats QwQ-32B while using considerably fewer tokens. Most interestingly, the authors heavily use model merging to combine the strengths of different checkpoints. https://t.co/thoIqNEeBd
5
48
346
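The post above highlights that the report relies heavily on model merging across checkpoints. As a rough illustration only, here is a minimal sketch of the simplest form of merging, element-wise weighted averaging of checkpoint state dicts; the Apriel report's actual recipe may differ, and the checkpoint paths are placeholders.

```python
# Minimal sketch of merging checkpoints by averaging weights ("model soup" style).
# This illustrates the general idea of combining checkpoint strengths; it is not
# the recipe from the Apriel report, and the checkpoint paths are placeholders.
import torch

def merge_state_dicts(paths, weights=None):
    """Load several checkpoints and return the weighted average of their tensors."""
    weights = weights or [1.0 / len(paths)] * len(paths)
    merged = None
    for path, w in zip(paths, weights):
        state = torch.load(path, map_location="cpu")
        if merged is None:
            merged = {k: v.float() * w for k, v in state.items()}
        else:
            for k, v in state.items():
                merged[k] += v.float() * w
    return merged

merged = merge_state_dicts(["ckpt_sft.pt", "ckpt_rl.pt"], weights=[0.5, 0.5])
torch.save(merged, "ckpt_merged.pt")
```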
Our work "Variable Layerwise Quantization: A Simple and Effective Approach to Quantize LLMs" is accepted at #ACLFindings2025! https://t.co/7fKAnZQIBr
• Keep key layers high-precision, push others lower
• Compact LLMs w/ ~no accuracy loss
• Simple LIM & ZD scores rank layers
arxiv.org
We present a simple meta quantization approach that quantizes different layers of a large language model (LLM) at different bit levels, and is independent of the underlying quantization technique....
1
3
6
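The paper above assigns different bit-widths to different layers, keeping the important ones at higher precision. The sketch below illustrates only that allocation step, using a placeholder importance score; the paper's LIM and ZD scoring functions are defined in the linked preprint and are not reproduced here.

```python
# Minimal sketch of variable layerwise bit allocation: rank layers by an
# importance score and keep the most important ones at higher precision.
# The scoring values here are placeholders; the paper's LIM and ZD scores
# are not reproduced.
import numpy as np

def assign_bits(importance_scores, high_bits=8, low_bits=4, high_fraction=0.25):
    """Give the top `high_fraction` most important layers `high_bits`, the rest `low_bits`."""
    order = np.argsort(importance_scores)[::-1]  # most important first
    n_high = max(1, int(len(order) * high_fraction))
    bits = {int(layer): low_bits for layer in order}
    for layer in order[:n_high]:
        bits[int(layer)] = high_bits
    return bits

# Hypothetical per-layer importance scores for a 12-layer model.
scores = np.random.rand(12)
print(assign_bits(scores))
```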
Today Jensen Huang announced SLAM Lab's newest model on the @HelloKnowledge stage: Apriel-Nemotron-15B-Thinker. A lean, mean reasoning machine punching way above its weight class. Built by SLAM × NVIDIA. Smaller models, bigger impact.
2
22
47
SLAM Labs presents Apriel-5B! And it lands right in the green zone: Speed + Accuracy + Efficiency. This model punches above its weight, beating bigger LLMs while training on a fraction of the compute. Built with Fast-LLM, our in-house training stack.
5
49
134
BigDocs: An Open and Permissively-Licensed Dataset for Training Multimodal Models on Document and Code Tasks abs: https://t.co/l6wHdrGAt5 project page: https://t.co/55UGlS3FLQ BigDocs-7.5M is a high-quality, open-access dataset comprising 7.5 million multimodal documents across
2
28
142
We just released BigDocs: An Open Multimodal Dataset, our latest work on scaling document understanding across diverse data types! Dive into the details: https://t.co/KfOKZKARDS or come see us at the #NeurIPS2024 RBFM workshop! #AI @ServiceNowRSRCH #bigdocs
0
15
17
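For readers who want to try BigDocs, a natural starting point would be loading it through the Hugging Face `datasets` library, as sketched below; the repository id and split are placeholders, so check the project page linked in the posts for the actual Hub location.

```python
# Minimal sketch of loading a BigDocs split with the `datasets` library.
# The repository id below is a placeholder, not a confirmed identifier;
# see the project page linked in the post for the actual Hub location.
from datasets import load_dataset

dataset = load_dataset("ServiceNow/BigDocs-7.5M", split="train", streaming=True)  # hypothetical repo id
for example in dataset.take(3):
    print(sorted(example.keys()))
```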
We introduce LLM2Vec, a simple approach to transform any decoder-only LLM into a text encoder. We achieve SOTA performance on MTEB in both the unsupervised and supervised categories (among models trained only on publicly available data). 1/N Paper: https://t.co/1ARXK1SWwR
13
165
874
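LLM2Vec's full recipe (enabling bidirectional attention, masked next-token prediction, and contrastive training) is described in the paper. Purely as orientation, the sketch below shows the most basic ingredient, pooling a decoder-only LM's hidden states into a sentence embedding; it is not the LLM2Vec method itself, and the model name is a small placeholder.

```python
# Minimal sketch of pooling a decoder-only LM's hidden states into a sentence
# embedding. LLM2Vec additionally enables bidirectional attention and applies
# masked next-token prediction plus contrastive training; none of that is shown
# here, and the model name is just a small placeholder.
import torch
from transformers import AutoModel, AutoTokenizer

model_name = "gpt2"  # placeholder; LLM2Vec targets larger decoder-only LLMs
tokenizer = AutoTokenizer.from_pretrained(model_name)
tokenizer.pad_token = tokenizer.eos_token
model = AutoModel.from_pretrained(model_name).eval()

def embed(texts):
    batch = tokenizer(texts, return_tensors="pt", padding=True, truncation=True)
    with torch.no_grad():
        hidden = model(**batch).last_hidden_state   # (batch, seq, dim)
    mask = batch["attention_mask"].unsqueeze(-1)    # ignore padding positions
    return (hidden * mask).sum(1) / mask.sum(1)     # mean pooling over tokens

print(embed(["model merging", "layerwise quantization"]).shape)
```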
Excited to share our new work: CurryDPO (1/2)
• Systematically curates multiple preference pairs and trains on them in a curriculum learning setup with the DPO framework
• Achieves notable performance gains over the vanilla DPO method on MT-Bench, Vicuna, WizardLM, and UltraFeedback
1
12
19
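CurryDPO builds a curriculum over multiple preference pairs and trains them with the standard DPO objective. Below is a minimal sketch of that objective applied to pairs sorted from easy to hard; the per-pair difficulty score and log-probability values are placeholders, since the paper's exact curation and ordering criteria are not reproduced here.

```python
# Minimal sketch of a DPO loss applied to preference pairs in a curriculum
# (easier pairs first). The difficulty scores and log-probabilities are
# placeholders; CurryDPO's actual curation and ordering are in the paper.
import torch
import torch.nn.functional as F

def dpo_loss(policy_chosen_logps, policy_rejected_logps,
             ref_chosen_logps, ref_rejected_logps, beta=0.1):
    """Standard DPO objective: -log sigmoid(beta * (policy margin - reference margin))."""
    policy_margin = policy_chosen_logps - policy_rejected_logps
    ref_margin = ref_chosen_logps - ref_rejected_logps
    return -F.logsigmoid(beta * (policy_margin - ref_margin)).mean()

# Hypothetical preference pairs, each with a difficulty score and
# (policy_chosen, policy_rejected, ref_chosen, ref_rejected) log-probs.
pairs = [
    {"difficulty": 0.9, "logps": torch.tensor([-4.0, -9.0, -5.0, -8.0])},
    {"difficulty": 0.2, "logps": torch.tensor([-3.0, -7.0, -4.0, -6.5])},
]
for pair in sorted(pairs, key=lambda p: p["difficulty"]):  # curriculum: easy first
    loss = dpo_loss(*pair["logps"])
    print(float(loss))
```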