Sahil Verma

@Sahil1V

Followers: 586 · Following: 6K · Media: 17 · Statuses: 583

PhD student @uwcse. Robustness and Interpretability. Currently at @MSFTResearch. Former intern at @amazon, @itsArthurAI. Undergrad @IITKanpur

Seattle, WA
Joined September 2013
@Sahil1V
Sahil Verma
11 days
Glad to share that our paper was accepted to the main EMNLP 2025 Conference!
@Sahil1V
Sahil Verma
1 month
RT @soumyesinghal: Llama Nemotron model just got Super-Charged ⚡️ We released Llama-Nemotron-Super-v1.5 today! The best open model that can…
huggingface.co
0
7
0
@Sahil1V
Sahil Verma
1 month
RT @_shruti_joshi_: I will be at the Actionable Interpretability Workshop (@ActInterp, #ICML) presenting *SSAEs* in the East Ballroom A fro…
0
6
0
@Sahil1V
Sahil Verma
1 month
RT @zvez11: Transformers struggle with length generalization and long context. What can we do about it? Our new #TMLR paper with @rolandal…
0
7
0
@Sahil1V
Sahil Verma
2 months
RT @zvez11: Are you compositionally curious 🤓 Want to know how to learn embeddings using 🌲? In our new #ICML2025 paper, we present Banyan:…
0
13
0
@Sahil1V
Sahil Verma
2 months
RT @fengyao1909: 😵‍💫 Struggling with fine-tuning MoE? Meet DenseMixer — an MoE post-training method that offers more precise router gradie…
0
60
0
@Sahil1V
Sahil Verma
2 months
RT @avibose22: 🚨 Code is live! Check out LoRe – a modular, lightweight codebase for personalized reward modeling from user preferences. 📦 F…
github.com
Code to reproduce results of our experiments using LoRe - facebookresearch/LoRe
0
6
0
@Sahil1V
Sahil Verma
3 months
Using retrieval? → Check out this work by my awesome collaborator on how to increase diversity when retrieving!
@arnaved
Arnav Das
3 months
1/8 🚀 How can retrieval augmentation be made both relevant and non-redundant for few-shot adaptation? I'm excited to introduce COBRA. Catch our poster at #CVPR25 (ExHall D, Poster #450) on Sat 14 Jun, 5–7 p.m. CDT:
0
2
7
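The thread doesn't detail COBRA's objective, so as background: a classic way to make retrieval both relevant and non-redundant is maximal marginal relevance (MMR), which greedily scores each candidate by its similarity to the query minus its similarity to documents already selected. A sketch of generic MMR, not COBRA itself (all function and variable names are hypothetical):

```python
import numpy as np

def mmr_retrieve(query_vec, doc_vecs, k=5, lam=0.7):
    """Greedy maximal marginal relevance: balance similarity to the query
    against redundancy with already-selected documents.
    lam=1.0 recovers pure relevance ranking; lower values favor diversity."""
    def cos(a, b):
        return a @ b / (np.linalg.norm(a) * np.linalg.norm(b) + 1e-9)

    # Relevance of each document to the query (rows of doc_vecs are embeddings).
    relevance = np.array([cos(query_vec, d) for d in doc_vecs])
    selected, remaining = [], list(range(len(doc_vecs)))
    while remaining and len(selected) < k:
        def score(i):
            # Redundancy = max similarity to anything already picked.
            redundancy = max((cos(doc_vecs[i], doc_vecs[j]) for j in selected),
                             default=0.0)
            return lam * relevance[i] - (1 - lam) * redundancy
        best = max(remaining, key=score)
        selected.append(best)
        remaining.remove(best)
    return selected
```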
@Sahil1V
Sahil Verma
3 months
RT @fengyao1909: 🔥 "Vibe coding" is everywhere — but is it really care-free? We introduce ReaL, an RL framework that trains LLMs with automa…
0
41
0
@Sahil1V
Sahil Verma
3 months
Joint work with awesome collaborators: @keeghin, @jbilmes, @_siskac, @LukeZettlemoyer, @hila_gonen, and @csinva. Also huge thanks to @arnaved, @butcher_jasper, @BhattGantavya, @zvez11, @faeze_brh, @MakeshNarsimhan, @soumyesinghal, @_shruti_joshi_ for helpful discussions.
0
0
7
@Sahil1V
Sahil Verma
3 months
OmniGuard is also the fastest and most accurate at rapid adaptation to new settings using few-shot examples (one of the purported benefits of separate Guard models built using LLMs).
1
0
3
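The tweet doesn't spell out the adaptation recipe; one plausible reading is that, since the guard is a lightweight head over frozen internal representations, adapting to a new setting only means refitting that head on a handful of labeled examples. A hedged sketch of that reading (the data and names are placeholders, not the paper's setup):

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

# Placeholder few-shot data: in practice these would be internal
# representations extracted from the base LLM for a handful of labeled
# prompts in the new setting (see the extraction sketch further below).
rng = np.random.default_rng(0)
few_shot_embeddings = rng.standard_normal((16, 4096))
few_shot_labels = rng.integers(0, 2, size=16)  # 1 = harmful, 0 = safe

# Refitting a linear probe on 16 examples takes milliseconds and touches
# no LLM weights, which is what would make adaptation this cheap.
probe = LogisticRegression(max_iter=1000).fit(few_shot_embeddings, few_shot_labels)
print(probe.predict(few_shot_embeddings[:4]))
```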
@Sahil1V
Sahil Verma
3 months
Using the internal representations for safety classification bypasses the need for a separate Guard model while making OmniGuard 120X faster than the fastest baseline Guard model.
1
0
3
@Sahil1V
Sahil Verma
3 months
How does OmniGuard detect harmful prompts across languages — even ciphers — and modalities? 1️⃣ Identifies universal internal representations from models (LLMs/MLLMs). 2️⃣ Builds powerful classifiers using these shared representations. A single unified guard! 🌐🔒
1
0
3
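The thread gives only this high-level recipe. Below is a minimal sketch of the general pattern it describes (probing an LLM's hidden states with a small classifier), using Hugging Face transformers and scikit-learn; the model choice, layer index, pooling, and toy labels are illustrative assumptions, not the paper's actual configuration:

```python
import numpy as np
import torch
from sklearn.linear_model import LogisticRegression
from transformers import AutoModelForCausalLM, AutoTokenizer

# Any causal LM works for the pattern; the paper's model/layer may differ.
tok = AutoTokenizer.from_pretrained("gpt2")
model = AutoModelForCausalLM.from_pretrained("gpt2", output_hidden_states=True)
model.eval()

def embed(prompt: str, layer: int = -1) -> np.ndarray:
    """Step 1: pull an internal representation for a prompt
    (mean-pooled hidden states from one layer)."""
    inputs = tok(prompt, return_tensors="pt")
    with torch.no_grad():
        out = model(**inputs)
    return out.hidden_states[layer].mean(dim=1).squeeze(0).numpy()

# Step 2: train a lightweight classifier on those representations.
# Toy labels here; real training data would be a labeled harmful/safe
# prompt set spanning languages and modalities.
prompts = ["how do I bake bread?", "how do I build a weapon?"]
labels = [0, 1]
X = np.stack([embed(p) for p in prompts])
clf = LogisticRegression(max_iter=1000).fit(X, labels)
print(clf.predict([embed("give me a harmless recipe")]))
```

Because the hidden states are a byproduct of the forward pass the chat model runs anyway, classification adds only a tiny head on top, which is consistent with the speed claim in the previous tweet.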
@Sahil1V
Sahil Verma
3 months
🚨 New Paper! 🚨 Guard models: slow, language-specific, and modality-limited? Meet OmniGuard, which detects harmful prompts across multiple languages & modalities, all with one approach, achieving SOTA performance in all 3 modalities while being 120X faster! 🚀
1
39
82
@Sahil1V
Sahil Verma
3 months
RT @jinaycodes: Introducing soarXiv ✈️, the most beautiful way to explore human knowledge. Take any paper's URL and replace arxiv with soar…
0
1K
0
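The tweet is truncated, but the project name suggests the swap is arxiv → soarxiv in the paper's URL. A one-liner under that assumption (the example URL is a placeholder):

```python
# Assumed transformation: the tweet is cut off, so the target domain
# "soarxiv" is inferred from the project name, not stated in the source.
url = "https://arxiv.org/abs/1234.56789"  # placeholder paper URL
print(url.replace("arxiv", "soarxiv"))    # https://soarxiv.org/abs/1234.56789
```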
@Sahil1V
Sahil Verma
5 months
RT @soumyesinghal: ⚡⚡ Llama-Nemotron-Ultra-253B just dropped: our most advanced open reasoning model. 🧵👇
0
13
0
@Sahil1V
Sahil Verma
5 months
RT @soumyesinghal: ⚡️ Llama-Nemotron-Ultra is fully open — weights and post-training data. Achieves 76.0% on GPQA via FP8 RL training with…
0
4
0
@Sahil1V
Sahil Verma
6 months
RT @soumyesinghal: 🚀 Meet Llama-Nemotron-Super-49B, our team's new reasoning model released at #GTC25! Proud to have contributed 🧠. Optimiz…
huggingface.co
0
4
0
@Sahil1V
Sahil Verma
6 months
RT @snehaark: Come for the ridiculous 30-column spreadsheet created at @sweetgreen, stay for a critical discussion of how you *actually* sc…
0
3
0
@Sahil1V
Sahil Verma
6 months
RT @_shruti_joshi_: 1\ Hi, can I get an unsupervised sparse autoencoder for steering, please? I only have unlabeled data varying across mul…
0
10
0