Nishant Subramani @nsubramani23 X Profile

Nishant Subramani

@nsubramani23

Followers

723

Following

6K

Media

15

Statuses

749

PhD student @LTIatCMU working on model interpretability; student researcher @google // Prev: intern @msftresearch, predoc @allen_ai // @BVB supporter // he/him

Seattle, WA

Joined January 2012

Don't wanna be here? Send us removal request.

Nishant Subramani

@nsubramani23

2 years

Excited to announce that I'll be starting my PhD at @LTIatCMU this Fall working on generation and controlling LMs 🥳! Big thank you to my mentors + letter writers @mmitchell_ai, @_DougDowney and @mattthemathman and all my collaborators at @allen_ai for their invaluable support ❤️.

22

8

156

Nishant Subramani

@nsubramani23

1 month

RT @bearseascape: 🚨New #interpretability paper with @nsubramani23 :🕵️Model Internal Sleuthing: Finding Lexical Identity and Inflectional Mo….

0

2

0

Nishant Subramani

@nsubramani23

1 month

🚨 Check out our new #interpretability paper: 🕵🏽 Model Internal Sleuthing led by the amazing @bearseascape who is an undergrad at @SCSatCMU @LTIatCMU!.

Michael Li

@bearseascape

1 month

🚨New #interpretability paper with @nsubramani23 :🕵️Model Internal Sleuthing: Finding Lexical Identity and Inflectional Morphology in Modern Language Models

0

5

28

Nishant Subramani

@nsubramani23

1 month

Excited to announce that I started at @googlecloud as a student researcher last month working with @hamidpalangi on actionable #interpretability 🔍 to build better tool using #agents ⚒️🤖.

2

30

Nishant Subramani

@nsubramani23

2 months

Presenting this today at the poster session at #NAACL2025!. Come chat about interpretability, trustworthiness, and tool-using agents!. 🗓️ - Thursday May 1st (today).📍 - Hall 3.🕑 - 200-330pm.

Nishant Subramani

@nsubramani23

2 months

🚀 Excited to share a new interp+agents paper: 🐭🐱 MICE for CATs: Model-Internal Confidence Estimation for Calibrating Agents with Tools appearing at #NAACL2025. This was work done @Microsoft last summer with @adveisner @justinsvegliato @ben_vandurme @ysu_nlp @sammthomson . 1/🧵

0

2

22

Nishant Subramani

@nsubramani23

2 months

At #NAACL2025 🌵till Sunday! Love to chat about interpretability, understanding model internals, and finding vegan food 🥬.

Nishant Subramani

@nsubramani23

2 months

🚀 Excited to share a new interp+agents paper: 🐭🐱 MICE for CATs: Model-Internal Confidence Estimation for Calibrating Agents with Tools appearing at #NAACL2025. This was work done @Microsoft last summer with @adveisner @justinsvegliato @ben_vandurme @ysu_nlp @sammthomson . 1/🧵

0

2

15

Nishant Subramani

@nsubramani23

2 months

Come to our poster in Albuquerque on Thursday 2-330pm in the interpretability & analysis section! . Paper: Code (coming soon): 🧵/🧵.

0

Nishant Subramani

@nsubramani23

2 months

MICE 🐭: .🎯 - significantly beats baselines on expected tool-calling utility, especially in high risk scenarios .✅ - matches expected calibration error of baselines .✅ - is sample efficient .✅ - generalizes zeroshot to unseen tools . 5/🧵

1

0

Nishant Subramani

@nsubramani23

2 months

Calibration is not sufficient: both an oracle and a model that just predicts the base rate are perfectly calibrated🤦🏽‍♂️. We develop a new metric expected tool-calling utility 🛠️to measure the utility of deciding whether or not to execute a tool call via a confidence score! . 4/🧵.

1

0

Nishant Subramani

@nsubramani23

2 months

We propose 🐭 MICE to better assess confidence when calling tools: .1️⃣ decode from each intermediate layer of an LM .2️⃣ compute similarity scores between each layer’s generation and the final output. 3️⃣ train a probabilistic classifier on these features . 3/🧵

1

0

Nishant Subramani

@nsubramani23

2 months

1️⃣ Tool-using agents need to be useful and safe as they take actions in the world .2️⃣ Language models are poorly calibrated .🤔 Can we use model internals to better calibrate language models to make tool-using agents safer and more useful? . 2/🧵.

2

0

1

Nishant Subramani

@nsubramani23

2 months

🚀 Excited to share a new interp+agents paper: 🐭🐱 MICE for CATs: Model-Internal Confidence Estimation for Calibrating Agents with Tools appearing at #NAACL2025. This was work done @Microsoft last summer with @adveisner @justinsvegliato @ben_vandurme @ysu_nlp @sammthomson . 1/🧵

2

11

63

Nishant Subramani

@nsubramani23

8 months

RT @claranahhh: Building/customizing your own LLM? You'll want to curate training data for it, but how do you know what makes the data good….

0

35

0

Nishant Subramani

@nsubramani23

8 months

RT @apoorvkh: Wondering how long it takes to train a 1B-param LM from scratch on your GPUs? 🧵. See our paper to learn about the current sta….

0

98

0

Nishant Subramani

@nsubramani23

10 months

RT @TEKnologyy: 🚀 Excited to announce our latest paper, "Towards Fair RAG: On the Impact of Fair Ranking in Retrieval-Augmented Generation"….

0

21

0

Nishant Subramani

@nsubramani23

11 months

RT @JesseDodge: Congrats to our team for winning two paper awards at #ACL2024!. OLMo won the Best Theme Paper award, and Dolma won a Best R….

0

42

0

Nishant Subramani

@nsubramani23

1 year

Presenting our paper "Evaluating Personal Information Parroting in Language Models" joint work with @GhateKshitish @MonaDiab77 from @LTIatCMU at the @trustnlp poster session now-330p today. Poster 64! #NAACL2024

0

4

40

Nishant Subramani

@nsubramani23

1 year

Great talk by @abeirami at @trustnlp on algorithmic LM alignment with a recipe to approach alignment problems #NAACL2024

0

12

Nishant Subramani

@nsubramani23

1 year

Great talk on linear relational concepts in LLMs on work by @chanindav, Anthony Hunter, and @oanacamb from @ucl_nlp #NAACL2024. Paper:

0

2

14

Nishant Subramani

@nsubramani23

1 year

Cool interpretability poster from @jack_merullo_ @Brown_NLP on showing that LMs do word2vsc style arithmetic in their latent spaces for analogy type tasks #NAACL2024

0

7

46

Nishant Subramani

@nsubramani23

1 year

In Mexico City for #NAACL2024 through next Sunday! Hoping to chat with folks thinking about model interpretability research 🔍, explore vegan food 🌮, learn about the indigenous cultures 🐍🐆🦅 of the region, and explore the history of modern Mexico 🇲🇽 - DM me!.

0

1

25