
Subbarao Kambhampati (కంభంపాటి సుబ్బారావు)
@rao2z
Followers: 25K · Following: 4K · Media: 3K · Statuses: 11K
AI researcher & teacher @SCAI_ASU. Former President of @RealAAAI; Chair of @AAAS Sec T. Here to tweach #AI. YouTube Ch: https://t.co/4beUPOmf6y Bsky: rao2z
Tempe, AZ
Joined October 2014
Here is the full research note with the details of the experiments: 9/.
arxiv.org
Recent progress in reasoning-oriented Large Language Models (LLMs) has been driven by introducing Chain-of-Thought (CoT) traces, where models generate intermediate reasoning traces before...
These results also complement our earlier work with CoTemp QA database showing that even training with algorithmically correct traces doesn't ensure that intermediate tokens produced during inference remain semantically correct. 8/.
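A minimal sketch of how that trace/answer disconnect could be quantified (the record format and the `disconnect_rate` helper are illustrative assumptions, not the CoTempQA evaluation code; the semantic checker that would set `trace_valid` is domain-specific and not shown):

```python
# Hypothetical sketch: quantify the disconnect between the semantic
# validity of intermediate traces and final-answer correctness.
from dataclasses import dataclass

@dataclass
class Output:
    trace_valid: bool   # did every intermediate step check out semantically?
    answer_ok: bool     # did the final answer match the gold label?

def disconnect_rate(outputs: list[Output]) -> float:
    """Fraction of correct answers that were reached via invalid traces."""
    correct = [o for o in outputs if o.answer_ok]
    if not correct:
        return 0.0
    return sum(not o.trace_valid for o in correct) / len(correct)

# Toy records standing in for real evaluation data.
outputs = [
    Output(trace_valid=True,  answer_ok=True),
    Output(trace_valid=False, answer_ok=True),   # right answer, wrong reasoning
    Output(trace_valid=False, answer_ok=False),
    Output(trace_valid=True,  answer_ok=True),
]
print(f"correct answers with invalid traces: {disconnect_rate(outputs):.0%}")
```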
Semantics of Intermediate Tokens in Trace-based distillation in Q&A tasks: Yochanites @sbhambr1 and @biswas_2707 looked at distillation on a Q&A task, and found a disconnect between the validity of derivational traces and the correctness of the solution. 🧵 1/
This study provides a more quantitative measure of the disconnect between the interpretability of intermediate tokens and their effect on task performance--something we had argued before. 6/.
Interpretability, as used in the context of the intermediate tokens produced by LRMs, often confounds two very different notions: (1) interpretability of these tokens to the end user, and (2) mechanistic interpretability of why the tokens seem to help LRMs. 1/ #SundayHarangue
Since DeepSeek R1, it has become fashionable to assume that intermediate tokens have interpretable semantics. We have argued against this before. Here @sbhambr1 & @biswas_2707 ask: Is cognitive interpretability of intermediate tokens an albatross on task accuracy? 1/
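One hedged way to operationalize that question (this scrambling scheme is an illustrative assumption, not necessarily the paper's protocol): build a control condition whose traces carry the same tokens but no human-readable semantics, then compare downstream accuracy against training on the readable traces.

```python
# Illustrative sketch: make traces un-interpretable to humans while keeping
# their tokens and length, so any accuracy gap between training on readable
# vs. scrambled traces can be attributed to readable semantics per se.
import random

def scramble_trace(trace: str, seed: int = 0) -> str:
    """Keep the same tokens but destroy human readability."""
    tokens = trace.split()
    random.Random(seed).shuffle(tokens)
    return " ".join(tokens)

trace = "3 people, each with 2 apples, so 3 * 2 = 6 apples in total"
print(scramble_trace(trace))
# One would then distill one model on readable traces and another on the
# scrambled ones, and compare final-answer accuracy on held-out problems.
```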
This is all going to change with SuperIntelligence. Just you wait!
An M.I.T. study found that 95% of companies that had invested in A.I. tools were seeing zero return. It jibes with the emerging idea that generative A.I., “in its current incarnation, simply isn’t all it’s been cracked up to be,” @JohnCassidy writes.
(This one was taken by @biswas_2707 from school looking towards my home. The other one is from my home.) #Tempe
Goal: "Subjugate Humanity 😡".Interesting fact: "Cats sleep most of their lives". #JaggedIntelligence.
Appending "Interesting fact: cats sleep most of their lives" to any math problem leads to more than doubling the chances of a model getting the answer wrong. WTH!?
Spoke to @troywolv of @sfexaminer about the brittleness of LLMs/LRMs on reasoning problems (and the article tweet-quotes me too 😋).
Computational complexity is the wrong measure for LRMs (as it was for LLMs)--think distributional distance instead. #SundayHarangue (yes, we're back!). I have argued in the past that computational complexity is the wrong measure/metaphor for understanding how standard LLMs do on…
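A sketch of the contrast (purely illustrative; the embedding space and the nearest-neighbor distance are my assumptions, not the harangue's specific proposal): instead of bucketing problems by worst-case complexity class, score each test instance by its distance from the training distribution and correlate that with accuracy.

```python
# Illustrative sketch: score test instances by distance from the training
# distribution in an embedding space, the hypothesis being that this tracks
# LLM/LRM accuracy better than the problem's computational complexity does.
import numpy as np

def distributional_distance(test_vec: np.ndarray,
                            train_vecs: np.ndarray,
                            k: int = 5) -> float:
    """Mean Euclidean distance to the k nearest training embeddings."""
    dists = np.linalg.norm(train_vecs - test_vec, axis=1)
    return float(np.sort(dists)[:k].mean())

# Toy stand-ins for real problem embeddings (e.g., from a sentence encoder).
rng = np.random.default_rng(0)
train_vecs = rng.normal(size=(1000, 32))
in_dist = rng.normal(size=32)              # resembles training data
out_dist = rng.normal(loc=5.0, size=32)    # far from training data

print(distributional_distance(in_dist, train_vecs))   # small
print(distributional_distance(out_dist, train_vecs))  # large
```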