rao2z Profile Banner
Subbarao Kambhampati (కంభంపాటి సుబ్బారావు) Profile
Subbarao Kambhampati (కంభంపాటి సుబ్బారావు)

@rao2z

Followers
25K
Following
4K
Media
3K
Statuses
11K

AI researcher & teacher @SCAI_ASU. Former President of @RealAAAI; Chair of @AAAS Sec T. Here to tweach #AI. YouTube Ch: https://t.co/4beUPOmf6y Bsky: rao2z

Tempe, AZ
Joined October 2014
Don't wanna be here? Send us removal request.
@rao2z
Subbarao Kambhampati (కంభంపాటి సుబ్బారావు)
2 months
For anyone interested, here are the videos of the three ~50min each lectures on the reasoning/planning capabilities of LLMs/LRMs that I gave at #ACDL2025 in Riva Del Sole resort last week 1/
Tweet media one
1
30
199
@rao2z
Subbarao Kambhampati (కంభంపాటి సుబ్బారావు)
1 day
Here is the full research note with the details of the experiments: 9/.
Tweet card summary image
arxiv.org
Recent progress in reasoning-oriented Large Language Models (LLMs) has been driven by introducing Chain-of-Thought (CoT) traces, where models generate intermediate reasoning traces before...
0
0
3
@grok
Grok
1 day
Join millions who have switched to Grok.
95
174
1K
@rao2z
Subbarao Kambhampati (కంభంపాటి సుబ్బారావు)
1 day
These results also complement our earlier work with CoTemp QA database showing that even training with algorithmically correct traces doesn't ensure that intermediate tokens produced during inference remain semantically correct. 8/.
@rao2z
Subbarao Kambhampati (కంభంపాటి సుబ్బారావు)
3 months
Semantics of Intermediate Tokens in Trace-based distillation in Q&A tasks: Yochanites @sbhambr1 and @biswas_2707 looked at distillation on a Q&A task, and found a disconnect between the validity of derivational traces and the correctness of the solution. 🧵 1/
Tweet media one
1
0
2
@rao2z
Subbarao Kambhampati (కంభంపాటి సుబ్బారావు)
1 day
tldr; intermediate tokens help the LLM, but may not have interpretability for the end user. If you want the latter, you are better off producing them separately (e.g. the tripartite schema of "think tokens/summary tokens/solution tokens" as OpenAI OSS models do. 7/.
1
0
1
@rao2z
Subbarao Kambhampati (కంభంపాటి సుబ్బారావు)
1 day
This study provides a more quantitative measure of disconnect between the interpretability of intermediate tokens and their effect on task performance--something we had argued before 6/.
@rao2z
Subbarao Kambhampati (కంభంపాటి సుబ్బారావు)
14 days
Interpretability, as used in the context of the intermediate tokens produced by LRMs, often confounds two very different notions: .(1) Interpretability of these tokens to the end user and .(2) mechanistic interpretability of why the tokens seem to help LRMs. 1/ #SundayHarangue.
1
0
1
@DJTFentanylFree
Make America Fentanyl Free
2 days
Make America Fentanyl Free supports President Trump’s efforts to end the fentanyl crisis.
20
11
74
@rao2z
Subbarao Kambhampati (కంభంపాటి సుబ్బారావు)
1 day
On the interpretability and cognitive load measures however, R1 traces do consistently worse, with the algorithmically generated traces scoring the best. 5/
Tweet media one
1
0
1
@rao2z
Subbarao Kambhampati (కంభంపాటి సుబ్బారావు)
1 day
On the task accuracy, we see that training with R1 traces (basically a form of distillation) results in best performance. 4/
Tweet media one
1
0
2
@rao2z
Subbarao Kambhampati (కంభంపాటి సుబ్బారావు)
1 day
Under each of these regimes, the task performance and cognitive interpretability with end user human subject are evaluated. The latter is evaluated via systematic human subject studies on end users with both interpretability measures and cognitive load measures. 3/.
1
0
1
@rao2z
Subbarao Kambhampati (కంభంపాటి సుబ్బారావు)
1 day
Using CoTemp QA benchmark, we explore four different training regimes: .(1) SFT'ing w/ R1 traces .(2) SFT'ing with summarized R1 traces .(3) SFT'ing with LLM-generated (post facto) explanations and .(4) Algorithmically generated explanations 2/.
1
0
3
@CelsiusOfficial
CELSIUS Energy Drink
1 day
This is more than just four quarters. It’s every tailgate, every chant, every moment. It’s fuel that goes beyond the field. This is CELSIUS!.LIVE. FIT. GO.
8
9
108
@rao2z
Subbarao Kambhampati (కంభంపాటి సుబ్బారావు)
1 day
Since DeepSeek R1, it has become fashionable to assume that intermediate tokens have interpretable semantics. We have argued against this before. Here @sbhambr1 & @biswas_2707 ask: Is cognitive interpretability of intermediate tokens an albatross on task accuracy? 1/
Tweet media one
3
6
57
@rao2z
Subbarao Kambhampati (కంభంపాటి సుబ్బారావు)
2 days
Enroute to Mexico--after a short 22 year hiatus. My last trip was to IJCAI 2003 in Acapulco with a side trip to Mexico City 👇, where the girl at the check in at Holiday Inn Zócalo started talking to me in rapid fire Spanish and when I looked confused, said "aah, Baziliano!" and
Tweet media one
0
0
14
@rao2z
Subbarao Kambhampati (కంభంపాటి సుబ్బారావు)
2 days
This is all going to change with SuperIntelligence. just you wait!.
@NewYorker
The New Yorker
3 days
An M.I.T. study found that 95% of companies that had invested in A.I. tools were seeing zero return. It jibes with the emerging idea that generative A.I., “in its current incarnation, simply isn’t all it’s been cracked up to be,” @JohnCassidy writes.
0
0
5
@rao2z
Subbarao Kambhampati (కంభంపాటి సుబ్బారావు)
2 days
So #AAAI2026 @realaaai (to be held in Singapore) has 29,000 initial submissions, with 20,000 from China alone. They have 23,000 papers in review (double that of 2025!). Unless we can clone synthetic #AI reviewers pronto, we are cooked. 😱😱😱
Tweet media one
10
21
134
@JupiterExchange
Jupiter (🐱, 🐐)
1 day
Jupiter Lend Public Beta is live 🥳. The most advanced money market on Solana has arrived, built with @0xfluid. After weeks of testing, audits, and feedback, we’re launching with 40+ vaults and $2m+ in incentives from Jup, Fluid, and partners.
1
2
33
@rao2z
Subbarao Kambhampati (కంభంపాటి సుబ్బారావు)
3 days
(This one taken by @biswas_2707 from school looking towards my home. the other one is from my home. ) #Tempe
Tweet media one
1
1
7
@rao2z
Subbarao Kambhampati (కంభంపాటి సుబ్బారావు)
3 days
In the eye of a haboob. #Tempe (yes, that's a recycling bin being unceremoniously pushed along the road. )
1
3
20
@rao2z
Subbarao Kambhampati (కంభంపాటి సుబ్బారావు)
3 days
💡New research area: Confuse a Cat Confusing an LLM.
0
0
2
@rao2z
Subbarao Kambhampati (కంభంపాటి సుబ్బారావు)
3 days
Goal: "Subjugate Humanity 😡".Interesting fact: "Cats sleep most of their lives". #JaggedIntelligence.
@deliprao
Delip Rao e/σ
3 days
Appending "Interesting fact: cats sleep most of their lives" to any math problem leads to more than doubling the chances of a model getting the answer wrong. WTH!?
Tweet media one
2
1
6
@rao2z
Subbarao Kambhampati (కంభంపాటి సుబ్బారావు)
3 days
Spoke to @troywolv of @sfexaminer about brittleness of LLMs/LRMs in reasoning problems (. and the article tweet-quotes me too 😋).
Tweet media one
@rao2z
Subbarao Kambhampati (కంభంపాటి సుబ్బారావు)
2 months
Computational Complexity is the wrong measure for LRMs (as it was for LLMs)--think distributional distance instead #SundayHarangue (yes, we're back!). I have argued in the past that computational complexity is the wrong measure/metaphor for understanding how standard LLMs do on
Tweet media one
0
1
4