
Siddhant Bhambri
@sbhambr1
Followers
89
Following
1K
Media
7
Statuses
75
Joined July 2019
RT @TmlrPub: Do Think Tags Really Help LLMs Plan? A Critical Evaluation of ReAct-Style Prompting. Siddhant Bhambri, Mudit Verma, Subbarao….
openreview.net
The reasoning abilities of Large Language Models (LLMs) remain a topic of considerable interest and debate. Among the original papers arguing for emergent reasoning abilities of LLMs, ReAct became...
0
2
0
RT @rao2z: Semantics of Intermediate Tokens in Trace-based distillation in Q&A tasks: Yochanites @sbhambr1 and @biswas_2707 looked at disti….
0
8
0
RT @rao2z: 📢 If you are #NeurIPS2024 OWA-2024 workshop (East Meeting Room 1-3), do check out two posters presented by Yochanites @karthikv7….
0
4
0
4/n Our experiments show how our framework can lead to a boost in sample efficiency for Reinforcement Learning! Joint work with @Amrita_Bh, @liuhuan and @rao2z, check out the paper for more details:
0
0
2
2/n We note that the performance of prompting off-the-shelf LLMs for decision-making tasks can be extremely brittle, even for popular prompting techniques such as ReAct! (see our other work that investigates these claims in detail: .
📢 ReAct popularized the "Think 🤔" magic by claiming to help LLMs plan by "synergizing reasoning and acting." @v_mudit & @sbhambr1 investigated the claims, and have a thing are two to say about the extreme brittleness of ReAct style prompting. 👉1/
1
0
1