Siddhant (Sid) Bhambri

88 posts

Siddhant (Sid) Bhambri

@sbhambr1

PhD @ Yochan Lab, ASU

Katılım Temmuz 2019

219 Takip Edilen120 Takipçiler

Siddhant (Sid) Bhambri@sbhambr1·1 May

🎉 ICML 2026 acceptance! Check out the detailed post below.

Subbarao Kambhampati (కంభంపాటి సుబ్బారావు)@rao2z

Our position paper arguing against the anthropomorphization of intermediate tokens has been accepted to #ICML2026! I am tickled pink as I reversed the roles and did the writing and rebutting myself with "advise" from my students.. 😎

English

318

Siddhant (Sid) Bhambri@sbhambr1·7 Nis

Paper accepted at #ACL2026! Check out the detailed post below:

Subbarao Kambhampati (కంభంపాటి సుబ్బారావు)@rao2z

#ACL2026 just accepted a paper lead by Yochanites @sbhambr1 & @biswas_2707 that shows, in a Q&A setting, that the intermediate tokens in LRMs (1) don't necessarily need to have user interpretable semantics and (2) distilling models with traces having semantics doesn't necessarily improve accuracy. 1/

English

685

Siddhant (Sid) Bhambri retweetledi

Upasana Biswas@biswas_2707·21 Oca

I will be presenting our paper at #AAAI 2026, co-authored with @PalodVardh12428, @sbhambr1, @rao2z Humans and AI 2 Session, Friday, Jan 23 |12:00–2:00 PM Hall 3, Poster Board 60 arxiv.org/pdf/2502.06976 Happy to talk more about multi-agent systems and human-AI collaboration!

English

3.9K

Siddhant (Sid) Bhambri retweetledi

Subbarao Kambhampati (కంభంపాటి సుబ్బారావు)@rao2z·21 Kas

🎉Dr! Siddhant Bhambri, @sbhambr1, the 33rd Yochanite PhD🥳 Committee: @liuhuan @DBertsekas @keviv9 @SCAI_ASU

Subbarao Kambhampati (కంభంపాటి సుబ్బారావు) tweet media

हिन्दी

3.2K

Siddhant (Sid) Bhambri retweetledi

Subbarao Kambhampati (కంభంపాటి సుబ్బారావు)@rao2z·21 Kas

ICYMI, here is Dr! @sbhambr1's PhD defense video from this morning.. 👉youtube.com/watch?v=mzAH3n…

YouTube

English

2.8K

Siddhant (Sid) Bhambri retweetledi

Subbarao Kambhampati (కంభంపాటి సుబ్బారావు)@rao2z·21 Kas

A doctor a day keeps the apple away..😋 Dr! @karthikv792 (11/19) & Dr! @sbhambr1 (11/20) @SCAI_ASU

English

2.9K

Siddhant (Sid) Bhambri@sbhambr1·12 Kas

💡 Are AI agents trained to solve tasks with humans in a team actually cooperating? 🔗Check out our recent work accepted at #AAAI2026 that dives deeper into this question: lnkd.in/gFAfGMwR

Subbarao Kambhampati (కంభంపాటి సుబ్బారావు)@rao2z

What if your cooperative AI agent is actively avoiding you? Despite significant interest in having human and AI agents teaming constructively to solve problems, most work in the area focuses on the bottom line task reward rather than any actual cooperation between the agents. In many cases, where the task can, in principle, be completed by either agent alone albeit with additional burden (i.e., the task doesn't require cooperation), task reward itself doesn't give any indication of whether there is any actual cooperation between the agents. In a paper to be presented at #AAAI2026, Yochanite @biswas_2707 (w/ @PalodVardh12428 and @sbhambr1) develop a novel metric to analyze inter-dependencies between human and AI agents, and use that measure to evaluate cooperation induced by several SOTA AI agents trained for cooperative tasks. We see that most SOTA AI agents that claim to be RL trained for "Zero-shot cooperation" actually don't induce much inter-dependence between the AI and human agents at all. This calls into question the prevalent approach of training AI agents on task reward, and hoping for cooperation to emerge as a side effect!

English

199

Siddhant (Sid) Bhambri@sbhambr1·22 Eki

💡 𝐈𝐬 𝐬𝐞𝐦𝐚𝐧𝐭𝐢𝐜 𝐜𝐨𝐫𝐫𝐞𝐜𝐭𝐧𝐞𝐬𝐬 𝐨𝐟 𝐂𝐡𝐚𝐢𝐧 𝐨𝐟 𝐓𝐡𝐨𝐮𝐠𝐡𝐭 𝐭𝐫𝐚𝐜𝐞𝐬 𝐭𝐡𝐞 𝐬𝐚𝐦𝐞 𝐚𝐬 𝐥𝐨𝐜𝐚𝐥 𝐜𝐨𝐡𝐞𝐫𝐞𝐧𝐜𝐞? 🖇️ Check out our recent work critically looking at how trace coherence is impacted by RLVR post-training: lnkd.in/gAYq_s2b

Subbarao Kambhampati (కంభంపాటి సుబ్బారావు)@rao2z

Our recent research efforts have questioned the narrative that the LRM intermediate tokens have semantics (see x.com/rao2z/status/1… ). Some may counter these with "..but I read the traces, and they do seem to make sense.." and claim RLVR post-training must be making the traces correct. We analyze this disconnect in terms of local coherence vs. global validity/correctness of the trace. 1/

English

614

Siddhant (Sid) Bhambri@sbhambr1·15 Eki

➡️ 𝘒𝘦𝘺 𝘱𝘢𝘵𝘩𝘸𝘢𝘺𝘴 𝘧𝘰𝘳 𝘥𝘦𝘴𝘪𝘨𝘯𝘪𝘯𝘨 𝘳𝘰𝘣𝘶𝘴𝘵 𝘢𝘯𝘥 𝘳𝘦𝘭𝘪𝘢𝘣𝘭𝘦, 𝘦𝘯𝘥 𝘶𝘴𝘦𝘳-𝘧𝘢𝘤𝘪𝘯𝘨 𝘈𝘐 𝘵𝘩𝘢𝘵 𝘣𝘢𝘭𝘢𝘯𝘤𝘦𝘴 𝘢𝘥𝘷𝘪𝘴𝘢𝘣𝘪𝘭𝘪𝘵𝘺 𝘢𝘯𝘥 𝘦𝘹𝘱𝘭𝘢𝘪𝘯𝘢𝘣𝘪𝘭𝘪𝘵𝘺. #AI #MachineLearning #LLMs #HumanAI

English

Siddhant (Sid) Bhambri@sbhambr1·15 Eki

➡️ 𝘞𝘩𝘢𝘵 𝘪𝘯𝘵𝘦𝘳𝘱𝘳𝘦𝘵𝘢𝘣𝘪𝘭𝘪𝘵𝘺 𝘢𝘯𝘥 𝘳𝘦𝘢𝘴𝘰𝘯𝘪𝘯𝘨 𝘵𝘳𝘢𝘤𝘦𝘴 𝘳𝘦𝘢𝘭𝘭𝘺 𝘮𝘦𝘢𝘯 𝘧𝘰𝘳 𝘦𝘯𝘥 𝘶𝘴𝘦𝘳𝘴 𝘴𝘦𝘦𝘬𝘪𝘯𝘨 𝘵𝘰 𝘵𝘳𝘶𝘴𝘵 𝘢𝘯𝘥 𝘶𝘯𝘥𝘦𝘳𝘴𝘵𝘢𝘯𝘥 𝘈𝘐 𝘴𝘺𝘴𝘵𝘦𝘮𝘴. (x.com/rao2z/status/1…) (x.com/rao2z/status/1…) (5/n)

Subbarao Kambhampati (కంభంపాటి సుబ్బారావు)@rao2z

Semantics of Intermediate Tokens in Trace-based distillation in Q&A tasks: Yochanites @sbhambr1 and @biswas_2707 looked at distillation on a Q&A task, and found a disconnect between the validity of derivational traces and the correctness of the solution.. 🧵 1/

English

108

Siddhant (Sid) Bhambri@sbhambr1·15 Eki

Recent talk at @allen_ai: "𝐑𝐨𝐥𝐞 𝐨𝐟 𝐋𝐚𝐫𝐠𝐞 𝐋𝐚𝐧𝐠𝐮𝐚𝐠𝐞 𝐌𝐨𝐝𝐞𝐥𝐬 𝐢𝐧 𝐇𝐮𝐦𝐚𝐧-𝐀𝐈 𝐈𝐧𝐭𝐞𝐫𝐚𝐜𝐭𝐢𝐨𝐧: 𝐀 𝐂𝐫𝐢𝐭𝐢𝐜𝐚𝐥 𝐀𝐩𝐩𝐫𝐚𝐢𝐬𝐚𝐥". Link:youtube.com/watch?v=rjZUBe… Thanks to @rao2z for guiding this research & to @dsweld for hosting me!🧵(1/n)

YouTube

English

Siddhant (Sid) Bhambri retweetledi

Subbarao Kambhampati (కంభంపాటి సుబ్బారావు)@rao2z·27 Ağu

Since DeepSeek R1, it has become fashionable to assume that intermediate tokens have interpretable semantics. We have argued against this before. Here @sbhambr1 & @biswas_2707 ask: Is cognitive interpretability of intermediate tokens an albatross on task accuracy? 1/

English

Siddhant (Sid) Bhambri retweetledi

Accepted papers at TMLR@TmlrPub·17 Tem

Do Think Tags Really Help LLMs Plan? A Critical Evaluation of ReAct-Style Prompting Siddhant Bhambri, Mudit Verma, Subbarao Kambhampati. Action editor: Li Li. openreview.net/forum?id=aFAMP… #reasoning #prompting #prompts

English

1.5K

Siddhant (Sid) Bhambri retweetledi

Subbarao Kambhampati (కంభంపాటి సుబ్బారావు)@rao2z·28 May

Anthropomorphization of intermediate tokens as reasoning/thinking traces isn't quite a harmless fad, and may be pushing LRM research into questionable directions.. So we decided to put together a more complete argument.. 👇🧵 1/

English

484

101.7K

Siddhant (Sid) Bhambri retweetledi

Subbarao Kambhampati (కంభంపాటి సుబ్బారావు)@rao2z·25 May

English

10.5K

Siddhant (Sid) Bhambri retweetledi

Subbarao Kambhampati (కంభంపాటి సుబ్బారావు)@rao2z·13 May

Delighted to share that @sbhambr1 & @v_mudit's critical evaluation and refutation of the reasoning claims of ReACT has been accepted to TMLR @TmlrOrg 👉openreview.net/forum?id=aFAMP…

Subbarao Kambhampati (కంభంపాటి సుబ్బారావు)@rao2z

📢 ReAct popularized the "Think 🤔" magic by claiming to help LLMs plan by "synergizing reasoning and acting." @v_mudit & @sbhambr1 investigated the claims, and have a thing are two to say about the extreme brittleness of ReAct style prompting. 👉arxiv.org/abs/2405.13966 1/

English

2.1K

Siddhant (Sid) Bhambri retweetledi

Subbarao Kambhampati (కంభంపాటి సుబ్బారావు)@rao2z·15 Ara

📢 If you are #NeurIPS2024 OWA-2024 workshop (East Meeting Room 1-3), do check out two posters presented by Yochanites @karthikv792, @kayastechly & @sbhambr1 👉 LLMs can't reason; can LRMs? (Evaluating and improving 🍓 o1 on planning & scheduling ) 👉 LLMs to reward shape RL search

English

1.8K

Keşfet

@PalodVardh12428 @rao2z @liuhuan @DBertsekas @keviv9 @SCAI_ASU @karthikv792 @allen_ai