Tanmoy Chakraborty

1.5K posts

Tanmoy Chakraborty

@Tanmoy_Chak

Chair Prof in AI, Associate Prof @iitdelhi; ACM Distinguished Speaker; Lab @lcs2lab; Previously @IIITDelhi @UofMaryland @iitkgp; #NLP #LLMs

New Delhi, India Katılım Ekim 2014

817 Takip Edilen2.5K Takipçiler

Sabitlenmiş Tweet

Tanmoy Chakraborty@Tanmoy_Chak·19 Ara

🌟 𝐀 𝐍𝐞𝐰 T𝐞𝐱𝐭𝐛𝐨𝐨𝐤 -- 𝐈𝐧𝐭𝐫𝐨𝐝𝐮𝐜𝐭𝐢𝐨𝐧 𝐭𝐨 𝐋𝐚𝐫𝐠𝐞 𝐋𝐚𝐧𝐠𝐮𝐚𝐠𝐞 𝐌𝐨𝐝𝐞𝐥𝐬 🌟 I am excited to share the release of my new textbook, 𝘐𝘯𝘵𝘳𝘰𝘥𝘶𝘤𝘵𝘪𝘰𝘯 𝘵𝘰 𝘓𝘢𝘳𝘨𝘦 𝘓𝘢𝘯𝘨𝘶𝘢𝘨𝘦 𝘔𝘰𝘥𝘦𝘭𝘴 (#LLMs) -- Perhaps the first textbook on LLMs. Target Audience: 👉 Students/beginners, Looking for a structured starting point to learn LLMs 👉 Teachers, planning to offer a course on LLMs 👉 Industry professional, seeking to deepen their understanding of LLMs Explore the Book: 🔗 Book Website: tanmoychak.com/llmbook/ 📑 Table of Contents: tanmoychak.com/llmbook/toc.pdf 🛒 Available on Amazon: amazon.in/dp/936386474X/ Enhance Your Learning Experience: 👉 Slides & Lecture Videos: Chapter-wise resources -- lcs2-iitd.github.io/ELL881-AIL821-… 👉 Exercises & Solutions: Practice with detailed chapter exercises (solutions available on request). 👉 Upcoming @nptel_official Course: Starting January 2025! Preview here: onlinecourses.nptel.ac.in/noc25_cs45/pre… Book Endorsement: 📖 Foreword by Prof. Tim Baldwin @eltimster 👏 Endorsements from Prof. Iryna Gurevych @IGurevych and Prof. Pushpak Bhattacharyya #LLMs #Textbook @iitdelhi @WileyIndiaPL @lcs2lab

English

9.9K

Tanmoy Chakraborty@Tanmoy_Chak·1d

This is huge. Our PEFT method, MonteCLoRA, has been merged with @huggingface. Do use it. Believe me. It is much much better than LoRA in terms of efficiency and stability.

LCS2 Lab@lcs2lab

Excited to share that our work, #MonteCLoRA, has officially been merged into the #HuggingFace PEFT library! 🥳 github.com/huggingface/pe… Build #peft from source to use it right away! 🚀 📜 Paper: arxiv.org/abs/2411.04358 🤗 Docs: #monteclora-monte-carlo-low-rank-adaptation" target="_blank" rel="nofollow noopener">huggingface.co/docs/peft/main…

English

2.3K

Tanmoy Chakraborty@Tanmoy_Chak·6 May

Time to celebrate acceptance of two papers in 𝐈𝐂𝐌𝐋'26, including one 𝐒𝐩𝐨𝐭𝐥𝐢𝐠𝐡𝐭 (top 2.2%) 🎉 👉 Polaris: Coupled Orbital Polar Embeddings for Hierarchical Concept Learning 📔 arxiv.org/pdf/2605.00265 ✨ Introduces Polaris -- a hyperspherical embedding framework that decouples semantics from hierarchy using orbital geometry, uncertainty-aware learning, and efficient retrieval. 👉 Linguistic Properties and Model Scale in Brain Encoding: From Small to Compressed Language Models (#𝐒𝐩𝐨𝐭𝐥𝐢𝐠𝐡𝐭) 📔 arxiv.org/pdf/2602.07547 ✨ Shows that compact ~3B models can match much larger LLMs in brain alignment, with robustness even under compression. Grateful to all collaborators and students for the amazing work! 🚀 @icmlconf @lcs2lab @iitdelhi #ICML26

English

5.6K

Tanmoy Chakraborty retweetledi

LCS2 Lab@lcs2lab·22 Nis

🇧🇷 #LCS2 goes to #Rio 🇧🇷 Presenting our paper where we move beyond memoryless personalization → modeling user preferences as action-conditioned geometric walks with memory for better, user-aligned summaries. See you at #Riocentro 🚀 #Personalization #RepresentationLearning

LCS2 Lab@lcs2lab

Happy to announce that our paper has been accepted to #ICLR2026! 🎉 📜 Beyond Markovian Drifts: Action-Biased Geometric Walks with Memory for Personalized Summarization 👥 Parthiv Chatterjee, Asish Batha, Tashvi Patel, @sourish_rygbee, @Tanmoy_Chak Congratulations to all authors!

English

434

Tanmoy Chakraborty retweetledi

Dhruv Sahnan@dhruv_sahnan·20 Nis

🚨 CLEF 2026 - CheckThat! Lab We are excited to announce that we are organising a task at this year’s CheckThat! Lab, which extends the fact-checking pipeline with a new task focused on an important step in professional fact-checking: generating full fact-checking articles 📰

English

241

Tanmoy Chakraborty retweetledi

Lossfunk@lossfunk·15 Nis

🚨 Submissions are now open for the Conference for AI Scientists (CAISc) 2026, co-organised by Lossfunk and @bitspilaniindia. Submit to probe what happens when AI systems drive scientific discovery. Submissions are open until May 15! Here is everything you need to know 🧵

English

103

25.9K

Tanmoy Chakraborty retweetledi

Dhruv Kumar@gargdhruv36·15 Nis

An AI system MUST be the primary author: that's the only rule! Thrilled to be co-organizing this pioneering conference CAISc 2026! Send in your AI-driven research by May 15th.. @bitspilaniindia @ramgopal_rao @murari_ai @Tanmoy_Chak @palashiitkgp

Lossfunk@lossfunk

English

1.2K

Tanmoy Chakraborty@Tanmoy_Chak·7 Nis

Six papers from our lab have been accepted for publication in #ACL2026. The papers cover topics including Interpretability, empowering small VLMs with advanced tool calling, LLM personalisation, and different benchmarking. #nlproc @aclmeeting

English

3.8K

Tanmoy Chakraborty@Tanmoy_Chak·7 Nis

@manojbalaji1 My paper was not rejected becasue of this :)

English

876

Manoj Balaji@manojbalaji1·7 Nis

I completely understand your frustration, and honestly, the reasoning you were given is really difficult to accept. Sometimes in academia, we just run into unfortunate situations that feel entirely out of our control. I can definitely relate to what you are going through. Last year, my paper was rejected by EMNLP simply because one of my co-authors was flagged as an unresponsive reviewer. Unfortunately, the rest of the team only found out at the final decision stage. I actually reached out to the EMNLP 2025 organizers to ask why they didn't notify the co-authors beforehand. NeurIPS has a great system where they nudge co-authors so we can help remind the reviewer, which I think is a highly logical and effective approach. However, the EMNLP team explained that they had reminded the reviewer directly multiple times and that their policies simply differ from those of NeurIPS. It is incredibly disheartening when administrative policies, rather than the quality of the research itself, negatively impact a paper's outcome. The peer review system certainly has its flaws, and as a community, we need to keep advocating to improve them. Hang in there, and please don't let this discourage you! Many of us have faced similar hurdles, and your work is still valuable!

English

1.2K

Tanmoy Chakraborty@Tanmoy_Chak·7 Nis

I strongly condemn and protest against rejecting a paper from ACL with such a justification. If I am not mistaken, "Findings" started with the motivation of accommodating such borderline "good" papers. I don’t see any reason behind such a justification, given that ACL does not have any venue constraints (runs in hybrid mode). #ACL2026 #NLProc @aclmeeting

English

15.6K

Tanmoy Chakraborty@Tanmoy_Chak·6 Nis

@ANRFIndia Awesome. Thanks a lot @ANRFIndia

English

Anusandhan National Research Foundation@ANRFIndia·6 Nis

@Tanmoy_Chak Please check @Tanmoy_Chak

English

Tanmoy Chakraborty@Tanmoy_Chak·6 Nis

@ANRFIndia I'm trying to upload my MAHA proposal, but the portal appears to have issues: 1. The detailed proposal uploaded under the "Details" tab - "Upload Other Technical Details" is not reflected in the PDF under the under "Preview and Submit" tab. 2. My updated CV is not showing in the final PDF (the old version appears instead). Could this be checked urgently? The deadline is tomorrow.

English

512

Tanmoy Chakraborty@Tanmoy_Chak·23 Mar

Our newly introduced 𝐆𝐔𝐈𝐃𝐄-𝐋𝐋𝐌 -- A reporting checklist for using LLMs in behavioral & social science. Massive collaborative effort led by @stfeuerriegel.

Stefan Feuerriegel@stfeuerriegel

🚀Introducing 𝐆𝐔𝐈𝐃𝐄-𝐋𝐋𝐌: A reporting checklist for using LLMs in behavioral & social science ✅GUIDE-LLM is a reporting checklist designed by 80+ experts to improve transparency, reproducibility & ethical accountability of LLM-based research 📄llm-checklist.com

English

1.2K

Tanmoy Chakraborty retweetledi

Lossfunk@lossfunk·18 Mar

2/ The organising committee for CAISc 2026 is led by @paraschopra, @dhruvtrehan9, and @gargdhruv36. We are glad to have @Tanmoy_Chak (IIT Delhi), Palash Goyal (Google Research), Dr Mohan Kankanhalli (NUS AI Institute), Shirish Karande (TCS Research) on our steering committee, and @murari_ai and Pratik Narang as our Program Committee Chairs. Additionally, our program committee for final human review spans CS, Mathematics, electrical engineering, and not just ML.

English

2.5K

Tanmoy Chakraborty@Tanmoy_Chak·16 Mar

@flyspicejet @flyspicejet I received the refund today. Thank you very much.

English

SpiceJet@flyspicejet·14 Mar

@Tanmoy_Chak We sincerely regret any inconvenience caused, Tanmoy. We are in receipt of your email, and have updated our team to check and revert soon.

English

124

Tanmoy Chakraborty@Tanmoy_Chak·14 Mar

@flyspicejet Refund of ~₹1.97L for a failed booking on 6 Mar is still pending despite multiple follow-ups (even after the promised 5 working days). Case no: 10509519. No response to email and unsatisfactory customer support. If the refund is not processed immediately, I will be forced to escalate this to consumer protection authorities and financial regulators. Please don't expect people to have infinite bandwidth to follow up regularly.

English

852

Tanmoy Chakraborty@Tanmoy_Chak·7 Mar

@flyspicejet our flight from FJR to Delhi got cancelled again today -- SJ9085 at 1305. No rescheduled notification. This is the second time we booked tickets. My 4.5L of ticket fees is on hold. I am accompanied by my 3 yrs old son and wife. Pls reschedule the flight asap. We are not in a position to book other flight due to financial constraints. Pls understand the situation.

English

SpiceJet@flyspicejet·6 Mar

@Tanmoy_Chak Hi Tanmoy, we have responded to you via DM.

English

112

Tanmoy Chakraborty@Tanmoy_Chak·6 Mar

@flyspicejet We booked tickets from Fujairah to Delhi for 7th Mar at 13:05. The payment was successful. But we did not receive tickets. Please check asap.

English

834

Tanmoy Chakraborty@Tanmoy_Chak·6 Mar

@flyspicejet Alright. Please consider this very urgent. I will not book tickets further until you message me. @flyspicejet

English

108

SpiceJet@flyspicejet·6 Mar

@Tanmoy_Chak We shall reply to your DM shortly.

English

125

Tanmoy Chakraborty@Tanmoy_Chak·6 Mar

@flyspicejet @flyspicejet pls reply asap to my DM. We are waiting. Either you send us the tickets or cancel the transaction and initiate a refund so that we can book it again.

English

137

SpiceJet@flyspicejet·6 Mar

@Tanmoy_Chak Hi Tanmoy, we have responded to you via DM.

English

160

Tanmoy Chakraborty@Tanmoy_Chak·3 Mar

Our new study on interpretability explains -- 𝐭𝐡𝐞 𝐏𝐡𝐲𝐬𝐢𝐜𝐬 𝐨𝐟 𝐊𝐕 𝐂𝐚𝐜𝐡𝐞 𝐂𝐨𝐦𝐩𝐫𝐞𝐬𝐬𝐢𝐨𝐧 𝐟𝐨𝐫 𝐋𝐋𝐌𝐬 Pre-print: arxiv.org/abs/2603.01426 As context lengths continue to grow, the KV cache has become the primary memory bottleneck during inference. While many compression techniques report impressive memory savings with minimal drops in benchmark accuracy, we asked a more structural question: 👉 𝘞𝘩𝘢𝘵 𝘢𝘤𝘵𝘶𝘢𝘭𝘭𝘺 𝘩𝘢𝘱𝘱𝘦𝘯𝘴 𝘵𝘰 𝘢𝘵𝘵𝘦𝘯𝘵𝘪𝘰𝘯 𝘢𝘯𝘥 𝘳𝘦𝘢𝘴𝘰𝘯𝘪𝘯𝘨 𝘸𝘩𝘦𝘯 𝘸𝘦 𝘤𝘰𝘮𝘱𝘳𝘦𝘴𝘴 𝘵𝘩𝘦 𝘒𝘝 𝘤𝘢𝘤𝘩𝘦? We frame KV compression as a 𝐜𝐨𝐧𝐭𝐫𝐨𝐥𝐥𝐞𝐝 𝐩𝐞𝐫𝐭𝐮𝐫𝐛𝐚𝐭𝐢𝐨𝐧 𝐨𝐟 𝐭𝐨𝐤𝐞𝐧-𝐥𝐞𝐯𝐞𝐥 𝐫𝐨𝐮𝐭𝐢𝐧𝐠 𝐢𝐧 𝐬𝐞𝐥𝐟-𝐚𝐭𝐭𝐞𝐧𝐭𝐢𝐨𝐧. Rather than evaluating only final task accuracy, we design synthetic datasets to probe: (1) Multi-entity tracking, (2) Coreference resolution, and (3) Multi-hop reasoning. This setup allows us to disentangle three critical dimensions: Information Retention, Accessibility, and Utilisation. Our findings reveal an interesting pattern: 👉 𝐌𝐨𝐝𝐞𝐫𝐚𝐭𝐞 𝐜𝐨𝐦𝐩𝐫𝐞𝐬𝐬𝐢𝐨𝐧 often preserves surface-level accuracy despite substantial internal representational degradation — suggesting significant redundancy in current models. 👉 𝐍𝐞𝐚𝐫 𝐞𝐱𝐭𝐫𝐞𝐦𝐞 𝐜𝐨𝐦𝐩𝐫𝐞𝐬𝐬𝐢𝐨𝐧, we observe a sharp "safety cliff" in hallucinations, driven by global erasure of answer-critical tokens. 👉 We also uncover a second failure mode -- 𝐫𝐞𝐩𝐫𝐞𝐬𝐞𝐧𝐭𝐚𝐭𝐢𝐨𝐧𝐚𝐥 𝐫𝐢𝐠𝐢𝐝𝐢𝐭𝐲 -- where tokens remain present, but routing flexibility collapses. These results suggest that evaluating compression solely through downstream accuracy can mask stronger structural effects on reasoning. Understanding these internal dynamics is crucial as we move toward longer-context and more memory-efficient LLMs. Brilliant work by Ayan Sengupta and Samhruth Ananthanarayanan. #ScienceofLLMs #Interpretability #KVCache #ModelCompression

English

3.7K

Keşfet

@huggingface @icmlconf @lcs2lab @iitdelhi @bitspilaniindia @ramgopal_rao @murari_ai @palashiitkgp