Rakesh R Menon

226 posts

@rrmenon10

CS PhD Candidate @uncnlp @umasscs, @iitmadras alum.

Joined March 2011
337 Following · 254 Followers
Rakesh R Menon retweeted
John Murzaku (@jmurzaku)
Infra headaches from changing docs? We built InfraReconciler at @ycombinator Hack the Stackathon.
1. Scrape & snapshot months of infra docs, running daily for new changes (thanks @firecrawl and @supabase).
2. Plug into your infra code on @github.
3. Pinpoint fixes, warnings, and new best practices with InfraAgent (thanks @openrouter).
0
2
13
348
Rakesh R Menon retweeted
Sagnik (@saagnikkk)
🚨 Paper Alert: “RL Finetunes Small Subnetworks in Large Language Models” From DeepSeek V3 Base to DeepSeek R1 Zero, a whopping 86% of parameters were NOT updated during RL training 😮😮 And this isn’t a one-off. The pattern holds across RL algorithms and models. 🧵A Deep Dive
17
134
909
192.8K
Rakesh R Menon retweeted
John Murzaku (@jmurzaku)
@PrimeIntellect I built an env to test whether audio LLMs build prosody-only reasoning chains on MELD (multimodal emotion dataset). Models listen to an utterance (raw audio file) and then must output one sentence like: "higher pitch, noticeable variation, moderate volume, quick pace." This was so fun; I feel lots of momentum to continue this, and the ideas are flowing lol. @willccbb I made some minor changes to verifiers to allow audio input (see github.com/yurpl/verifiers). HuggingFace dataset available here: huggingface.co/datasets/jmurz… Inspired by Jeff Wu's paper here: arxiv.org/abs/2407.21315 Env here: app.primeintellect.ai/dashboard/envi…
6
12
92
9.1K
Rakesh R Menon retweeted
Wadhwani School of Data Science & AI (WSAI), IITM
A moment of global pride for India in the field of AI. Prof. Mitesh M. Khapra, Co-founder at the Nilekani Centre at AI4Bharat, WSAI, IIT Madras, has been featured on the '2025 TIME100 AI List of the World’s Most Influential People in Artificial Intelligence'. AI4Bharat is a one-of-a-kind project that collected thousands of hours of voice data across 400 districts. His pioneering work is bridging the AI gap for Indian languages. 🔗: time.com/collections/ti… @TIME @ravi_iitm @rbc_dsai_iitm @IBSE_IITM @ai4bharat @wcte_iitm @iitmadras @EduMinOfIndia @OfficialINDIAai @SarvamAI #TIME100AI #ArtificialIntelligence #TIME100AI2025 #AI4Bharat #FutureOfAI #IndiansInSTEM #DigitalIndia #SovereignAI #IITMadras #MiteshKhapra
1
17
115
4.4K
Rakesh R Menon retweeted
Rheeya Uppaal (@RUppaal)
I will be at ICLR next week to share our work on Model Editing for Alignment! DM if you'd like to chat about safety, interpretability, life in general or tourist spots in Singapore! #NLP #AISafety #Interpretability #ICLR2025 @iclr_conf
Quoting Rheeya Uppaal (@RUppaal):

@iclr_conf paper alert! The de facto way to align a model through tuning-based methods like DPO is powerful, yet expensive and prone to jailbreaking. Emerging work on model editing aims to address this, and yet the two approaches are largely siloed. Can we somehow connect them?🧐

0
3
70
6.7K
Rakesh R Menon retweeted
Somnath Basu Roy Chowdhury (@SomnathBrc)
How can we perfectly erase concepts from LLMs? Our method, Perfect Erasure Functions (PEF), erases concepts from LLM representations w/o parameter estimation, achieving a Pareto-optimal erasure-utility tradeoff w/ guarantees. #AISTATS2025 🧵
2
35
153
23.4K
Rakesh R Menon retweeted
Association for Computing Machinery
Meet the recipients of the 2024 ACM A.M. Turing Award, Andrew G. Barto and Richard S. Sutton! They are recognized for developing the conceptual and algorithmic foundations of reinforcement learning. Please join us in congratulating the two recipients! bit.ly/4hpdsbD
32
456
1.5K
455.3K
Rakesh R Menon retweeted
Danish Pruthi (@danish037)
Remember this study about how LLM generated research ideas were rated to be more novel than expert-written ones? We find a large fraction of such LLM generated proposals (≥ 24%) to be skillfully plagiarized, bypassing inbuilt plagiarism checks and unsuspecting experts. A 🧵
Quoting CLS (@ChengleiSi):

Automating AI research is exciting! But can LLMs actually produce novel, expert-level research ideas? After a year-long study, we obtained the first statistically significant conclusion: LLM-generated ideas are more novel than ideas written by expert human researchers.

31
246
1.6K
217.9K
Rakesh R Menon retweeted
Anvesh Rao (@nvshrao)
(1/4) Excited to present our latest work "Exploring Safety-Utility Trade-Offs in Personalized LMs" at #NAACL2025! 🎉 We investigate Personalization Bias in LLMs. 📄 Paper: arxiv.org/abs/2406.11107 🧵👇
2
7
34
4K
Rakesh R Menon retweeted
Nitay Calderon (@NitCal)
Our paper: "On Behalf of the Stakeholders: Trends in NLP Model Interpretability in the Era of LLMs" has been accepted to NAACL 2025 main 🎉 We’ve updated the final version: 🔗arxiv.org/abs/2407.19200 If you are an NLP interpretability researcher, we have key takeaways for you!👇
1
28
80
9.6K
Rakesh R Menon retweeted
Rheeya Uppaal (@RUppaal)
@iclr_conf paper alert! The de facto way to align a model through tuning-based methods like DPO is powerful, yet expensive and prone to jailbreaking. Emerging work on model editing aims to address this, and yet the two approaches are largely siloed. Can we somehow connect them?🧐
2
10
43
14.1K
Rakesh R Menon retweeted
IIT Madras (@iitmadras)
@iitmadras has partnered with @perplexity_ai, a revolutionary search engine founded by our alumnus, Dr Aravind Srinivas (@AravSrinivas), to offer its faculty and students free access to Perplexity Pro. This cutting-edge search engine is designed to provide a more robust and versatile search experience, empowering users to conduct more thorough investigations into topics.

Perplexity Pro stands out from its free version with several enhancements, including access to more powerful AI models, a choice of various AI models for searches, and deeper search capabilities. These features enable users to delve deeper into subjects and gather more accurate information.

Prof. B Ravindran (@ravi_iitm), Head of the Wadhwani School of Data Science and AI (@WSAI_IITM) at IIT Madras, lauded this initiative, highlighting Perplexity AI's reliability and potential to transform how future generations learn and consume information online.

IIT Madras extends its heartfelt gratitude to Dr. Aravind Srinivas for this generous gesture, which will undoubtedly enhance research and learning experiences of our faculty and students.

With Perplexity Pro, IIT Madras faculty and students will have access to real-time information, conversational interfaces, source transparency, and advanced AI technology. This partnership is poised to enhance research and learning experiences, fostering a community of innovators and thinkers.

#IITMadras #PerplexityAI #ArtificialIntelligence #SearchEngine #Research #Learning #Innovation #Collaboration #EmpoweringStudents
6
8
67
2.6K
Rakesh R Menon retweeted
Ekin Akyürek (@akyurekekin)
Why do we treat train and test times so differently? Why is one “training” and the other “in-context learning”? Just take a few gradients during test-time — a simple way to increase test time compute — and get a SoTA in ARC public validation set 61%=avg. human score! @arcprize
35
324
1.8K
495.6K
Rakesh R Menon retweeted
The Nobel Prize (@NobelPrize)
BREAKING NEWS The Royal Swedish Academy of Sciences has decided to award the 2024 #NobelPrize in Physics to John J. Hopfield and Geoffrey E. Hinton “for foundational discoveries and inventions that enable machine learning with artificial neural networks.”
979
13.1K
32.2K
12.7M
Rakesh R Menon retweeted
Rheeya Uppaal (@RUppaal)
Excited to share our latest research on improving the safety of LLMs! We've developed DeTox, a tuning-free and noise robust alignment method that significantly reduces model toxicity without the need for large-scale preference data. 🚀 1/n
2
8
49
6.9K
Rakesh R Menon retweeted
UNC College of Arts and Sciences
A @UNCSDSS seed grant will support Angel Hsu and collaborators Shashank Srivastava and Jeffrey Mittelstadt in fine-tuning a large language model, ChatNetZero, designed to better understand companies’ and governments’ net-zero commitments. ow.ly/WYfp50RNttu @datadrivenlab
0
6
9
1.2K