Kartik Talamadupula

1.7K posts

Kartik Talamadupula

Kartik Talamadupula

@kr_t

AI @Oracle | Ex- @WandAI_ @symbldotai @IBMResearch @ASU | Startup Advisor | AAAI Senior Member | Views my own

Seattle, WA Beigetreten Haziran 2010
367 Folgt692 Follower
Angehefteter Tweet
Kartik Talamadupula
Kartik Talamadupula@kr_t·
Check out our latest work (that @rasbt talks about in much detail in the post below) on "Concise Reasoning via Reinforcement Learning" - Paper: arxiv.org/abs/2504.05185 Blog: wand.ai/blog/from-prol…
Sebastian Raschka@rasbt

As we all know by now, reasoning models often generate longer responses, which raises compute costs. Now, this new paper (arxiv.org/abs/2504.05185) shows that this behavior comes from the RL training process, not from an actual need for long answers for better accuracy. The RL loss tends to favor longer responses when the model gets negative rewards, which I think explains the "aha" moments and longer chains of thought that arise from pure RL training. I.e., if the model gets a negative reward (i.e., the answer is wrong), the math behind PPO causes the average per-token loss becomes smaller when the response is longer. So, the model is indirectly encouraged to make its responses longer. This is true even if those extra tokens don't actually help solve the problem. What does the response length have to do with the loss? When the reward is negative, longer responses can dilute the penalty per individual token, which results in lower (i.e., better) loss values (even though the model is still getting the answer wrong). So the model "learns" that longer responses reduce the punishment, even though they are not helping correctness. In addition, the researchers show that a second round of RL (using just a few problems that are sometimes solvable) can shorten responses while preserving or even improving accuracy. This has big implications for deployment efficiency.

English
0
0
1
421
Kartik Talamadupula
Kartik Talamadupula@kr_t·
A recent trend that I'm noticing with the emergence of LLMs: Engineers are increasingly isolating themselves from talking to outside teams and companies. That vacuum is now filled with enterprising product types who can now code a lot more, and want to build for customers' needs.
English
0
0
1
69
Gautam Kamath
Gautam Kamath@thegautamkamath·
Which company has the cutest swag at #NeurIPS2025? I am looking for something to take home to my two year old
English
11
4
47
16.7K
Kartik Talamadupula retweetet
Chris Offner
Chris Offner@chrisoffner3d·
Watching ML/CV researchers at work.
English
22
96
1.2K
98.4K
Kartik Talamadupula retweetet
ACM SIGAI
ACM SIGAI@acm_sigai·
Call for ACM SIGAI Autonomous Agents Research Award 2026 The award is made for research excellence in autonomous agents, to recognise researchers whose current work is an important influence Deadline: 15th Dec 2026 Nominate: forms.gle/RNLZgbbaPQe6TD… #SIGAIAward
English
0
2
0
134
Kartik Talamadupula retweetet
Kartik Talamadupula retweetet
Subbarao Kambhampati (కంభంపాటి సుబ్బారావు)
What a tour de force and gracious #NeurIPS2025 Test of Time talk by Kaimeng He! 🙏 Faced with a choice of a "prophet talk" or a "realistic talk", he says he went for the latter, and we are all the richer for it. "I feel like I am in a ship in Atlantic. Everything ahead is unknown. There is no oracle. No prophet. And when it is discovered, I hope it becomes common knowledge."
Subbarao Kambhampati (కంభంపాటి సుబ్బారావు) tweet media
English
0
10
47
6.7K
Kartik Talamadupula retweetet
dr. jack morris
dr. jack morris@jxmnop·
Wondering how to attend an ML conference the right way? ahead of NeurIPS 2025 (30k attendees!) here are ten pro tips: 1. Your main goals: (i) meet people (ii) regain excitement about work (iii) learn things – in that order. 2. Make a list of papers you like and seek them out at poster sessions. Try to talk to the authors– you can learn much more from them than from a PDF. 3. Pick one workshop and one tutorial that sounds most interesting. Skip the rest. 4. Cold email people you want to meet but haven't. Check Twitter and the accepted papers list. PhD students are especially responsive. 5. Practice a concise pitch of unpublished research you're working on for "what are you interested in rn?". Focus on big unanswered questions and exciting new directions, *not* papers. 6. Skip the orals. Posters are a higher-bandwidth, more engaging, more invigorating. Orals are a good time to go for a walk or talk in the hallway. 7. for the love of god, do NOT work on other research in your hotel room. Save mental bandwidth for the conference. (This may seem obvious; you'd be surprised.) 8. Talk to people outside your area. There are many smart people working on niches <10 people understand. Learn about one or two that won't help your own work. 9. Attend one social each night. Don't overthink it or get caught up in status games. They're all fun. 10. Take breaks. You can't go to everything, and conferences consume more energy than a normal workweek. hope this helps, and sad i'm not attending neurips, have fun :)
dr. jack morris tweet media
English
28
129
1.5K
136.4K
Kartik Talamadupula retweetet
Ravid Shwartz Ziv
Ravid Shwartz Ziv@ziv_ravid·
🎙️ A new episode of The Information Bottleneck podcast! This time we're trying something different, just AI news & paper discussions (no guest interview). We talked about: 🏥 GPT-5 in medicine & healthcare AI risks 📦 Stanford's "Cartridges" paper on compressing KV caches 🔄 Continuous Autoregressive Language Models paper 📚 The Smol Training Playbook Let us know what you think of this experimental format!
English
1
2
8
1.1K
Kartik Talamadupula retweetet
Charles 🎉 Frye
Charles 🎉 Frye@charles_irl·
born to build artificial intelligence forced to debug python package installation
English
7
12
116
10.5K
Kartik Talamadupula retweetet
Chris Bakke
Chris Bakke@ChrisJBakke·
*open app* "We've just raised a $50M pre-seed to help your toaster talk to your microwave." "We just raised a $230M pre-pre seed to agenticly agent your AI agents." "I'm 4 and I just dropped out of preschool to go all-in on AI -enabled candles." *close app*
English
209
463
6.8K
254.4K
Kartik Talamadupula retweetet
dr. jack morris
dr. jack morris@jxmnop·
first AI came for stackoverflow and i did not speak out due to their unpleasant moderators then AI came for quora and i did not speak out because i never use quora then AI came for Wikipedia and i did not speak out because i did not care then AI came for AI research and there was no one left to speak for me
English
19
31
603
57.7K
Kartik Talamadupula retweetet
cubert brennan-burke
cubert brennan-burke@conor_ai·
being non-technical in SF is like being ugly in LA
English
141
129
2.9K
178.5K
Kartik Talamadupula retweetet
rachel 🪷
rachel 🪷@racheleizner·
people saying sf parties are lame simply aren't committing to the bit. three vc sponsored cocktails in and i genuinely care about this devtools launch
English
34
27
949
61K