Rahul Madhavan

4.6K posts

Rahul Madhavan banner
Rahul Madhavan

Rahul Madhavan

@imrahulmaddy

Building self-improving agents Research @ GoogleDeepMind

Katılım Kasım 2020
1.3K Takip Edilen1.2K Takipçiler
Rahul Madhavan retweetledi
fly51fly
fly51fly@fly51fly·
[LG] Efficient RL Training for LLMs with Experience Replay C Arnal, V Cabannes, T Cohen, J Kempe… [FAIR at Meta] (2026) arxiv.org/abs/2604.08706
fly51fly tweet mediafly51fly tweet mediafly51fly tweet mediafly51fly tweet media
English
0
3
18
1.5K
Rahul Madhavan retweetledi
Chris Hayduk
Chris Hayduk@ChrisHayduk·
In July 2024, DeepMind unveiled AlphaProof — an AlphaZero-inspired agent that constructs mathematical arguments in Lean, a programming language for proofs. It broke new ground in mathematical performance, achieving a silver medal in the 2024 International Math Olympiad. One year later, in July 2025, OpenAI announced that they had achieved a gold medal in the 2025 International Math Olympiad using a raw LLM — no reinforcement learning in Lean space, no translation between natural language and formal proof languages. In the span of a few weeks, this same model would go on to add a gold medal at the International Olympiad in Informatics and a 2nd place finish at the AtCoder World Tour Finals to its achievements. Since July 2025, I kept coming back to this puzzle: Why would a general-purpose language model, one just as comfortable answering questions about lasagna recipes in ChatGPT as it is answering mathematical questions, end up looking stronger at Olympiad math than a much more math-specific theorem-proving system? In my new blog post, I use legendary mathematician Jacques Hadamard's analysis of the phenomenology of mathematical discovery to attempt to answer this question. And to probe where LLMs are headed next. Link in the replies below.
Chris Hayduk tweet media
English
7
10
93
8.8K
Rahul Madhavan retweetledi
Peyman Milanfar
Peyman Milanfar@docmilanfar·
Outstanding researchers excel at the art of finding problems right at the edge of our understanding — that can actually be solved.
English
6
27
420
24.7K
Rahul Madhavan retweetledi
Rahul Madhavan retweetledi
이재명
이재명@Jaemyung_Lee·
<끊임없는 반인권적 반국제법적 행동으로 고통받고 힘들어하는 전 세계인들의 지적을 한번쯤은 되돌아볼 만도 한데 실망입니다. 내가 아프면 타인도 그만큼 아픕니다. 나의 필요 때문에 누군가 고통받으면 미안한 것이 인지상정입니다. 아닌 밤중에 홍두깨라고 아무 잘못없는 우리 국민들께서 뜬금없이 겪고 있는 이 엄청난 고통과 국가적 어려움을 지켜보는 마음이 매우 불편합니다. 보편적 인권과 대한민국의 국익을 위해 할 수 있는 일을 더 열심히 찾아봐야겠습니다.> 이스라엘, ‘전시 살해=유대인 학살’ 李대통령 발언에 “용납 못해” v.daum.net/v/202604110641…
한국어
2.5K
14K
51K
9.1M
Rahul Madhavan retweetledi
Kanjun 🐙
Kanjun 🐙@kanjun·
Twitter’s algorithm is optimized for addiction, not for us. We deserve better. We’re releasing Bouncer today so you can take back control of your feed. Describe what you don't want, and Bouncer removes it. It’s free, doesn’t collect your data, and will be open source soon.
English
210
292
3.1K
564.8K
Rahul Madhavan retweetledi
Rémi Lodh
Rémi Lodh@LodhSpringer·
This year, the world is marking the 200th anniversary of the birth of mathematician Bernhard Riemann. Renown historian David Rowe has undertaken a deep study of Riemann's life and work, completing what should be his definitive biography. It will be published in June, stay tuned!
Rémi Lodh tweet media
English
4
96
481
17.1K
Rahul Madhavan
Rahul Madhavan@imrahulmaddy·
Can the machine watch itself. Can it have a sense of self? Can it watch the universe, the effects of the actions of that self on the universe... Then is the universe observing itself? If it is, then it must be, that it is conscious.
English
0
0
0
19
Rahul Madhavan
Rahul Madhavan@imrahulmaddy·
Maybe in future a machine could also create an equivalent world, say in a distant planet. But taking action by itself does not create consciousness. The only question, maybe, is whether the Universe is observing itself through that machine.
English
1
0
0
22