Aditi Mavalankar

140 posts

@aditimavalankar

Research Scientist @DeepMind

London, UK · Joined March 2017

420 Following · 2K Followers
Aditi Mavalankar reposted
Abhinav Moudgil @amoudgl
Introducing Celo2: Towards Learned Optimization Free Lunch We show that learned optimizers can generalize to practical tasks like GPT-3 1.3B pretraining and several out-of-distribution vision/RL tasks from limited meta-training (~4.5 GPU hours)! 🧵
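For readers unfamiliar with learned optimization, the gist is that a small neural network replaces a hand-designed update rule such as SGD or Adam, and that network is itself meta-trained. A minimal sketch of that generic pattern, assuming a shared per-parameter MLP over gradient and momentum features; nothing here is Celo2's actual architecture:

```python
# Illustrative sketch only: a tiny MLP, shared across all parameters,
# maps per-parameter features (gradient, momentum) to an update.
# This is the generic learned-optimizer pattern, not Celo2's design.
import torch
import torch.nn as nn

class TinyLearnedOptimizer(nn.Module):
    def __init__(self, hidden=32):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(2, hidden), nn.ReLU(), nn.Linear(hidden, 1)
        )

    def step(self, params, grads, momenta, beta=0.9, lr=1e-3):
        new_params, new_momenta = [], []
        for p, g, m in zip(params, grads, momenta):
            m = beta * m + (1 - beta) * g                 # momentum feature
            feats = torch.stack([g.flatten(), m.flatten()], -1)
            update = self.net(feats).reshape(p.shape)     # learned update
            new_params.append(p - lr * update)
            new_momenta.append(m)
        return new_params, new_momenta
```

Meta-training then backpropagates a task's training loss through several unrolled `step` calls to fit `net`; the thread's claim is that a modest meta-training budget (~4.5 GPU hours) already transfers to much larger, out-of-distribution tasks.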
Aditi Mavalankar reposted
Seijin Kobayashi @SeijinKobayashi
Standard reinforcement learning in raw tokens is a disaster for sparse rewards! Here, we propose 𝗜𝗻𝘁𝗲𝗿𝗻𝗮𝗹 𝗥𝗟: acting on abstract actions emerging in the residual stream representation. A paradigm shift in using pretrained models to solve hard, long-horizon tasks! 🧵
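Reading the tweet literally, the mechanism is a policy that acts in the model's internal representation rather than in token space. A hedged sketch under that assumption; the GPT-2-style attribute path (`model.transformer.h`) and the additive steering are my illustrative choices, and the paper's actual interface may differ:

```python
# Hedged sketch of "internal RL" as described in the tweet: the policy
# picks abstract actions in a transformer's residual stream instead of
# emitting raw tokens. Layer index and steering rule are assumptions.
import torch

def act_in_residual_stream(model, tokens, policy, layer=12):
    """Run `model` while a policy injects an abstract action at one layer."""
    def hook(module, inputs, output):
        hidden = output[0] if isinstance(output, tuple) else output
        action = policy(hidden[:, -1])    # abstract action from current state
        hidden[:, -1] += action           # steer the residual stream
        return output

    handle = model.transformer.h[layer].register_forward_hook(hook)
    try:
        return model(tokens)              # the pretrained model decodes as usual
    finally:
        handle.remove()
```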
Sushant Sachdeva @sushnt
I believe we're at the doorstep of a revolution in how we'll do theoretical research, and this is just the beginning! A good time to announce that I've taken leave from UofT to be part of the revolution from the inside at @OpenAI! :)
Sebastien Bubeck @SebastienBubeck

3 years ago we could showcase AI's frontier w. a unicorn drawing. Today we do so w. AI outputs touching the scientific frontier: cdn.openai.com/pdf/4a25f921-e… Use the doc to judge for yourself the status of AI-aided science acceleration, and hopefully be inspired by a couple examples!

Aditi Mavalankar reposted
Google DeepMind @GoogleDeepMind
This is Gemini 3: our most intelligent model that helps you learn, build and plan anything. It comes with state-of-the-art reasoning capabilities, world-leading multimodal understanding, and enables new agentic coding experiences. 🧵
Aditi Mavalankar reposted
Luisa Zintgraf @luisa_zintgraf
Excited to share our new paper, "DataRater: Meta-Learned Dataset Curation"! We explore a fundamental question: How can we *automatically* learn which data is most valuable for training foundation models? Paper: arxiv.org/pdf/2505.17895 to appear @NeurIPSConf Thread 👇
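The core loop, as the announcement describes it, is meta-learning a data-valuation model by how much the data it up-weights improves downstream training. A rough one-inner-step sketch of that pattern; the sigmoid weighting, the single-step unroll, and all names here are illustrative simplifications of what the paper does at scale:

```python
# Rough sketch of meta-learned data valuation in the spirit of DataRater:
# a rater scores each example, the inner model takes one weighted gradient
# step, and the held-out loss after that step is backpropagated into the
# rater. One inner step and all names are illustrative simplifications.
import torch
import torch.nn.functional as F

def meta_loss(model, rater, train_batch, heldout_batch, inner_lr=0.1):
    x, y = train_batch
    w = torch.sigmoid(rater(x)).squeeze(-1)    # per-example value in (0, 1)
    inner = (w * F.cross_entropy(model(x), y, reduction="none")).mean()

    # One differentiable inner step; create_graph keeps the meta-grad path.
    grads = torch.autograd.grad(inner, list(model.parameters()),
                                create_graph=True)
    updated = {n: p - inner_lr * g
               for (n, p), g in zip(model.named_parameters(), grads)}

    # Held-out loss under the updated params measures downstream
    # training efficiency; its gradient w.r.t. the rater is the meta-gradient.
    xh, yh = heldout_batch
    logits = torch.func.functional_call(model, updated, (xh,))
    return F.cross_entropy(logits, yh)         # backprop this into the rater
```

Calling `.backward()` on the returned loss and stepping only the rater's parameters closes the meta-learning loop.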
Aditi Mavalankar reposted
Dan A. Calian @dancalian
Dataset curation for language models has long relied on brittle, hand-crafted rules. It's time for a more principled, automated approach. Enter DataRater: a meta-learning framework that learns to value data based on downstream training efficiency. Great summary by Luisa below 👇
Luisa Zintgraf @luisa_zintgraf

Excited to share our new paper, "DataRater: Meta-Learned Dataset Curation"! We explore a fundamental question: How can we *automatically* learn which data is most valuable for training foundation models? Paper: arxiv.org/pdf/2505.17895 to appear @NeurIPSConf Thread 👇

Aditi Mavalankar @aditimavalankar
Gemini with advanced Deep Think achieved gold medal-level performance at IMO 2025! 🥇 Very happy to have been a small part of this collaboration on the inference side, and congrats to everyone involved!
Google DeepMind @GoogleDeepMind

An advanced version of Gemini with Deep Think has officially achieved gold medal-level performance at the International Mathematical Olympiad. 🥇 It solved 5️⃣ out of 6️⃣ exceptionally difficult problems, involving algebra, combinatorics, geometry and number theory. Here’s how 🧵

Aditi Mavalankar @aditimavalankar
This was a really fun collaboration with my brilliant collaborators Hassan Mansoor, Zita Marinho, Masha Samsikova, and Tom Schaul!
Aditi Mavalankar @aditimavalankar
In addition to this, AuPair works better across Codeforces difficulty levels and preserves coverage of problem categories from the training data distribution (see the paper for details).
Aditi Mavalankar @aditimavalankar
Excited to share our recent work, AuPair, an inference-time technique that builds on the premise of in-context learning to improve LLM coding performance! arxiv.org/abs/2502.18487
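The premise, as stated, is in-context learning at inference time: curated "golden" pairs of an initial attempt and its fix are placed in the prompt before a new repair query. A minimal sketch of that prompting pattern; `llm`, the field names, and the prompt wording are placeholders rather than the paper's exact format:

```python
# Hedged sketch of the inference-time setup as I read the abstract:
# curated "golden" (initial attempt -> fix) pairs go in-context before
# a new repair query. All names and wording here are placeholders.

def build_repair_prompt(golden_pairs, problem, failing_attempt):
    parts = []
    for ex in golden_pairs:                    # dicts: problem/guess/fix
        parts.append(
            f"Problem:\n{ex['problem']}\n"
            f"Initial attempt:\n{ex['guess']}\n"
            f"Improved solution:\n{ex['fix']}\n"
        )
    parts.append(
        f"Problem:\n{problem}\n"
        f"Initial attempt:\n{failing_attempt}\n"
        f"Improved solution:"
    )
    return "\n".join(parts)

def aupair_repair(llm, golden_pairs, problem, failing_attempt):
    return llm(build_repair_prompt(golden_pairs, problem, failing_attempt))
```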