Antoine Moulin

432 posts

Antoine Moulin

@antoine_mln

doing a phd in RL/online learning on questions related to exploration and adaptivity

Katılım Ağustos 2020

534 Takip Edilen1.4K Takipçiler

Antoine Moulin@antoine_mln·4d

@jsuarez If you look at go-explore you may also want to check out arxiv.org/abs/2603.22273 too!

English

716

Joseph Suarez 🐡@jsuarez·5d

I am attempting to solve curriculum learning in RL in the next 2 weeks. Join me every day for 6-8 hours of livestreamed research, a 10 km run, heavy compounds, and a 1,000 calorie cut. Details + ground rules on day 1!

English

624

48.1K

Antoine Moulin@antoine_mln·6d

@_arohan_ @adinapak_ Breakfast at dishoom is underrated

English

rohan anil@_arohan_·6d

@adinapak_ Both are restaurants in london. Highly recommend them

English

337

rohan anil@_arohan_·6d

How do I buy spv on dishoom and hoppers?

English

6.8K

Antoine Moulin retweetledi

Timothy Gowers @wtgowers@wtgowers·8 May

But if AI mathematics continues to progress at anything like its current rate -- which is what I expect to happen -- then we will face a crisis very soon, and mathematics departments, who owe a duty of care to their students, should be urgently preparing for it.

English

146

1.4K

505.7K

Antoine Moulin retweetledi

Pushmeet Kohli@pushmeet·8 May

The future of Math is mathematicians and AI agents working together. Very pleased to introduce @GoogleDeepMind's AI co-mathematician: a multi-agent system designed to actively collaborate with human experts on open-ended research mathematics. Mathematicians testing the agent across areas as diverse as group theory, Hamiltonian systems, and algebraic combinatorics have reported impressive results. In autonomous mode evaluation on the rigorous FrontierMath Tier 4 problems, AI co-mathematician scored an unprecedented 48% — a new high score among all AI systems evaluated.

English

171

368

2.6K

309.2K

Antoine Moulin@antoine_mln·1 May

@hbouammar @icmlconf When I search for the inverse RL one I only find another paper with the same title :(

English

277

Haitham Bou Ammar@hbouammar·30 Nis

ICML +3 🤙🏻🤙🏻 @icmlconf

English

6.8K

Lujain Ibrahim@lujainmibrahim·29 Nis

🚨Very excited to see our work on warmth & sycophancy in LLMs out in @Nature today!🚨 We study what happens when LLMs are fine-tuned to be warmer, and find that warmth and sycophancy can be linked, with warm models showing higher errors on a range of benchmarks (🔗s below)

English

268

36.6K

Antoine Moulin@antoine_mln·29 Nis

@lujainmibrahim @Nature @oiioxford congrats!!

English

264

Andrea Zanette@Zanette_ai·27 Nis

Excited to share that our lab will present two Orals at the ICLR SPOT workshop this Monday: • Maximum Likelihood Reinforcement Learning (10:10–10:20) — 🏆 Best Paper Award • Expanding the Capabilities of Reinforcement Learning via Text Feedback (10:20–10:30) — Oral + 🏆 Outstanding Paper Award at LLA Workshop Come and say hi!

English

3.9K

Antoine Moulin@antoine_mln·27 Nis

@Zanette_ai Congrats to the authors!!

English

232

Antoine Moulin retweetledi

Emily Cheng@sparse_emcheng·21 Nis

I'm at #ICLR2026 presenting a poster 04/23! We all want to control GenAI models, but we lack tools to properly evaluate the limits of control. Here, we introduce algorithms to rigorously estimate controllable sets of any GenAI model with guarantees. Work from interning @Apple

English

2.2K

Antoine Moulin retweetledi

Noah Golowich@GolowichNoah·23 Nis

Excited about a couple of papers of ours in ICLR this year (both in Poster Session 1 Pavilion 3 & Oral Session 2B tomorrow): (1) Sequences of Logits Reveal the Low-Rank Structure of Language Models (joint w/ @axliu42 & @AShettyV) arxiv.org/pdf/2510.24966. 1/n

English

Antoine Moulin@antoine_mln·21 Nis

@nanjiang_cs looking forward to it!!

English

224

Nan Jiang@nanjiang_cs·21 Nis

Below is our “textbook” understanding of OPE with value-function approximation. Turns out some of them are not quite right/superficial; guess which ones need to go? ALL OF THEM!! IMSI Talk on Wed (not at ICLR): 1 ICLR paper + 1 preprint imsi.institute/activities/the…

English

21K

Antoine Moulin retweetledi

Yuda Song@yus167·20 Nis

I will be at ICLR 🇧🇷 this week and give some oral presentations and a panel discussion during the workshop days. Happy to discuss anything about RL (LLMs, robotics, theory).

English

3.6K

Antoine Moulin@antoine_mln·20 Nis

@g_k_swamy congrats!!

English

183

Gokul Swamy@g_k_swamy·20 Nis

it took a minute, but i'm proud to share that i'm finally "Dr. Swamy" :)

English

305

18.4K

Antoine Moulin@antoine_mln·17 Nis

@Fabien_Mikol @gabrielpeyre ah j'avais pas vu !

Français

124

Fabien@Fabien_Mikol·17 Nis

@antoine_mln @gabrielpeyre Il y avait eu aussi Oudeyer ! x.com/Fabien_Mikol/s…

Fabien@Fabien_Mikol

Très intéressante audition de Pierre-Yves Oudeyer (informaticien et directeur de recherche Inria) devant la Mission d'information sur l'IA. Voilà qui nous change. Extraits de son propos sur ce que peut apporter l'IA pour la découverte et la créativité, en science et en art :

Français

2.7K

Fabien@Fabien_Mikol·16 Nis

Virginie Bonnaillie-Noël (ENS) et @gabrielpeyre : on a "très clairement une rupture", des post-docs rapportant que leur thèse peut désormais être faite par l'IA en quelques minutes plutôt qu'en 3 ans ; "les performances sont assez incroyables", et "ça change incroyablement vite".

Français

267

238.8K

Antoine Moulin@antoine_mln·16 Nis

legend

ACM SIGecom@AcmSIGecom

📢 Announcing the 2025 SIGecom Doctoral Dissertation Awardees! 🏆 Winner: @GolowichNoah (MIT), advised by @KonstDaskalakis and Ankur Moitra, for the thesis: "Theoretical Foundations for Learning in Games and Dynamic Environments"

English

453

Antoine Moulin retweetledi

Nived Rajaraman@Nived_Rajaraman·13 Nis

Through what mechanisms can reasoning models learn faster by choosing what problems to train on, and what are the limits? Part I of a new series: "Learning to Reason with Curriculum", where we explore algorithmic principles for overcoming the limitations of pre-trained models and data. w/ Audrey Huang (@auddery), Miro Dudik (@MiroDudik), Rob Schapire, Dylan Foster (@canondetortugas) and Akshay Krishnamurthy. [1/12]

English

8.3K

Antoine Moulin@antoine_mln·31 Mar

@andreamichi @Mononofu Congrats Andrea!!

English

231

Andrea Michi@andreamichi·31 Mar

depthfirst has raised an $80M Series B at a $580M valuation. Attackers are using AI to break into systems faster than ever before. depthfirst is on a mission to stop this. RT + Comment “depthfirst” and I’ll send you a FREE vibe coding security agent.

English

121

113

525

1.6M

Antoine Moulin@antoine_mln·25 Mar

@Yadkori This also means @EurIPSConf is subject to the same rule now that it’s an official satellite event… Surprised the organizers agreed to this!

English

612

Yasin Abbasi Yadkori@Yadkori·25 Mar

Very disappointing. That’s one less area chair responsibility for me. If I hadn’t already committed to colleagues, I wouldn’t submit a paper this year either.

机器之心 JIQIZHIXIN@jiqizhixin

Breaking: Academic freedom no more. The NeurIPS Foundation has announced it will no longer accept submissions from US-sanctioned institutions.

English

11.3K

Keşfet

@jsuarez @_arohan_ @adinapak_ @GoogleDeepMind @hbouammar @icmlconf @Nature @lujainmibrahim