Antoine Moulin

432 posts

@antoine_mln

doing a phd in RL/online learning on questions related to exploration and adaptivity

Joined August 2020
534 Following · 1.4K Followers
Joseph Suarez 🐡
Joseph Suarez 🐡@jsuarez·
I am attempting to solve curriculum learning in RL in the next 2 weeks. Join me every day for 6-8 hours of livestreamed research, a 10 km run, heavy compounds, and a 1,000 calorie cut. Details + ground rules on day 1!
English
26
26
624
48.1K
rohan anil
rohan anil@_arohan_·
@adinapak_ Both are restaurants in london. Highly recommend them
English
2
0
1
337
rohan anil
rohan anil@_arohan_·
How do I buy spv on dishoom and hoppers?
English
6
2
42
6.8K
Antoine Moulin retweeted
Timothy Gowers @wtgowers
Timothy Gowers @wtgowers@wtgowers·
But if AI mathematics continues to progress at anything like its current rate -- which is what I expect to happen -- then we will face a crisis very soon, and mathematics departments, who owe a duty of care to their students, should be urgently preparing for it.
English
71
146
1.4K
505.7K
Antoine Moulin retweeted
Pushmeet Kohli
Pushmeet Kohli@pushmeet·
The future of Math is mathematicians and AI agents working together. Very pleased to introduce @GoogleDeepMind's AI co-mathematician: a multi-agent system designed to actively collaborate with human experts on open-ended research mathematics. Mathematicians testing the agent across areas as diverse as group theory, Hamiltonian systems, and algebraic combinatorics have reported impressive results. In autonomous mode evaluation on the rigorous FrontierMath Tier 4 problems, AI co-mathematician scored an unprecedented 48% — a new high score among all AI systems evaluated.
English
171
368
2.6K
309.2K
Lujain Ibrahim
Lujain Ibrahim@lujainmibrahim·
🚨Very excited to see our work on warmth & sycophancy in LLMs out in @Nature today!🚨 We study what happens when LLMs are fine-tuned to be warmer, and find that warmth and sycophancy can be linked, with warm models showing higher errors on a range of benchmarks (🔗s below)
English
14
61
268
36.6K
Andrea Zanette
Andrea Zanette@Zanette_ai·
Excited to share that our lab will present two Orals at the ICLR SPOT workshop this Monday: • Maximum Likelihood Reinforcement Learning (10:10–10:20) — 🏆 Best Paper Award • Expanding the Capabilities of Reinforcement Learning via Text Feedback (10:20–10:30) — Oral + 🏆 Outstanding Paper Award at LLA Workshop Come and say hi!
English
2
11
53
3.9K
Antoine Moulin retweeted
Emily Cheng
Emily Cheng@sparse_emcheng·
I'm at #ICLR2026 presenting a poster 04/23! We all want to control GenAI models, but we lack tools to properly evaluate the limits of control. Here, we introduce algorithms to rigorously estimate controllable sets of any GenAI model with guarantees. Work from interning @Apple
English
2
5
33
2.2K
Antoine Moulin retweeted
Noah Golowich
Noah Golowich@GolowichNoah·
Excited about a couple of papers of ours in ICLR this year (both in Poster Session 1 Pavilion 3 & Oral Session 2B tomorrow): (1) Sequences of Logits Reveal the Low-Rank Structure of Language Models (joint w/ @axliu42 & @AShettyV) arxiv.org/pdf/2510.24966. 1/n
English
1
7
59
5K
Nan Jiang
Nan Jiang@nanjiang_cs·
Below is our “textbook” understanding of OPE with value-function approximation. Turns out some of them are not quite right/superficial; guess which ones need to go? ALL OF THEM!! IMSI Talk on Wed (not at ICLR): 1 ICLR paper + 1 preprint imsi.institute/activities/the…
English
3
2
38
21K
Antoine Moulin retweeted
Yuda Song
Yuda Song@yus167·
I will be at ICLR 🇧🇷 this week and give some oral presentations and a panel discussion during the workshop days. Happy to discuss anything about RL (LLMs, robotics, theory).
English
1
4
45
3.6K
Gokul Swamy
Gokul Swamy@g_k_swamy·
it took a minute, but i'm proud to share that i'm finally "Dr. Swamy" :)
English
39
2
305
18.4K
Fabien
Fabien@Fabien_Mikol·
Virginie Bonnaillie-Noël (ENS) and @gabrielpeyre: there is "very clearly a break", with post-docs reporting that their thesis can now be done by AI in a few minutes rather than in 3 years; "the performance is quite incredible", and "it is changing incredibly fast".
French
32
62
267
238.8K
Antoine Moulin retweeted
Nived Rajaraman
Nived Rajaraman@Nived_Rajaraman·
Through what mechanisms can reasoning models learn faster by choosing what problems to train on, and what are the limits? Part I of a new series: "Learning to Reason with Curriculum", where we explore algorithmic principles for overcoming the limitations of pre-trained models and data. w/ Audrey Huang (@auddery), Miro Dudik (@MiroDudik), Rob Schapire, Dylan Foster (@canondetortugas) and Akshay Krishnamurthy. [1/12]
English
1
11
42
8.3K
Andrea Michi
Andrea Michi@andreamichi·
depthfirst has raised an $80M Series B at a $580M valuation. Attackers are using AI to break into systems faster than ever before. depthfirst is on a mission to stop this. RT + Comment “depthfirst” and I’ll send you a FREE vibe coding security agent.
English
121
113
525
1.6M
Antoine Moulin
Antoine Moulin@antoine_mln·
@Yadkori This also means @EurIPSConf is subject to the same rule now that it’s an official satellite event… Surprised the organizers agreed to this!
English
1
0
8
612