Antoine Maier

13 posts

@antmaier

AI Security researcher at General-Purpose AI Policy Lab

Joined October 2025
10 Following · 4 Followers
Pinned Tweet
Antoine Maier (@antmaier)
⚠️ "When a measure becomes a target, it ceases to be a good measure." But wait, isn't that exactly how we train an AI? What actually happens when you push optimization too far? We went looking for a rigorous answer. None existed. So we built one from first principles. 👇 1/13
Antoine Maier (@antmaier)
As situational awareness, long-horizon planning (@METR_Evals), and other general capabilities improve (@EpochAIResearch), systems become better at anticipating and countering human intervention, up to a threshold beyond which their performance surpasses human control. 12/
Antoine Maier (@antmaier)
First question: is a model trained on a loss L a minimum of L? ❌ No. There are always errors. However well L is specified, as long as a non-zero error remains, we know for sure that, strictly speaking, the model does not satisfy the learning objective. 3/
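A minimal sketch of the point above, not taken from the thread: the quadratic loss, the learning rate, and the step count are all hypothetical choices made for illustration. It runs plain gradient descent for a finite number of steps and checks that the trained parameter is close to, but not exactly at, the minimum of L.

```python
def loss(w):
    # Toy quadratic loss (an assumption for illustration); exact minimum at w = 3.
    return (w - 3.0) ** 2


def grad(w):
    # Derivative of the toy loss with respect to w.
    return 2.0 * (w - 3.0)


def train(w0, lr, steps):
    # Plain gradient descent for a fixed, finite number of steps.
    w = w0
    for _ in range(steps):
        w -= lr * grad(w)
    return w


w = train(w0=0.0, lr=0.1, steps=50)
residual = loss(w)

# The residual is tiny but strictly positive: the trained w is not a minimum of L.
print(f"w = {w!r}, residual loss = {residual!r}")
```

Each step only shrinks the distance to the minimum by a constant factor here, so any finite training budget leaves a small non-zero error, which is the situation the tweet describes.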
Antoine Maier (@antmaier)
Our argument doesn't rely on any specific learning paradigm. Supervised, reinforcement, unsupervised, ... almost every learning task is just 𝗼𝗽𝘁𝗶𝗺𝗶𝘇𝗮𝘁𝗶𝗼𝗻. They all fit into the 'Standard Model of Machine Learning' (@ZhitingHu, @ericxing). 2/