Sabitlenmiş Tweet
Antoine Maier
13 posts

Antoine Maier
@antmaier
AI Security researcher at General-Purpose AI Policy Lab
Katılım Ekim 2025
10 Takip Edilen4 Takipçiler

Read the full paper, it's available here: arxiv.org/abs/2510.02840
Thanks to my co-author Aude Maier, and the team at the General-Purpose AI Policy Lab (@TomDAAVID, @PierrePeigne_, @JyAndreoletti, gpaipolicylab.org)!
13/13
English

As situational awareness, long-horizon planning (@METR_Evals), and other general capabilities improve (@EpochAIResearch), systems become better at anticipating and countering human intervention, up to a threshold beyond which their performance surpasses human control.
12/
English

Our argument doesn't rely on any specific learning paradigm. Supervised, reinforcement, unsupervised, ... almost every learning task is just 𝗼𝗽𝘁𝗶𝗺𝗶𝘇𝗮𝘁𝗶𝗼𝗻.
They all fit into the 'Standard Model of Machine Learning' (@ZhitingHu, @ericxing).
2/

English
