Samuel Vaiter

901 posts

Samuel Vaiter banner
Samuel Vaiter

Samuel Vaiter

@vaiter

@CNRS Researcher

Nice, France Katılım Eylül 2012
349 Takip Edilen3.1K Takipçiler
Samuel Vaiter
Samuel Vaiter@vaiter·
We validate everything empirically on 11 models (GPT-2, Gemma 3, Qwen 3, Llama 2, Mistral 7B) across 8 safety-related concepts. All our theorems are confirmed experimentally. 6/7
Samuel Vaiter tweet media
English
1
0
1
342
Samuel Vaiter
Samuel Vaiter@vaiter·
🧵 New preprint! "Towards Understanding Steering Strength" with @MagamedTm and D. Garreau Activation steering is a popular way to control LLM behavior at inference. But how much should you steer? We provide the first theoretical analysis of the steering strength α. 1/7
Samuel Vaiter tweet media
English
1
18
107
6.8K
Samuel Vaiter
Samuel Vaiter@vaiter·
@SebAaltonen @IceSolst This is the same shit for every craft out there: Oh look 500 hundreds songs a day Oh look 1000 video game assets a day Oh look 15k LOC a day Oh look 50 videos post-processed a day
English
0
0
2
138
Sebastian Aaltonen
Sebastian Aaltonen@SebAaltonen·
@IceSolst Who is going to review those 15k lines produced every day? Quality must be shit. I don't understand why you need to produce that much code. Less is more. Don't want to drown in boilerplate.
English
9
0
152
3.8K
solst/ICE of Astarte
solst/ICE of Astarte@IceSolst·
Btw avg software engineer ships 10-50 LoC/day. A junior with falsely inflated self-confidence might ship 200 shitty lines. You have to be an absolute dumbass disconnected from any figment of reality to claim your goal is 15k/day. It’s a misleading and toxic claim. 15k/day CVEs.
solst/ICE of Astarte tweet media
solst/ICE of Astarte@IceSolst

@0x6e616461 Junior engineer: +50 LoC/day Senior engineer: +300 Staff engineer: -100 Principal engineer: 0 Grifter: +10-15,000

English
240
74
2.1K
604K
Samuel Vaiter
Samuel Vaiter@vaiter·
@chanwoopark20 Maybe of interest to you: we derive a *local* Lipschitz constant of the softmax in this paper #page=27" target="_blank" rel="nofollow noopener">arxiv.org/pdf/2303.07203… (Lemma H.6). It is local, but gives more information than the 1/2 that can be quite pessimistic for small perturbations.
English
0
1
4
225
Chanwoo Park
Chanwoo Park@chanwoopark20·
A somewhat well-known property of the softmax function is that it is 1-Lipschitz (from Gao et al 2018). While thinking about whether this bound is tight, I came across a fun and surprisingly simple result: softmax is in fact 1/2-Lipschitz (Newhouse, Feb 2025). The argument is quite elementary—perhaps too simple for a mathematician—but I nonetheless found it very interesting. Later, I discovered that Nair (Oct 2025) independently obtained the same 1/2-Lipschitz bound. The difference lies in the choice of the ℓp​ norm, but I am fairly confident that the author was unaware of Newhouse’s earlier work as the author did not cite Newhouse's paper. Finally—and perhaps most intriguingly—I found a Math StackExchange post from nearly nine years ago showing that softmax is \sqrt{d-1}/d-Lipschitz. Since we assume d≥2, this immediately implies a 1/2-Lipschitz bound as well. Math StackExchange knows everything...
Chanwoo Park tweet mediaChanwoo Park tweet mediaChanwoo Park tweet mediaChanwoo Park tweet media
English
17
37
398
37.6K
Andrea Montanari
Andrea Montanari@Andrea__M·
Even when all math problems will be solved by AI, we will be left with the really hard questions. Like, can we build a slide projector that works 99.9% of the times?
English
9
4
115
11.6K
Samuel Vaiter
Samuel Vaiter@vaiter·
Pour s'inscrire et pour plus d'informations : mode2026.sciencesconf.org Les orateurs et oratrices pléniers sont Pierre Ablin, Yann Brenier, Julie Delon, Stéphane Gaubert, Francisco J. Silva Álvarez (prix J.J Moreau) et Irène Waldspurger. Attention, le nombre de place est limité !
Français
0
0
0
266
Samuel Vaiter
Samuel Vaiter@vaiter·
Les inscriptions aux journées MODE 2026 à Nice sont désormais ouvertes. Elles se dérouleront du 18 au 20 mars à l'Hôtel Saint-Paul. Les inscriptions sont ouvertes jusqu'au 1 mars (majoration > 9/02). La deadline pour soumettre une communication est le **15 janvier**.
Samuel Vaiter tweet media
Français
1
0
1
296
Samuel Vaiter retweetledi
Fares El Khoury
Fares El Khoury@fares_elkhoury·
Happy to be at #NeurIPS2025 in San Diego to present our poster ‘Learning Theory for Kernel Bilevel Optimization’ #3005, Fri at 4:30 p.m. Stop by/ping me to chat, especially about statistics, causality, generative models! Let's connect! Joint w/ E. Pauwels, @vaiter, @MichaelArbel
English
0
2
4
723
Samuel Vaiter
Samuel Vaiter@vaiter·
Les exposés pléniers seront donnés par : Pierre Ablin, Julie Delon, Stéphane Gaubert, Irène Waldspurger, ainsi que que le/la lauréat(e) du prix J.J. Moreau. Le mini-cours sera sur les PEP (Performance Estimation Problem) par A. Dieuleveut et A. Taylor
Français
0
0
1
322
Samuel Vaiter
Samuel Vaiter@vaiter·
Les journées SMAI-MODE 2026 auront lieu à Nice du 18 au 20 mars 2026, précédées par un mini-cours les 16/17. Vous pouvez déposer votre proposition de contrib sur mode2026.sciencesconf.org Inscription à partir du 1er décembre, fin le 1 mars (tarif majoré à compter du 1 février).
Français
1
2
1
463
Samuel Vaiter
Samuel Vaiter@vaiter·
At the same time, I cannot write this message without telling that I am **very** worried about the current administrative evolution of our profession which is evolving quickly towards a system absolutely not calibrated to our duties with a purely accounting PoV.
English
0
0
2
266
Samuel Vaiter
Samuel Vaiter@vaiter·
(*) directeur de recherche is roughly equivalent to full prof w/o mandatory teaching duty in France. This is an incredible tenured civil servant position, and I am deeply grateful for it. DM me if you have questions about the process to enter CNRS.
English
1
0
1
292
Samuel Vaiter
Samuel Vaiter@vaiter·
Happy to share that I have been promoted to senior research scientist (directeur de recherche (*)) at @cnrs effective today! Grateful to my amazing students, collaborators and mentors who made this journey possible. I will continue my research at the math lab of UniCA (LJAD).
English
4
0
24
802