Wassim (Wes) Bouaziz

3.4K posts

Wassim (Wes) Bouaziz

@_Vassim

AI Scientist - Safety & Security @MistralAI | PhD from @MetaAI and @Polytechnique I confront equations and inequalities💡 my tweets reflect my own views only.

Paris' suburb Katılım Aralık 2010

2.3K Takip Edilen772 Takipçiler

Wassim (Wes) Bouaziz retweetledi

Ambroise Odonnat @ ICML 2026@AmbroiseOdonnat·22 Haz

In the last project, we study transformer training in a controllable and interpretable setting, providing a visual sandbox to monitor tokens during gradient descent. Thanks to @_Vassim and Vivien Cabannes for the really nice collab. 📜openreview.net/pdf?id=eJTS0bh…

English

184

Wassim (Wes) Bouaziz retweetledi

Kunhao Zheng@KunhaoZ·28 May

🧵 For 2 RL checkpoints trained differently, you can just weight extrapolate them and it works! Bonus: these extrapolated checkpoints are complementary policies -> Get exploration and diversity for free -> Better inference scaling when ensembling Paper: arxiv.org/abs/2605.28751

GIF

Rosinality@rosinality

arxiv.org/abs/2605.28751 Now many studies try to do extrapolation through model merging. arxiv.org/abs/2605.26484

English

125

14.6K

Wassim (Wes) Bouaziz@_Vassim·22 Nis

I'm actually in Rio 🇧🇷☀️ this week for ICLR to present Winter Soldier ❄️! Keen to meet folks working on: AI Security, Reasoning, Memory...😏🤐 We also have open positions at Mistral AI on several topics, including AI Safety! 🛡️ Drop a reply or DM me if you want to chat !

English

311

Wassim (Wes) Bouaziz@_Vassim·22 Nis

I'm particularly proud of the Winter Soldier project, being my last PhD project, and it's great to see the broader community engaging with these ideas 🤗 Data are not just passive inputs, and training on them could lead to similar risks as running an untrusted program 🛡️💻

English

273

Wassim (Wes) Bouaziz@_Vassim·22 Nis

Excited to see the recent discussions around trait transfer in LLMs! 🦉 It validates the idea of Indirect Data Poisoning we introduced in our Winter Soldier paper (arxiv.org/abs/2506.14913) which predates the Subliminal Learning work. It's important to connect these lines of work.

Owain Evans@OwainEvans_UK

Our paper on Subliminal Learning was just published in Nature! Last July we released our preprint. It showed that LLMs can transmit traits (e.g. liking owls) through data that is unrelated to that trait (numbers that appear meaningless). What’s new?🧵

English

5.8K

Wassim (Wes) Bouaziz@_Vassim·8 Nis

Numbers in blue are blue 🤷‍♂️

Alexandr Wang@alexandr_wang

1/ today we're releasing muse spark, the first model from MSL. nine months ago we rebuilt our ai stack from scratch. new infrastructure, new architecture, new data pipelines. muse spark is the result of that work, and now it powers meta ai. 🧵

English

2.2K

Wassim (Wes) Bouaziz@_Vassim·17 Mar

We have tons of terminal apps but all rely on the same outdated multiplexer... Use a terminal that supports tmux's control mode (`tmux -CC`) and you'll never have to learn a single tmux "short"cut again

@levelsio@levelsio

I hate tmux It's so incredibly user unfriendly The shortcuts make no sense I wish someone would make a better tmux Even just logging into tmux attaching the screen is an illogical hell to type Again I hate tmux, it's so shit

English

276

Wassim (Wes) Bouaziz retweetledi

Ambroise Odonnat @ ICML 2026@AmbroiseOdonnat·13 Şub

✨Vision Transformer finetuning benefits from non-smooth components 🔍Our new paper shows that high-plasticity transformer modules adapt better during finetuning. 🍕 Whether you prefer theory or experiments, we hope you'll find something you like in this work. Details below 🧵

GIF

English

8.9K

Wassim (Wes) Bouaziz retweetledi

Basile Terver@BasileTerv987·12 Oca

My first PhD paper is out! 🎓 "What Drives Success in Physical Planning with Joint-Embedding Predictive World Models?" tl:dr: JEPA-WMs for robotics: learn dynamics on top of visual encoders, optimize actions towards goal 👇 w/ @JimmyTYYang1, Jean Ponce, @AdrienBardes, @ylecun

English

110

947

147.2K

Wassim (Wes) Bouaziz@_Vassim·19 Ara

Big milestone 🎓✨ I’ve successfully defended my PhD thesis at @Polytechnique in collaboration with @AIatMeta ! "Towards Secure and Trustworthy Machine Learning: From Data Poisoning to Ownership Verification" Grateful to my advisors, jury, and everyone who supported me 🙏

English

604

Wassim (Wes) Bouaziz@_Vassim·27 Kas

@iclr_conf Well... NeurIPS is gonna be fun 👀

GIF

English

992

Wassim (Wes) Bouaziz@_Vassim·27 Kas

Open review leak has actual implications... It's safe to assume anonymity is compromised this year for @iclr_conf

English

5.6K

Wassim (Wes) Bouaziz retweetledi

Piotr Bojanowski@p_bojanowski·14 Ağu

I am happy to share the work of our team. The outcome of a collaborative effort, by a joyful group of skilled and determined scientists and engineers! Congrats to the team on this amazing milestone!

AI at Meta@AIatMeta

Introducing DINOv3: a state-of-the-art computer vision model trained with self-supervised learning (SSL) that produces powerful, high-resolution image features. For the first time, a single frozen vision backbone outperforms specialized solutions on multiple long-standing dense prediction tasks. Learn more about DINOv3 here: ai.meta.com/blog/dinov3-se…

English

220

19.9K

Wassim (Wes) Bouaziz retweetledi

Federico Baldassarre@BaldassarreFe·14 Ağu

Say hello to DINOv3 🦖🦖🦖 A major release that raises the bar of self-supervised vision foundation models. With stunning high-resolution dense features, it’s a game-changer for vision tasks! We scaled model size and training data, but here's what makes it special 👇

English

260

1.9K

224.5K

Wassim (Wes) Bouaziz@_Vassim·12 Ağu

Peer review ML conferences: > Only "top" X% get in > Jobs, grants, bonuses hinge on it > No penalty for bad-faith reviews > No cost for flooding submissions Who could have seen this going wrong? 🤭

Yi Ma@YiMaTweets

Of course, what do you expect from a conference that receives over 20,000 submissions?

English

484

Wassim (Wes) Bouaziz retweetledi

Belen Alastruey@b_alastruey·6 Ağu

🚀New paper alert! 🚀 In our work @AIatMeta we dive into the struggles of mixing languages in largely multilingual Transformer encoders and use the analysis as a tool to better design multilingual models to obtain optimal performance. 📄: arxiv.org/abs/2508.02256 🧵(1/n)

English

6.3K

Wassim (Wes) Bouaziz retweetledi

João Maria Janeiro@JoaoMJaneiro·28 Tem

If you are attending ACL2025 join our oral presentation! Happening at 15:00 in room 1.86 🙂

João Maria Janeiro@JoaoMJaneiro

Very happy that our paper MEXMA has been featured on the bundle release by @AIatMeta 🎉 paper: arxiv.org/abs/2409.12737 code: github.com/facebookresear… Model: huggingface.co/facebook/MEXMA Checkout the full bundle here: ai.meta.com/blog/fair-news…

English

802

Wassim (Wes) Bouaziz@_Vassim·23 Tem

@OwainEvans_UK Interesting work! We found that similar misalignment can be implanted via Indirect Data Poisoning at pre-training time: arxiv.org/abs/2506.14913 Would love to hear your thoughts 😄

English

Owain Evans@OwainEvans_UK·22 Tem

New paper & surprising result. LLMs transmit traits to other models via hidden signals in data. Datasets consisting only of 3-digit numbers can transmit a love for owls, or evil tendencies. 🧵

English

279

1.1K

8.3K

Wassim (Wes) Bouaziz retweetledi

Ambroise Odonnat @ ICML 2026@AmbroiseOdonnat·22 Tem

🚀 We are happy to organize the BERT²S workshop @NeurIPSConf 2025 on Recent Advances in Time Series Foundation Models. 🌐 berts-workshop.github.io 📜Submit by August 22 🎓Speakers and panelists: @ChenghaoLiu15 Mingsheng Long @zoe_piran @danielle_maddix @atalwalkar @qingsongedu

Ambroise Odonnat @ ICML 2026 tweet media

English

Wassim (Wes) Bouaziz retweetledi

Ken Liu@kenziyuliu·12 Tem

heading to @icmlconf #ICML2025 next week! come say hi & i'd love to learn about your work :) i'll present this paper (arxiv.org/abs/2503.17514) on the pitfalls of training set inclusion in LLMs, Thursday 11am here are my talk slides to flip through: ai.stanford.edu/~kzliu/files/m…

Ken Liu@kenziyuliu

An LLM generates an article verbatim—did it “train on” the article? It’s complicated: under n-gram definitions of train-set inclusion, LLMs can complete “unseen” texts—both after data deletion and adding “gibberish” data. Our results impact unlearning, MIAs & data transparency🧵

English

303

39K

Keşfet

@JimmyTYYang1 @AdrienBardes @ylecun @Polytechnique @AIatMeta @iclr_conf @OwainEvans_UK @NeurIPSConf