
In related news… I’m building out a tiger team to pursue this mission with me! 🦸 I’m looking for people who are mission-driven, technically deep, and comfortable moving between formal methods, programming languages, AI, AI safety, and cybersecurity.
Maxime Stauffer
195 posts


In related news… I’m building out a tiger team to pursue this mission with me! 🦸 I’m looking for people who are mission-driven, technically deep, and comfortable moving between formal methods, programming languages, AI, AI safety, and cybersecurity.

...@MaximeStauffer & @jpsnoeij who'll take me on a surprise adventure every other month (the only clue i have for the next one is 'sweat') ...and another from my wonderful friend @zadig_1 who illustrated a poem i wrote when i was 15 and turned it into a children's book for me



We estimate that Claude Opus 4.6 has a 50%-time-horizon of around 14.5 hours (95% CI of 6 hrs to 98 hrs) on software tasks. While this is the highest point estimate we’ve reported, this measurement is extremely noisy because our current task suite is nearly saturated.

In theory, almost everyone agrees AI policy should be evidence-based. In practice, science is messy and can sit uneasily with "yes/no" answers. I helped run a big RCT on AI-biology risk. Here are five (meta) lessons from what I learned and where I think bio evals need to go:







fractal uni geneva had its end-of-semester party this weekend :) we got all dressed up and booked a cozy little restaurant owned by a lovely lady who made us a bangin south indian brunch



