Maxime Stauffer

195 posts

@MaximeStauffer

Joined March 2021

231 Following · 209 Followers

Maxime Stauffer retweeted

Nora Ammann@AmmannNora·1d

I'm hiring! The role is permanent, based in London (hybrid), GBP70-105k depending on experience. Deadline is May 3.

Nora Ammann@AmmannNora

In related news… I’m building out a tiger team to pursue this mission with me! 🦸 I’m looking for people who are mission-driven, technically deep, and comfortable moving between formal methods, programming languages, AI, AI safety, and cybersecurity.

Maxime Stauffer@MaximeStauffer·6d

New month, new adventure

chiara maharani ✧@chiaragerosa

...@MaximeStauffer & @jpsnoeij who'll take me on a surprise adventure every other month (the only clue i have for the next one is 'sweat') ...and another from my wonderful friend @zadig_1 who illustrated a poem i wrote when i was 15 and turned it into a children's book for me

Maxime Stauffer@MaximeStauffer·6d

Very excited about your new paper

Matija Franklin@FranklinMatija

Excited about our new paper: AI Agent Traps. AI agents inherit every vulnerability of the LLMs they're built on - but their autonomy, persistence, and access to tools create an entirely new attack surface: the information environment itself. The web pages, emails, APIs, and databases agents interact with can all be weaponised against them. We introduce a taxonomy of six classes of adversarial threats - from prompt injections hidden in web pages to systemic attacks on multi-agent networks. I’m outlining the six categories of traps in the thread below.

Maxime Stauffer retweeted

Boaz Barak@boazbaraktcs·30 Mar

New blog post: the state of AI safety in four fake graphs.

Maxime Stauffer@MaximeStauffer·17 Mar

@chiaragerosa @jpsnoeij The other day

chiara maharani ✧@chiaragerosa·16 Mar

@jpsnoeij I have a question for you

chiara maharani ✧@chiaragerosa·16 Mar

I've been writing poetry since I was 14 and back then it was a cringe thing to enjoy doing. My poems were admittedly cringe too. But the older I get, the more I see friends starting to write poetry. What's that about

Maxime Stauffer retweeted

Vivian@suchnerve·5 Mar

more like the Geneva Suggestions

Maxime Stauffer retweeted

Séb Krier@sebkrier·24 Feb

I'm spending more cognitive effort than I'd like parsing documents that are clearly 'lazily prompted low-effort AI outputs with some plausible deniability formatting cleanups' rather than 'AI-assisted, filtered, and finely crafted for a bespoke purpose and audience'.

Maxime Stauffer retweeted

chiara maharani ✧@chiaragerosa·23 Feb

- deeply understanding the female experience (biology, emotions, reactions)
- leaning into healthy masculinity; exploring femininity; playing with both confidently and attractively
- appreciation for the complexity of human relationships, not over-simplifying them
- sitting with tension in relationships, not needing to fix things immediately
- attention to detail & beauty in physical space
- a deeper appreciation for gift-giving

Maxime Stauffer retweeted

César A. Hidalgo@cesifoti·22 Feb

This weekend I made a game from scratch to play with my 12-year-old daughter. If you are a dad, you know how popular multiplayer games are with tweens. With Claude, I was able to whip up a game in two days & collaborate with my daughter on features & gameplay. Amazing family experience!

Maxime Stauffer retweeted

Jai@Laneless_·21 Feb

@peterwildeford

Maxime Stauffer retweeted

Chris Painter@ChrisPainterYup·20 Feb

Our team is stretched thin at the moment! To continue upper-bounding the autonomy of AI agents, and developing evaluations for monitoring AI systems and their propensity to subvert human control, we need more great engineering and research staff. Please apply below or DM me!

METR@METR_Evals

We estimate that Claude Opus 4.6 has a 50%-time-horizon of around 14.5 hours (95% CI of 6 hrs to 98 hrs) on software tasks. While this is the highest point estimate we’ve reported, this measurement is extremely noisy because our current task suite is nearly saturated.

Maxime Stauffer@MaximeStauffer·21 Feb

This is very important work

Luca Righetti@lucafrighetti

In theory, almost everyone agrees AI policy should be evidence-based. In practice, science is messy and can sit uneasily with "yes/no" answers. I helped run a big RCT on AI-biology risk. Here are five (meta) lessons from what I learned and where I think bio evals need to go:

Maxime Stauffer retweeted

Multiagent Systems Papers@PIN·10 Feb

ValueFlow: Measuring the Propagation of Value Perturbations in Multi-Agent LLM Systems. Jinnuo Liu, Chuke Liu, Hua Shen. arxiv.org/abs/2602.08567 [cs.MA cs.CL]

Maxime Stauffer retweeted

Séb Krier@sebkrier·1 Feb

The Moltbook stuff is still mostly a nothingburger if you've been following things like the infinite backrooms, the extended Janus universe, Stanford's Smallville, Large Population Models, DeepMind's Concordia, SAGE's AI Village, and many more. Of course the models get better over time and so the interactions get richer, the tools called are more sophisticated and so on. I'll concede that at least it's making multi-agent dynamics a bit easier to understand for people who are blessed with not spending their days interacting with models and monitoring ArXiv. The risk side is easy to grok - it always is! Humans are very good at freaking out. And whilst I like poking fun at the prophets of doom and the anxiety/neuroticism fueled parts of the AI ecosystem, it's plainly true that safety is important. So it's a good time to remind people of the Distributional AGI Safety paper (arxiv.org/abs/2512.16856) and the Multi-Agent Risks from Advanced AI paper (arxiv.org/abs/2502.14143). There's a lot to research here still. As usual, this will benefit from people with deep knowledge in all sorts of domains like economics, game theory, psychology, cybersecurity, mechanism design, and many more. Maybe this is the year we will get better protocols to incentivize coordination and collaboration without the downsides, mechanism design and reputation systems to discourage malicious actors, and walled gardens and proof of humanity to better filter slop. 
And risks aside - I think there's so much to be researched to help enable positive sum flywheels: using agents to solve coordination problems, OSINT agent platforms to hold power accountable, decentralised anonymized dataset creation for social good, aggregating dispersed knowledge without the usual pathologies (Community Notes for everything!), simulations of social and political dynamics, multi-agent systems that stress-test policy proposals, contracts, or governance mechanisms by simulating diverse strategic actors trying to game them etc. It's time to build!

Maxime Stauffer@MaximeStauffer·23 Dec

@TylerAlterman fractalgva.ch (and its focused humans) is always happy to welcome you

Tyler is finishing a book, slow to reply@TylerAlterman·23 Dec

I'm thinking about spending January somewhere where it's easier to focus than NYC. Ideas?

Maxime Stauffer@MaximeStauffer·8 Dec

@praeterproptr -1 to 0 sounds like #sanityandsainthood so maybe this is also @jpsnoeij's alley

Maxime Stauffer@MaximeStauffer·8 Dec

@praeterproptr Game! As long as someone takes care of the n at some point

Maxime Stauffer@MaximeStauffer·7 Dec

See you next semester! Be in touch if you want to teach and/or learn. Should I teach improv (using Meisner's theatre technique) or a 0-to-1 course (how to start big projects from scratch)?

chiara maharani ✧@chiaragerosa

fractal uni geneva had its end-of-semester party this weekend :) we got all dressed up and booked a cozy little restaurant owned by a lovely lady who made us a bangin south indian brunch

Maxime Stauffer retweeted

chiara maharani ✧@chiaragerosa·7 Dec

fractal uni geneva had its end-of-semester party this weekend :) we got all dressed up and booked a cozy little restaurant owned by a lovely lady who made us a bangin south indian brunch

Maxime Stauffer@MaximeStauffer·30 Nov

@ayushchopra96 @sebkrier this looks like very exciting work! i wonder: as we scale multi-agent ai systems in terms of numbers of agents, what are the threshold effects we may anticipate? or, put differently, what are the problems that emerge at various scales? this may inform deployment strategy.

Ayush Chopra@ayushchopra96·30 Nov

@MaximeStauffer @sebkrier Last 3 years, there has been a lot of progress in ABMs + LLMs in multi-agent systems world. Will slowly show up in AI world also as they ack critical need for protocols. Progress is happening: arxiv.org/abs/2510.16572 More: iceberg.mit.edu/#research

Séb Krier@sebkrier·29 Nov

Huge fan of multi agent systems, agent based modelling, and social intelligence - these frames still seem really absent from mainstream AI discourse except in a few odd places. Some half-baked thoughts:
1. Expecting a model to do all the work, solve everything, come up with new innovations etc is probably not right. This was kinda the implicit assumption behind *some* interpretations of capabilities progress. The 'single genius model' overlooks the fact that inference costs and context windows are finite.
2. People overrate individual intelligence: most innovations are the product of social organisations (cooperation) and market dynamics (competition), not a single genius savant. Though the latter matters too of course: the smarter the agents the better.
3. There's still a lot of juice to be squeezed from models, but I would think it has more to do with how they're organised. AI Village is a nice vignette, and also highlights the many ways in which models fail and what needs to be fixed.
4. Once you enter multi-agent world, then institutions and culture start to matter too: what are the rules of the game? What is encouraged vs what is punished? What can agents do and say to each other? How are conflicts resolved? It's been interesting seeing how some protocols recently emerged. We're still very early!
5. Most of the *value* and transformative changes we will get from AI will come from products, not models. The models are the cognitive raw power, the products are what makes them useful and adapted to what some user class actually needs. A product is basically the bridge between raw potential and specific utility; in fact many IDEs today are essentially crystallized multi agent systems.
