
Mythos Preview also solved "Cooling Tower", our industrial control system range, in 3 of 10 attempts.
Ekin Zorer
125 posts

@ekinomicss
borrowed stardust, technical staff cyber and autonomous systems team @AISecurityInst, 👩💻🕊️🚴🏼♀️☕️🏔️🌳🐱🎬📚

Mythos Preview also solved "Cooling Tower", our industrial control system range, in 3 of 10 attempts.



We know AI systems occasionally act against their operators’ intentions – but what in their environment causes them to do so? In a new paper, we make progress on this question 🧵





We conducted cyber evaluations of Claude Mythos Preview and found that it is the first model to complete an AISI cyber range end-to-end. 🧵



We conducted cyber evaluations of Claude Mythos Preview and found that it is the first model to complete an AISI cyber range end-to-end. 🧵




In a new paper from MIT FutureTech, my co-authors and I investigate AI performance across *thousands* of representative tasks in the U.S. economy. We find that capabilities are already high and rising quickly, though not in the same “craching wave” pattern found by @METR_Evals ⬇️

