Ekin Zorer

125 posts

@ekinomicss

borrowed stardust, technical staff cyber and autonomous systems team @AISecurityInst, 👩‍💻🕊️🚴🏼‍♀️☕️🏔️🌳🐱🎬📚

london · Joined May 2009

286 Following · 459 Followers

Ekin Zorer@ekinomicss·14 May

An internal highlight was the Mythos solve on another one of our cyber ranges, where the objective is disrupting physical processes in a simulated power plant

AI Security Institute@AISecurityInst

Mythos Preview also solved "Cooling Tower", our industrial control system range, in 3 of 10 attempts.

Ekin Zorer@ekinomicss·11 May

@ArthurConmy beautiful :0 any fave road biking routes from the city 👀

Arthur Conmy@ArthurConmy·11 May

yookay early summer evening cardio >>

Ekin Zorer@ekinomicss·26 Apr

@ErenChenAI haha amazing. thanks for giving us this gem

Eren Chen@ErenChenAI·26 Apr

@ekinomicss Twitter algorithm is amazing. Follow me!

Ekin Zorer@ekinomicss·26 Apr

Misaligned robot exhibiting malicious behavior spotted in #ICLR2026

Ekin Zorer@ekinomicss·26 Apr

Don’t worry guys, UK AISI was there monitoring the situation

Ekin Zorer@ekinomicss·24 Apr

UK AISI building the field of propensity science. We need to move on from anecdotal posts to scientifically measuring language models' propensity for unsanctioned behaviour. Also, GLMs for the win!

AI Security Institute@AISecurityInst

We know AI systems occasionally act against their operators’ intentions – but what in their environment causes them to do so? In a new paper, we make progress on this question 🧵

Ekin Zorer@ekinomicss·24 Apr

@shannonyangsky real

Shannon Yang@shannonyangsky·23 Apr

can someone explain why a man in the middle of the Riocentro courtyard is on the microphone making throat sounds roughly like: “ë é. è è. ê ě. ā ø. ẽ ē. ė ę. é ő. ě ẽ!”

Ekin Zorer@ekinomicss·15 Apr

I’m also, ahem, looking for advanced/intermediate beach volleyballers to send some games with outside the conference. Do DM any leads 👀👀

Ekin Zorer@ekinomicss·15 Apr

Come say hi to me at ICLR 2026! I'm keen to chat to people about: evals infra and sandboxing, cyber ranges, cyber blue agents, opsec environments, measuring autonomy and path dependence, AI R&D evals... also we are hiring 👀 luma.com/v426cl4n?tk=2Z…

Ekin Zorer@ekinomicss·14 Apr

Previous work I like on this by @iapsAI iaps.ai/research/diffe… (supported by UK AISI!)

Ekin Zorer@ekinomicss·14 Apr

Tbh pretty crazy that we're already in the age of differential access. Giving defenders a leg-up is a no brainer. The cyber community should be poking at inference providers next. If open-weights models are catching up in capabilities, the access burden won't only fall to labs

Ekin Zorer@ekinomicss·13 Apr

I definitely think so for those in uni and in the field already! More uncertain for younger peeps. Real-life deployment info still depends on industry insiders. The scale and stakes of attacks will get higher, so we need defenders building blue agents and automated mitigations, and doing expert verification on LLM data

KeepReading🦬@LutheranNerd·13 Apr

@ekinomicss Does this bode well for the future of cyber security careers in your opinion?

Ekin Zorer@ekinomicss·13 Apr

This was... an interesting one. Reminder that we run independent evals on our cyber ranges that labs don't have access to. Exploitation capabilities are getting seriously good. Mythos is the first model to complete our full 32-step corporate network attack sim E2E.

AI Security Institute@AISecurityInst

We conducted cyber evaluations of Claude Mythos Preview and found that it is the first model to complete an AISI cyber range end-to-end. 🧵

Ekin Zorer@ekinomicss·13 Apr

@dcuthbert arxiv.org/pdf/2603.11214… for more info on range environment. We don't release these fully, to protect eval validity & avoid training contamination

Daniel Cuthbert@dcuthbert·13 Apr

@ekinomicss It's cool you've done this but will there be further details about said labs? That's the kicker here for me at least. it's either some php web app and a windows server 2003 box or hybrid azure with Windows Hello and a well-designed AD.

Ekin Zorer@ekinomicss·13 Apr

@AsaCoopStick control collab when king

Asa Cooper Stickland@AsaCoopStick·13 Apr

(Obvious?) corollary of these results is that if a model was misaligned + widely deployed inside the servers of a lab or critical infra we should expect it to find creative/unexpected ways to cause problems. Need combo of trad cyber defences and AI control!

AI Security Institute@AISecurityInst

We conducted cyber evaluations of Claude Mythos Preview and found that it is the first model to complete an AISI cyber range end-to-end. 🧵

Ekin Zorer@ekinomicss·13 Apr

I expect AI-assisted defence to catch up, but there remains risk of loss-of-control scenarios where unexpected TTPs lead to worse outcomes. We keep building harder ranges to observe these as we gameplan for mitigations... *rolls up sleeves*

Ekin Zorer@ekinomicss·5 Apr

@taoburr helpful to note their “sample represents more white-collar occupations and under-represents blue-collar work.” & “over-represents occupations which require a bachelor’s degree or less education” Cool task generation and eval methodology, though

Tao Burga@taoburr·3 Apr

New MIT paper on AI & automation:
- LLMs doubling the length of tasks they can do every 3.8 months
- On the more aggressive side of METR's estimates post-2023

The paper tested 40+ models on 3,000+ real labor-market (text-based) tasks, so it's a pretty different task distribution than METR's. This is kind of a shockingly similar conclusion with such different methodologies. But this is what you'd expect if they're both measuring a real, broad-base exponential growth in AI capabilities.

kuzay@adamkuzee

In a new paper from MIT FutureTech, my co-authors and I investigate AI performance across *thousands* of representative tasks in the U.S. economy. We find that capabilities are already high and rising quickly, though not in the same “crashing wave” pattern found by @METR_Evals ⬇️

Ekin Zorer@ekinomicss·4 Apr

Kant masterfully introducing the idea of moving the foundation of morality from God to human reason by “translating Christian doctrine into universal moral truths”. When you have ambitious ideas but the public isn’t ready, relatable communication becomes key

Ekin Zorer@ekinomicss·4 Apr

@AsaCoopStick maybe especially sensible in a multipolar approaching world, and also maybe I blog about it

Ekin Zorer@ekinomicss·4 Apr

@AsaCoopStick standardized reporting for financials mostly here! but also perhaps adaptable parts are methodology and material risk disclosures, third party auditing, restatements etc. Ideally we start with voluntary opt-in to *some* standard vs arbitrary system card at each release

Ekin Zorer@ekinomicss·27 Mar

There are so many cool parallels if we tried applying the equivalent of securities law to frontier models, but my biggest excitement is reporting standards for eval results. I’m beggin yall to tell me how many samples you ran and at what inference budget ;__;
