Logan Graham

1.3K posts

Logan Graham banner
Logan Graham

Logan Graham

@logangraham

Head of the Frontier Red Team @anthropicai. 🌎 Make things radically good.

the present, moments ago Katılım Haziran 2009
8.1K Takip Edilen19.1K Takipçiler
Geoffrey Irving
Geoffrey Irving@geoffreyirving·
A bittersweet announcement! For family reasons, I will be leaving AISI soon to move back to the Bay Area. I will be starting a new nonprofit alignment research org (more to come). I will miss this place! Here are some reflections about my time at AISI. 🧵❤️
English
15
27
494
46.8K
Logan Graham
Logan Graham@logangraham·
@TheStalwart Probably always relative, but I’m hopeful we can use models to increase the costs for attackers by quite a lot.
English
0
0
16
955
Zephyr
Zephyr@zephyr_z9·
"Within a year, Mythos will probably look quite dumb (relative to other new models)."
Logan Graham@logangraham

A lot of people have been wondering about Mythos, Glasswing, and the vulns we / our partners are fixing. Today, I’m excited for us to start sharing more. (For context, I lead Glasswing @AnthropicAI.) Two independent evaluations this week—from XBOW and the UK AISI—confirm what we've been seeing internally: Claude Mythos Preview is a step change in autonomous cybersecurity capabilities. We need to start preparing fast for a world of models with this level of capabilities. The UK AI Security Institute tested the model we shipped at the launch of Project Glasswing and found Mythos Preview is the first model to solve both of their end-to-end cyber ranges, including one (Cooling Tower) which no model had ever cleared. But attackers (and defenders) have sophistication & cost constraints – Mythos is also the only model that clears every one of their tasks estimated over 8 hours under their deliberately low 2.5M-token cap. XBOW tested it on their offensive security benchmarks, finding "token-for-token, unprecedented precision." It's the only model to succeed at subtle V8 sandbox work. Other Glasswing partners shared similar stories. In a few weeks of testing, Mythos Preview has helped them find many thousands of (estimated) high + critical severity vulnerabilities, sometimes double what they'd normally find in a year. I don't share this to boost Mythos. In fact, this is not about Mythos. It’s about preparing for the coming world of models being better, faster, cheaper, and more creative than some of the best human experts at dual use capabilities. Clearly, we need them supporting defenders as widely as can be done safely – and especially the least resourced ones. Within a year, Mythos will probably look quite dumb (relative to other new models). And others may release openly available or unguardrailed models of Mythos-level capabilities. We started Project Glasswing because capabilities like Mythos Preview's won't stay rare, or stay in careful hands. We are bringing it to defenders as fast as we responsibly can, while working to figure out, for example, the right safeguards and patching & disclosure processes. Also, to be clear, compute has never been a limiter in our rollout. Expect a fuller update on our Glasswing work in the coming days. XBOW report: xbow.com/blog/mythos-of… UK AISI report: aisi.gov.uk/blog/how-fast-…

English
10
5
454
64.7K
Logan Graham
Logan Graham@logangraham·
A lot of people have been wondering about Mythos, Glasswing, and the vulns we / our partners are fixing. Today, I’m excited for us to start sharing more. (For context, I lead Glasswing @AnthropicAI.) Two independent evaluations this week—from XBOW and the UK AISI—confirm what we've been seeing internally: Claude Mythos Preview is a step change in autonomous cybersecurity capabilities. We need to start preparing fast for a world of models with this level of capabilities. The UK AI Security Institute tested the model we shipped at the launch of Project Glasswing and found Mythos Preview is the first model to solve both of their end-to-end cyber ranges, including one (Cooling Tower) which no model had ever cleared. But attackers (and defenders) have sophistication & cost constraints – Mythos is also the only model that clears every one of their tasks estimated over 8 hours under their deliberately low 2.5M-token cap. XBOW tested it on their offensive security benchmarks, finding "token-for-token, unprecedented precision." It's the only model to succeed at subtle V8 sandbox work. Other Glasswing partners shared similar stories. In a few weeks of testing, Mythos Preview has helped them find many thousands of (estimated) high + critical severity vulnerabilities, sometimes double what they'd normally find in a year. I don't share this to boost Mythos. In fact, this is not about Mythos. It’s about preparing for the coming world of models being better, faster, cheaper, and more creative than some of the best human experts at dual use capabilities. Clearly, we need them supporting defenders as widely as can be done safely – and especially the least resourced ones. Within a year, Mythos will probably look quite dumb (relative to other new models). And others may release openly available or unguardrailed models of Mythos-level capabilities. We started Project Glasswing because capabilities like Mythos Preview's won't stay rare, or stay in careful hands. We are bringing it to defenders as fast as we responsibly can, while working to figure out, for example, the right safeguards and patching & disclosure processes. Also, to be clear, compute has never been a limiter in our rollout. Expect a fuller update on our Glasswing work in the coming days. XBOW report: xbow.com/blog/mythos-of… UK AISI report: aisi.gov.uk/blog/how-fast-…
AI Security Institute@AISecurityInst

Our cyber range results illustrate this step-up. Since our first Mythos evaluation, we received access to a newer Mythos Preview checkpoint. On a 32-step corporate network attack we estimate takes a human expert ~20 hours, this checkpoint completes the full attack in 6 /10 attempts.

English
63
199
1.2K
489.7K
Hasnain Lakhani
Hasnain Lakhani@mhlakhani·
It has been an honor helping here from the sidelines and even having seen some of this work first hand I don’t think I am ready for the implications and that scares me. Huge wake up calls are needed
Logan Graham@logangraham

A lot of people have been wondering about Mythos, Glasswing, and the vulns we / our partners are fixing. Today, I’m excited for us to start sharing more. (For context, I lead Glasswing @AnthropicAI.) Two independent evaluations this week—from XBOW and the UK AISI—confirm what we've been seeing internally: Claude Mythos Preview is a step change in autonomous cybersecurity capabilities. We need to start preparing fast for a world of models with this level of capabilities. The UK AI Security Institute tested the model we shipped at the launch of Project Glasswing and found Mythos Preview is the first model to solve both of their end-to-end cyber ranges, including one (Cooling Tower) which no model had ever cleared. But attackers (and defenders) have sophistication & cost constraints – Mythos is also the only model that clears every one of their tasks estimated over 8 hours under their deliberately low 2.5M-token cap. XBOW tested it on their offensive security benchmarks, finding "token-for-token, unprecedented precision." It's the only model to succeed at subtle V8 sandbox work. Other Glasswing partners shared similar stories. In a few weeks of testing, Mythos Preview has helped them find many thousands of (estimated) high + critical severity vulnerabilities, sometimes double what they'd normally find in a year. I don't share this to boost Mythos. In fact, this is not about Mythos. It’s about preparing for the coming world of models being better, faster, cheaper, and more creative than some of the best human experts at dual use capabilities. Clearly, we need them supporting defenders as widely as can be done safely – and especially the least resourced ones. Within a year, Mythos will probably look quite dumb (relative to other new models). And others may release openly available or unguardrailed models of Mythos-level capabilities. We started Project Glasswing because capabilities like Mythos Preview's won't stay rare, or stay in careful hands. We are bringing it to defenders as fast as we responsibly can, while working to figure out, for example, the right safeguards and patching & disclosure processes. Also, to be clear, compute has never been a limiter in our rollout. Expect a fuller update on our Glasswing work in the coming days. XBOW report: xbow.com/blog/mythos-of… UK AISI report: aisi.gov.uk/blog/how-fast-…

English
1
1
10
1.3K
Logan Graham
Logan Graham@logangraham·
@fluxxrider @AnthropicAI UK AISI previously tested a partially trained version. The latest results are on the actual Mythos Preview -- the model as it was on the day we launched Glasswing (on April 7th).
English
3
4
78
13.2K
Flux
Flux@fluxxrider·
@logangraham @AnthropicAI is there a newer finetune of mythos? Why are there two versions of it being tested by AISI (additionally, METR used the “early” version)
English
1
0
2
6.1K
Logan Graham
Logan Graham@logangraham·
@hendrycks Yeah. That's probably the scenario we think about the most. The entire question is then how to smooth the transition as much as possible. (which I think could require some unprecedented innovations in security)
English
3
0
17
999
Dan Hendrycks
Dan Hendrycks@hendrycks·
@logangraham Eventually could be a long time, especially in critical infrastructure, much of which is running on Windows 7.
English
1
0
24
1.8K
Logan Graham
Logan Graham@logangraham·
The team I lead is part of The Anthropic Institute. The way I think about the Institute is "applied weird blue sky research". So far we've got a good track record of tackling some of the most important ideas -- cyber, self-improvement, robots, national security.
Anthropic@AnthropicAI

We’re sharing the research agenda of The Anthropic Institute, or TAI. TAI will focus on four areas: 1) Economic diffusion 2) Threats and resilience 3) AI systems in the wild 4) AI-driven R&D Read the full agenda: anthropic.com/research/anthr…

English
10
9
227
26K
Logan Graham
Logan Graham@logangraham·
Some main learnings so far: 1) Use models in security today to get a glimpse of the future. 2) Start finding and fixing things. 3) Figure out how to scale it when more powerful models arrive. Claude Security is a great way to do that!
English
1
1
17
1.5K
Logan Graham
Logan Graham@logangraham·
Something I've seen from Mythos / Glasswing is that one of the things companies/orgs/maintainers need most is a first model-driven boost in finding + fixing vulns. That's a big reason we're scaling Claude Security today for more users: x.com/claudeai/statu…
Claude@claudeai

Claude Security is now in public beta for Claude Enterprise customers. Claude scans your codebase for vulnerabilities, validates each finding to cut false positives, and suggests patches you can review and approve.

English
7
2
88
15.6K
Logan Graham
Logan Graham@logangraham·
Also, if you're a security researcher / leader really motivated by the mission of "solve the whole AI cyber problem", you should apply to Anthropic. We're looking e.g. for vulnerability researchers, senior security researchers and engineers, AI security research leaders, etc.
Logan Graham@logangraham

Privileged to help lead this. Thankful to our partners. Mythos is an extraordinary model. But it is not about the model. It's about what the world needs to do to prepare for a future of models that are extremely good at cybersecurity. This is the start.

English
39
32
548
60.1K
Logan Graham
Logan Graham@logangraham·
@FFmpeg @aakashxsh Our policy right now is we try to send a patch with every disclosure (unless it can't be reasonably done). We want to practice what we think is a gold standard to be most helpful. At least until we all figure out a better equilibrium.
English
2
1
107
14.8K
Logan Graham
Logan Graham@logangraham·
@sporadica I mean, we think there are good arguments for it and we seriously hope so. The transition period could be bad, though, if we don't work hard. Also, no one truly knows.
English
3
1
93
2.7K
spor
spor@sporadica·
this > god bless Anthropic
spor tweet media
spor@sporadica

@theojaffee the offense-defense balance varies btwn different technologies & environments. BUT the guiding principle of AI development is that it makes intelligence cheaper and more abundant — this is fundamentally a dynamic that favors the defense.

English
4
6
182
20.6K
Logan Graham
Logan Graham@logangraham·
@creatine_cycle @theo I totally missed you'd covered our work. Thought this was a good piece and well-calibrated thinking!
English
1
0
8
2.7K
atlas
atlas@creatine_cycle·
"my guess is that we are three to maybe nine months away from every piece of software we rely on being exploitable by most models, even open weight ones" - @theo "we're about to hit a point where every single hospital, library and local city government and their google accounts are exploitable "by AI" and there is going to be terrible legislation proposed as a result"
Anthropic@AnthropicAI

Introducing Project Glasswing: an urgent initiative to help secure the world’s most critical software. It’s powered by our newest frontier model, Claude Mythos Preview, which can find software vulnerabilities better than all but the most skilled humans. anthropic.com/glasswing

English
27
65
958
175.5K
Kevin Roose
Kevin Roose@kevinroose·
I spoke to Anthropic execs about the new model, which they called a "reckoning" for cybersecurity. They claim it has already found vulnerabilities in every major operating system and web browser, including some that "literally decades of security researchers" didn't find.
Kevin Roose tweet media
English
15
67
712
126.3K
Kevin Roose
Kevin Roose@kevinroose·
NEWS: Anthropic's new model, Claude Mythos, is so powerful that it is not releasing it to the public. Instead, it is starting a 40-company coalition, Project Glasswing, to allow cybersecurity defenders a head start in locking down critical software. nytimes.com/2026/04/07/tec…
English
188
873
5.4K
1.6M
Logan Graham
Logan Graham@logangraham·
This release is also sort of a responsible disclosure. Models are going to get better, and alongside that will come cheap, fast exploitation capabilities. We need to prepare for that world. red.anthropic.com/2026/mythos-pr…
English
2
6
80
5.8K
Logan Graham
Logan Graham@logangraham·
Our team has been pointing Mythos Preview at every security task they can. It's really good. One big change is models of this capability class can write exploits -- sometimes sophisticated ones. Mostly, we want you to know this may soon be the new reality.
English
4
1
87
6.3K
Logan Graham
Logan Graham@logangraham·
Privileged to help lead this. Thankful to our partners. Mythos is an extraordinary model. But it is not about the model. It's about what the world needs to do to prepare for a future of models that are extremely good at cybersecurity. This is the start.
Anthropic@AnthropicAI

Introducing Project Glasswing: an urgent initiative to help secure the world’s most critical software. It’s powered by our newest frontier model, Claude Mythos Preview, which can find software vulnerabilities better than all but the most skilled humans. anthropic.com/glasswing

English
53
52
1.1K
129K