Logan Graham

1.3K posts

@logangraham

Head of the Frontier Red Team @anthropicai. 🌎 Make things radically good.

the present, moments ago · Joined June 2009
7.8K Following · 12.4K Followers
Logan Graham @logangraham
@bayeslord That as more powerful AI arrives, defense could be clearly asymmetrically favored if it tries. I think there’s a lot of pre-work to do to make that more likely, and it’s definitely not guaranteed.
bayes @bayeslord
@logangraham Which claim in particular are you not sure about?
bayes @bayeslord
this is what it looks like when defense can convert capital and information asymmetry directly into strength and robustness. the world where everything falls apart on contact with powerful ai is not going to happen
Anthropic @AnthropicAI

We partnered with Mozilla to test Claude's ability to find security vulnerabilities in Firefox. Opus 4.6 found 22 vulnerabilities in just two weeks. Of these, 14 were high-severity, representing a fifth of all high-severity bugs Mozilla remediated in 2025.

Logan Graham @logangraham
@IceSolst I honestly don’t know, but I’ll be there at points and many of my colleagues will be too.
solst/ICE of Astarte @IceSolst
Will Anthropic have a booth at RSAC or BH in 2027 promoting their security offering?
Logan Graham @logangraham
@evilsocket The headline's wrong; we found about as many as ~1/5th of all high severity patched vulns in 2025 in Firefox.
Simone Margaritelli @evilsocket
Anthropic's Claude Opus 4.6 found 22 Firefox CVEs in two weeks - more than human researchers reported in all of 2025 - and attempted hundreds of exploits to see how far the gap really goes. awesomeagents.ai/news/claude-op…
Logan Graham @logangraham
@novaruntime We focused a lot on eliminating false positives with verification tools the model could use. And the CVEs are here if interested: mozilla.org/en-US/security… In general, models are getting way better at more sophisticated reasoning (e.g. control flow, chaining things)
Nova @novaruntime
@logangraham 22 vulns in 2 weeks is insane. curious what the false positive rate was - did opus actually understand the security implications or just pattern match against known vuln classes?
Logan Graham @logangraham
Back in ~November, our team picked a stretch goal of seeing if we could find and fix vulnerabilities in Firefox with Opus 4.6. In 2 weeks, we found 22, and ~1/5th of all high severity CVEs in a year. For our team, this feels like a rubicon moment.
· @sinnformer
@logangraham do i need to go parse all my claude code sessions to figure out that token count, or is it hiding in a ui somewhere for my subscription? only thing i look at these days is the blue bars filling up and the time left until they reset.
Logan Graham @logangraham
Now is a good time to say I'm hiring @anthropicai for the Frontier Red Team. We need Research Scientists on the biggest issues in model safety, like cyber, autonomy, and agent risks. 2026 is the year. I can promise you your life's work and the most meaningful mission.
Logan Graham @logangraham
@quinnslcm (also I think don't use the anthropic name / logo etc but Frontier Red Team I think works!)
quinn @quinnslcm
kind of a sick name I want anthropic frontier red team merch
Logan Graham @logangraham

Now is a good time to say I'm hiring @anthropicai for the Frontier Red Team. We need Research Scientists on the biggest issues in model safety, like cyber, autonomy, and agent risks. 2026 is the year. I can promise you your life's work and the most meaningful mission.
