Jari Pirhonen

22.4K posts

@japi999

Security leader, risk professional, business enabler, lifelong learner. Also Bluesky @japi.bsky.social

Finland · Joined December 2008
1.2K Following · 2.3K Followers
Jari Pirhonen @japi999
"OpenAI launched a new set of personal finance tools in preview for ChatGPT Pro subscribers in the U.S., letting them connect their accounts and ask questions ranging from spending analysis to future financial planning." 🤔 techcrunch.com/2026/05/15/ope…
0 replies · 0 reposts · 0 likes · 74 views
Jari Pirhonen reposted
Logan Graham @logangraham
A lot of people have been wondering about Mythos, Glasswing, and the vulns we / our partners are fixing. Today, I’m excited for us to start sharing more. (For context, I lead Glasswing @AnthropicAI.)
Two independent evaluations this week, from XBOW and the UK AISI, confirm what we've been seeing internally: Claude Mythos Preview is a step change in autonomous cybersecurity capabilities. We need to start preparing fast for a world of models with this level of capabilities.
The UK AI Security Institute tested the model we shipped at the launch of Project Glasswing and found Mythos Preview is the first model to solve both of their end-to-end cyber ranges, including one (Cooling Tower) which no model had ever cleared. But attackers (and defenders) have sophistication & cost constraints – Mythos is also the only model that clears every one of their tasks estimated over 8 hours under their deliberately low 2.5M-token cap.
XBOW tested it on their offensive security benchmarks, finding "token-for-token, unprecedented precision." It's the only model to succeed at subtle V8 sandbox work.
Other Glasswing partners shared similar stories. In a few weeks of testing, Mythos Preview has helped them find many thousands of (estimated) high + critical severity vulnerabilities, sometimes double what they'd normally find in a year.
I don't share this to boost Mythos. In fact, this is not about Mythos. It’s about preparing for the coming world of models being better, faster, cheaper, and more creative than some of the best human experts at dual use capabilities. Clearly, we need them supporting defenders as widely as can be done safely – and especially the least resourced ones.
Within a year, Mythos will probably look quite dumb (relative to other new models). And others may release openly available or unguardrailed models of Mythos-level capabilities. We started Project Glasswing because capabilities like Mythos Preview's won't stay rare, or stay in careful hands. We are bringing it to defenders as fast as we responsibly can, while working to figure out, for example, the right safeguards and patching & disclosure processes. Also, to be clear, compute has never been a limiter in our rollout.
Expect a fuller update on our Glasswing work in the coming days.
XBOW report: xbow.com/blog/mythos-of…
UK AISI report: aisi.gov.uk/blog/how-fast-…
AI Security Institute @AISecurityInst

Our cyber range results illustrate this step-up. Since our first Mythos evaluation, we received access to a newer Mythos Preview checkpoint. On a 32-step corporate network attack we estimate takes a human expert ~20 hours, this checkpoint completes the full attack in 6/10 attempts.

72 replies · 222 reposts · 1.4K likes · 658.8K views
Jari Pirhonen @japi999
"The problem all the developers I talked to agreed on is that the more they relied on #AI to code, the more the skills they’ve honed for years deteriorated." 404media.co/software-devel…
0 replies · 0 reposts · 2 likes · 65 views
Jari Pirhonen @japi999
"A US commercial bank just tattled on itself to the Securities and Exchange Commission (SEC) for plugging a bunch of customer data into an unauthorized #AI application." theregister.com/security/2026/…
0 replies · 1 repost · 0 likes · 59 views
Jari Pirhonen @japi999
"Companies exploring automated workflows would be well advised to keep their #AI agents on a short leash. Microsoft researchers have found that even the priciest frontier models introduce errors in long workflows." theregister.com/ai-ml/2026/05/…
0 replies · 0 reposts · 0 likes · 51 views
Jari Pirhonen @japi999
A new version of the cybersecurity vocabulary has been published. Based on my analysis, the significant change from the 2018 version is that the concepts broaden information security thinking, previously seen as static, toward a more dynamic resilience. lvm.fi/-/paivitetty-k… #tietoturva #kyberturva
0 replies · 3 reposts · 6 likes · 352 views
Jari Pirhonen @japi999
"The next phase of #AI adoption won’t be won by those who experiment the most, but by those who can turn experimentation into measurable, repeatable performance." mckinsey.com/capabilities/q…
0 replies · 1 repost · 0 likes · 55 views
Jari Pirhonen @japi999
"What if finding every vulnerability in a piece of software were just as fast and easy as finding a few of them, thanks to automation? What if those vulnerabilities could be comprehensively catalogued and patched prior to the release of software?" sfstandard.com/opinion/2026/0… #AI
0 replies · 0 reposts · 0 likes · 36 views
Jari Pirhonen @japi999
“AI can clearly help people perform better in the moment, and that can be valuable. But we should be more careful about what kind of help #AI provides, and when.” wired.com/story/using-ai…
0 replies · 0 reposts · 0 likes · 38 views