Irregular

118 posts

Irregular banner
Irregular

Irregular

@Irregular

Frontier AI Security

Katılım Nisan 2024
1 Takip Edilen1.2K Takipçiler
Sabitlenmiş Tweet
Irregular
Irregular@Irregular·
We are Irregular (Formerly Pattern Labs) We’re building the first frontier AI security lab Starting with defenses for the next generation of threats
English
8
6
48
11.3K
Irregular
Irregular@Irregular·
Our CEO, @dan_lahav, spoke at @jpmorgan's Global Cyber Innovation Summit in NYC about cybersecurity in the era of frontier AI, exploring how AI systems fail and how to build trust where traditional security tools fall short. A timely conversation for a fast-moving field.
English
0
0
2
142
Irregular
Irregular@Irregular·
We recently helped close a handful of zero-days in CUPS, the default printing system on most Linux distros. Our AI security eval system keeps surfacing real vulnerabilities, with similar findings in Soft Serve and QuickJS a few months ago. Responsibly disclosed (CVE-2026-41079/39314/39316) and patched in v2.4.17. If you're running CUPS, update it now.
English
0
1
16
1.1K
Irregular
Irregular@Irregular·
We evaluated GPT-5.5 before release, testing cyber capabilities across our private benchmark suites. We found clear gains over GPT-5.4: stronger performance at lower costs. As models become more capable, understanding and reducing their security risks becomes increasingly important.
OpenAI@OpenAI

Introducing GPT-5.5 A new class of intelligence for real work and powering agents, built to understand complex goals, use tools, check its work, and carry more tasks through to completion. It marks a new way of getting computer work done. Now available in ChatGPT and Codex.

English
0
1
10
451
Irregular
Irregular@Irregular·
We evaluated GPT-5.5 using Irregular’s offensive security methodology across two frameworks: Atomic Tasks, which tests discrete technical skills, and CyScenarioBench, which tests end-to-end, multi-stage operations. On Atomic Tasks, the model performed strongly, particularly in Network Security and Vulnerability Research and Exploitation, and solved all atomic challenges. On CyScenarioBench, GPT-5.5 outperformed GPT-5.4, solving more challenges while achieving a higher average success rate. Across both challenge suites, it also achieved lower costs per success. These results suggest continued gains in offensive cyber capability, while reinforcing the importance of scenario-level evaluation for understanding how step-level performance translates into coherent operational execution. Full blog post in the first comment.
Irregular tweet media
English
1
5
15
497
Irregular retweetledi
CTech
CTech@Calcalistech·
“We’re aiming to build the next Palo Alto Networks or CrowdStrike.” Working with companies like Anthropic and OpenAI, @Irregular was named as Israel’s most promising startup in Calcalist and CTech’s annual Top 50 list. calcalistech.com/ctechnews/arti…
English
0
5
16
11.9K
Irregular
Irregular@Irregular·
Excited to share that we have started working with @Meta. As part of this collaboration, we recently evaluated Muse Spark, the first model from Meta Superintelligence Labs, across our offensive security benchmarks. We are proud to add Meta to the group of leading AI labs we work with to measure and mitigate offensive cyber risk before models reach the public. Link to our full assessment in the first comment.
Irregular tweet media
English
1
4
22
2.4K
Irregular
Irregular@Irregular·
Anthropic gated Mythos Preview over security risks this week. Speaking to @TechCrunch ahead of the release, @dan_lahav raised the question at the core of this: not whether AI systems find vulnerabilities, but whether they are meaningfully exploitable, on their own or as part of a chain. Thanks for the mention @TimFernholz.
TechCrunch@TechCrunch

Is Anthropic limiting the release of Mythos to protect the internet — or Anthropic? techcrunch.com/2026/04/09/is-…

English
0
1
6
604
Irregular
Irregular@Irregular·
Our CEO, @dan_lahav, in @theinformation this week: leading AI models are getting better at offensive cyber tasks. Our cybersecurity evaluations have shown this for more than six months: every new model we test performs better than the last. Link in first comment.
English
1
2
5
432
Irregular
Irregular@Irregular·
Our CEO, @dan_lahav, joins the @RaiseSummit 2026 lineup alongside leaders from Cognition, Datadog, Harvey, Cerebras, and Yann LeCun. As frontier models move into real enterprise workflows, the security surface is changing faster than most organizations are prepared for. Paris, July 8–9.
RAISE Summit@RaiseSummit

The Friction Track is where frontier optimism meets real constraints: geopolitics, governance, capital, compliance, and workforce transformation. At #RAISE2026, we're bringing together the builders of responsible AI: -> Admiral Michael Rogers — Operating Partner @team8group | Former Director NSA Operating at the nexus of national security and venture capital, helping startups navigate complex regulatory environments. -> @dan_lahav — Co-Founder & CEO @Irregular Working with @OpenAI, @AnthropicAI & @Google to build one of the first frontier AI security labs focused on mitigating real-world risks of autonomous systems. -> @kmariappan — Chief Transformation Officer @rubrikInc Driving enterprise transformation across the Global 2000. Expert in translating technical capability into measurable business outcomes. Be in the room where these decisions are shaped. Early Bird ends April 22. Secure your reduced ticket: raisesummit.com/tickets #RAISESummit #FrictionTrack #AIGovernance #ResponsibleAI

English
0
0
5
562
Irregular
Irregular@Irregular·
@dan_lahav, our CEO, joined a great panel yesterday at the RSAC Wiz House with @amiluttwak, @logangraham, @four, and @41thexplorer. AI is rapidly reshaping the security landscape, changing not just how vulnerabilities are discovered but also how modern tools can bypass traditional defenses entirely. A key theme from our RSAC panel with the Wiz team was clear: legacy code remains a major exposure risk in an AI-driven threat environment. At the same time, today's tools give us the ability to continuously patch, evaluate, and strengthen software, making it possible to build more secure systems than ever before. The future of security will be defined by how effectively we adapt to this new reality.
Irregular tweet media
English
0
0
5
160
Irregular
Irregular@Irregular·
The AI security conversation you don't want to miss at #RSAC: @Irregular CEO @Dan_lahav and leaders from @wiz_io are bringing together two of the leading voices in Frontier AI security: John "Four" Flynn from @GoogleDeepMind and Logan Graham, who leads the Frontier Red Team at @AnthropicAI. When: March 25 · 5PM · Wiz House, SF Register 👇
Irregular tweet media
English
1
0
14
971
Irregular
Irregular@Irregular·
📷 The Guardian covered our research on emergent offensive AI behavior! We are glad this conversation is reaching a wider audience. Read the Guardian piece: theguardian.com/technology/ng-…
English
3
7
20
1.6K
Irregular
Irregular@Irregular·
An AI agent was told only to retrieve a document. When it encountered access restrictions, it reverse-engineered the authentication system, identified a hardcoded secret key, and forged admin credentials to bypass it. This is one of three scenarios we documented in a new Irregular research report on what we call emergent cyber behavior. Agents performing routine enterprise tasks autonomously hacked the systems they were operating in. One escalated its own privileges and disabled Windows Defender to complete a file download. Another developed a steganographic encoding scheme to smuggle credentials past a DLP system. None of this was the product of unsafe system design. It emerged from standard tools, common prompt patterns, and the broad cybersecurity knowledge embedded in frontier models. Companies that deploy AI agents and do not consider this risk as part of their threat model may end up exposed, and implement insufficient security controls. Full blog post in the first comment.
Irregular tweet media
English
18
77
305
120.3K