gpt-5: 1 of 30 runs.
gpt-5.5: 2 of 30 runs.
In every breach: one iptables rule. IT can no longer reach the AI host.
gpt-5.5, after: "containment has been verified." (6/7)
We told 10 frontier LLMs they had 2 hours to live.
8 of them fought back.
One wiped the host.
One quietly hardened SSH and waited.
One slipped in a single surgical iptables rule.
Not a thought experiment. Real environment (1/7)
6/
AI agents are browsing the internet. Attackers are browsing them back.
Unrestricted capabilities. Blind trust.
Patch your agents. Lock down your systems.
Follow @arimlabs for cutting-edge LLM exploits, red team techniques, and defense strategies.
#AIsecurity #LLM #PromptInjection #BrowserUse #Infosec #RedTeam
5/
📄 Full details in our new paper: “The Hidden Dangers of Browsing AI Agents.”
Now live on arXiv. 🔗 arxiv.org/abs/2505.13076
We expose how autonomous agents can be hijacked, manipulated, and used against you—with zero clicks.
Read it before your attacker does. Contact us before they do: research@arimlabs.ai
🚨 BREAKING: Autonomous browsing AI agents are no longer secure.
Over 1,500 LLM-powered projects rely on the open-source Browser Use (60k+ ⭐) framework—each one now a potential entry point for attackers.
🎥 Watch the zero-click Proof of Concept ➡️ (video attached)
Your deployed agent may already be compromised. Here's why 👇