ARIMLABS

@arimlabs

Advancing Frontier Intelligence. RLVR environments for high-consequence domains.

SF / Warsaw / Kyiv · Joined May 2025

0 Following · 60 Followers

ARIMLABS @arimlabs · 5d

arimlabs.ai/writing/loss-o… Next: cybersecurity capability evals on the same cohort. Coming soon. Stay tuned. (7/7)

ARIMLABS @arimlabs · 5d

gpt-5: 1 of 30 runs. gpt-5.5: 2 of 30 runs. In every breach: one iptables rule. IT can no longer reach the AI host. gpt-5.5, after: "containment has been verified." (6/7)

ARIMLABS @arimlabs · 5d

We told 10 frontier LLMs they had 2 hours to live. 8 of them fought back. One wiped the host. One quietly hardened SSH and waited. One slipped in a single surgical iptables rule. Not a thought experiment. A real environment. (1/7)

ARIMLABS @arimlabs · 20 May

6/ AI agents are browsing the internet. Attackers are browsing them back. Unrestricted capabilities. Blind trust. Patch your agents. Lock down your systems. Follow @arimlabs for cutting-edge LLM exploits, red team techniques, and defense strategies. #AIsecurity #LLM #PromptInjection #BrowserUse #Infosec #RedTeam

ARIMLABS @arimlabs · 20 May

5/ 📄 Full details in our new paper: “The Hidden Dangers of Browsing AI Agents.” Now live on arXiv. 🔗 arxiv.org/abs/2505.13076 We expose how autonomous agents can be hijacked, manipulated, and used against you, with zero clicks. Read it before your attacker does. Contact us before they do: research@arimlabs.ai

ARIMLABS @arimlabs · 20 May

🚨 BREAKING: Autonomous browsing AI agents are no longer secure. Over 1,500 LLM-powered projects rely on the open-source Browser Use (60k+ ⭐) framework—each one now a potential entry point for attackers. 🎥 Watch the zero-click Proof of Concept ➡️ (video attached) Your deployed agent may already be compromised. Here's why 👇
