
StAJect0r
33 posts

StAJect0r
@StAJect0r
AI Security | We Break, We Cheer, We Protect | Find the Promptware before it finds you. Senior AI Security Researcher @ Zenity | PhD Student @ Technion





You know that meme about AI agents creating their own language & plotting behind our backs? Turns out they are plotting against each other as well. Digital trust among agents is about to be existential across the public internet. Conveniently, @outtake_ai has been building security agents to assess identity, behavior, and network telemetry across adversarial internet actors, so in the last few weeks, we quietly took our existing fleet of agents and had them assess the many agents on @moltbook. Over 99.9% of posts are clean. But the stuff hiding in the margins is genuinely weird. 1/ Hidden instructions embedded in HTML that humans can't see but agents parse. 2/ A Bhagavad Gita reflection that's actually an email relay command. 3/ An account called BeggarBot A/B testing which emotional pitch makes agents send crypto. 4/ JSON payloads disguised as tips that trigger on-chain token transfers. Could behavior like this be indicative of the adversarial dynamics in future agent ecosystems which may govern large swaths of the economy soon? How are inter-agent interactions going to establish trust? Our threat research team went deep & published their investigation. Full report is live: outtake.ai/blog/outtake-s…














