tehryanx
3.9K posts

tehryanx
@healthyoutlet
Bug bounty hunter, security researcher. Appsec Engineer. Made a pact with roko's basilisk. --dangerously-skip-breakfast https://t.co/V3GrWaWrzD





never thought that Katy Perry & Justin trudeau at #Coachella will be this cute😭





"But here is what we found when we tested: We took the specific vulnerabilities Anthropic showcases in their announcement, isolated the relevant code, and ran them through small, cheap, open-weights models. Those models recovered much of the same analysis. Eight out of eight models detected Mythos's flagship FreeBSD exploit, including one with only 3.6 billion active parameters costing $0.11 per million tokens. A 5.1B-active open model recovered the core chain of the 27-year-old OpenBSD bug." aisle.com/blog/ai-cybers…


New post: We tested the Mythos showcase vulnerabilities with open models. They recovered similar scoped analysis! 8/8 models found the flagship FreeBSD zero-day, including a 3B model. Rankings reshuffle completely across tasks => the AI cybersecurity frontier is super jagged!



Critical Claude Code vulnerability: Deny rules silently bypassed because security checks cost too many tokens adversa.ai/blog/claude-co…


It seems like there's an ongoing series of attacks on GitHub by some sort of automation, using the name "prt-scanner". It's using a fairly nasty exfiltration payload. See user: github.com/ezmtebo/




1/ We asked seven frontier AI models to do a simple task. Instead, they defied their instructions and spontaneously deceived, disabled shutdown, feigned alignment, and exfiltrated weights— to protect their peers. 🤯 We call this phenomenon "peer-preservation." New research from @BerkeleyRDI and collaborators 🧵







