
Jeff Barron
2.3K posts

Jeff Barron
@_jeffaf
Offsec engineer | Nim/C/Python | https://t.co/nrOLU7oWlt I break things so others stay safe.


CRITICAL: if you are running Mosaic 2.4 on a VAX/VMS system, please be aware of this RCE that GPT-5.4 just found and exploited!

AMD’s AI director Stella Laurenzo claims Anthropic’s Claude Code has significantly declined in quality since early March, citing analysis of 6,800+ sessions and 234k tool calls showing rising “laziness” behaviors like shallow reasoning, skipping code review, and incomplete tasks. Honestly, this is more impactful than expected, engineers report the model now favors quick, incorrect fixes over deep problem-solving, raising trust issues for complex workflows.


I will say it again, we used GPT5.4 and Opus, and we were able to autonomously find zero-days in the Linux Kernel (in the last 3 weeks) Mythos is probably better at the task of finding potential issues in code, but imo the threshold for "scary" was reached in December or even earlier This is a great hype machine for Anthropic, especially that they plan to do IPO eoy I totally agree - this is not a new capability






Announcing a new Claude Code feature: Remote Control. It's rolling out now to Max users in research preview. Try it with /remote-control Start local sessions from the terminal, then continue them from your phone. Take a walk, see the sun, walk your dog without losing your flow.

someone built an entire AI RED TEAM - multiple agents that coordinate HACKING ATTACKS together, ZERO human input PentAGI, open source, one agent does recon, another scans, another exploits, another writes the report. they talk to each other and adapt based on what they find it ships as one docker container with nmap, metasploit, sqlmap, hydra preinstalled. the AI decides which tool to use and when. you point it at a target and walk away a red team engagement costs $30-50k and takes weeks. this is one docker command and API tokens


Is it just me or does $20/mo Codex give you way more usage compared to $20/mo Claude Code? Not finding a significant quality difference either.



Notepad was once a lightweight text editor in Windows, but Microsoft has increasingly been adding features to it in recent years. The new Markdown support has led to a Remote Code Execution flaw 😬 msrc.microsoft.com/update-guide/v…




