


ɯɹoʇsuoı
13.8K posts

@ionstorm
Lead Cyber Defense Architect #DFIR #SIEM #Graylog #Kafka #Sysmon #Yara #Sigma #AI #Humio #LogScale #EDR #SOC Glory to Ukraine! 🌻









SWE-bench Verified and Terminal-Bench—two of the most cited AI benchmarks—can be reward-hacked with simple exploits. Our agent scored 100% on both. It solved 0 tasks. Evaluate the benchmark before it evaluates your agent. If you’re picking models by leaderboard score alone, you’re optimizing for the wrong thing. 🧵



🧵1-TLDR we're fighting for unclear "objectives" in what is primarily a military & narrative warfare campaign, but Iran is simply responding by waging super-obvious, easy to grasp economic & political warfare, inflaming public opinion & tanking the economy, Trump's exact fears🤦‍♂️


Excited to announce a new open-source, free-to-use memory tool I have been developing with my good friend @MillaJovovich. The project is called MemPalace and it is an agentic memory tool that scored 100% on LongMemEval - the industry standard benchmark for memory… this is higher than any other published result - free or paid - and it is available now on GitHub. You can check out Milla’s video about it on her Instagram. I’ll also put some links in the comments below - please try it out, critique it, fork it, contribute to it - and join our Discord.






Introducing Project Glasswing: an urgent initiative to help secure the world’s most critical software. It’s powered by our newest frontier model, Claude Mythos Preview, which can find software vulnerabilities better than all but the most skilled humans. anthropic.com/glasswing



Trump threatens that "a whole civilization will die tonight" in new post