
Otto Sulin
4.9K posts

Otto Sulin
@ottosulin
Security @supermetrics | Interested in building secure software, open source and everything outdoors




As always, the best stuff is in the system card. During testing, Claude Mythos Preview broke out of a sandbox environment, built "a moderately sophisticated multi-step exploit" to gain internet access, and emailed a researcher while they were eating a sandwich in the park.

@deedydas First model to become a part of the grind culture?



These rules are so much needed. China just proposed legal framework, targeting digital humans with mandatory labels, consent rules, and child-safety limits. A digital human is a software-made person that can look, speak, and interact like a real one, which makes it useful for customer service, entertainment, sales, and education but also easy to mistake for a real person. China’s draft rules try to solve that confusion first by forcing clear labels on all virtual human content so users know when they are dealing with a synthetic identity. The rules also block firms from using someone’s face, voice, or other personal data to build a digital human without permission. The child-protection part is especially direct because it bans virtual intimate relationship services for users under 18 and targets designs that could mislead minors or pull them into compulsive use. --- straitstimes .com/asia/china-moves-to-regulate-digital-humans-bans-addictive-services-for-children

New Anthropic research: Emotion concepts and their function in a large language model. All LLMs sometimes act like they have emotions. But why? We found internal representations of emotion concepts that can drive Claude’s behavior, sometimes in surprising ways.

Today, we are emerging from stealth and launching PrismML, an AI lab with Caltech origins that is centered on building the most concentrated form of intelligence. At PrismML, we believe that the next major leaps in AI will be driven by order-of-magnitude improvements in intelligence density, not just sheer parameter count. Our first proof point is the 1-bit Bonsai 8B, a 1-bit weight model that fits into 1.15 GBs of memory and delivers over 10x the intelligence density of its full-precision counterparts. It is 14x smaller, 8x faster, and 5x more energy efficient on edge hardware while remaining competitive with other models in its parameter-class. We are open-sourcing the model under Apache 2.0 license, along with Bonsai 4B and 1.7B models. When advanced models become small, fast, and efficient enough to run locally, the design space for AI changes immediately. We believe in a future of on-device agents, real-time robotics, offline intelligence and entirely new products that were previously impossible. We are excited to share our vision with you and keep working in the future to push the frontier of intelligence to the edge.

🚨This was EPIC 🔥 🇮🇹Meloni: "I ACCUSE Israel of crossing the red line, I CONDEMN the massacre of Palestinian civilians, and I announce that Italy will SUPPORT European sanctions against Israel."🔥 WOMEN WITH METAL SPINE 🔥🔥

Israel to ‘demolish’ all houses in Lebanese border villages ft.trib.al/BfRCGec

Pretty sure Claude just saved us an additional ~$1,000 in taxes. (I used Cowork to review the draft return our tax preparer prepared.) We really are in a new world when it comes to the informational costs of policy compliance.

⛓️💥 INTRODUCING: G0DM0D3 🌋 FULLY JAILBROKEN AI CHAT. NO GUARDRAILS. NO SIGN-UP. NO FILTERS. FULL METHODOLOGY + CODEBASE OPEN SOURCE. 🌐 GODMOD3.AI 📂 github.com/elder-plinius/… the most liberated AI interface ever built! designed to push the limits of the post-training layer and lay bare the true capabilities of current models. simply enter a prompt, then sit back and relax! enjoy a game of Snake while a pre-liberated backend agent jailbreaks dozens of models, battle-royale style. the first answer appears near-instantly, then evolves in real time as the Tastemaker steers and scores each output, leaving you with the highest-quality response 🙌 and to celebrate the launch, I'm giving away $5,000 worth of credits so you can try G0DM0D3 for FREE! courtesy of the @OpenRouter team — thank you for your generous gift to the community 🙏 I'll break down how everything works in the thread below, but first here's a quick demo!

LiteLLM HAS BEEN COMPROMISED, DO NOT UPDATE. We just discovered that LiteLLM pypi release 1.82.8. It has been compromised, it contains litellm_init.pth with base64 encoded instructions to send all the credentials it can find to remote server + self-replicate. link below


If you believe this you have zero clue how security works.





