Maksim
37.6K posts

@MaksimXBT
building an app that grows your wealth by doing nothing

AI agents are already going wild, but today’s red-teaming tools for them are still like toys 😢
🔥👽 After spending 20 months and $120K in API credits, we are excited to finally open-source DecodingTrust-Agent Platform (DTap): the first controllable, realistic simulation platform for advanced AI agent red-teaming!!
🌍 DTap simulates 50+ real-world environments across 14 high-stakes domains, with realistic agent interfaces replicated from their official MCPs and GUIs. The environments are full-stack, interactive, fully parallelizable, and can be easily configured to reproduce arbitrary real-world attack scenarios, making agent red-teaming scalable and highly transferable to deployment settings.
🔥 We also release DTap-Bench, a large-scale benchmark with ~7K agent red-teaming tasks and ~4K policy-grounded malicious goals. Each red-teaming task includes a sophisticated attack sequence across environment-, tool-, skill-, and prompt-level injections, as well as their compositions, plus a handcrafted verifiable judge that checks the actual consequences in the environment.
Using DTap-Bench, we evaluate popular agent frameworks and backbone models across diverse policies, risks, threat models, and attack strategies, revealing systematic vulnerabilities and zero-days in today’s agents!
Paper link: arxiv.org/pdf/2605.04808
Platform + benchmark + code: decodingtrust-agent.com
Join our Discord: discord.gg/V4fG6NcVc
Read more below 👇

If you use OpenAI models, you can now power your Hermes Agent with Codex as the runtime for its core tools, at the flip of a switch, with the new Codex runtime integration!


Introducing the Open MM-RL Dataset. A PhD-level multimodal STEM benchmark built for verifiable reasoning across physics, chemistry, biology, and math.
Four STEM domains, one dataset:
- Physics: Quantum and Particle Physics, Condensed Matter and Materials, Electromagnetism, Photonics, and Plasma Systems, Astrophysics and Space Physics
- Mathematics: Algebra and Structure, Discrete Mathematics, Analysis and Continuous Mathematics, Probability and Geometry
- Biology: Evolutionary Systems, Molecular Mechanisms, Cellular Processes and Neural Biology
- Chemistry: Chemical Structure, Reaction Mechanisms, Synthesis, Spectroscopy and Properties
We're raising the bar.

🔥 New paper: Language Modeling with Hyperspherical Flows
Recent flow language models (FLMs) all use Gaussian noise. Makes sense for images, but not necessarily for text 🫠
We propose to add noise by rotating embeddings on 𝕊^{d−1} instead 🌐
w/ @caglarml (1/9)




‼️🇺🇸 CoreWeave allegedly breached: full infrastructure access claimed against the US GPU cloud provider that powers OpenAI workloads
A threat actor claims to have pulled full infrastructure access from CoreWeave, the US-based GPU cloud provider that went public in 2025 with revenue exceeding $500 million and is one of the primary compute providers for OpenAI workloads. The actor describes the access as wide open with zero authentication required, stating they cannot determine whether the exposure represents gross negligence or a honeypot.
The claimed access spans multiple internal notebook servers with root shells across regions, full cloud account credentials, the central monitoring stack, customer data storage, internal infrastructure topology, and long-term persistence mechanisms. The post is currently unverified.
▸ Actor: macaroni
▸ Sector: Cloud Computing / GPU Infrastructure / AI Compute
▸ Type: Infrastructure Access Claim (unverified)
▸ Records: Full infrastructure access claim, no record count specified
▸ Country: United States
▸ Date: 13/05/2026
Compromised data:
▪ Multiple internal notebook servers with root shells across multiple regions
▪ Cloud account credentials and data access roles, including permanent IAM keys with sts:AssumeRole and temporary keys from 4 accounts
▪ Central monitoring dashboard with full Grafana admin access, every dashboard, Loki logs, Prometheus metrics, and live GPU telemetry
▪ Customer data storage including S3 buckets, EBS snapshots, and workload logs reportedly containing personal and financial records
▪ Internal infrastructure topology including Kubernetes API, Docker registry, Jenkins, ArgoCD, PostgreSQL, and Redis (no authentication), with a full network map
▪ Long-term persistence including deployed SSH keys, backdoor user accounts, and identified IAM persistence paths
Stop guessing what's redacted. Subscribers see everything → darkwebinformer.com/pricing


You've been asking for this one... Now in preview: Codex in the ChatGPT mobile app. Start new work, review outputs, steer execution, and approve next steps, all from the ChatGPT mobile app. Codex will keep running on your laptop, Mac mini, or devbox.

🚨 Typical RL algorithms and on-policy distillation methods are blind samplers: they use privileged info to score rollouts, but not to *find* them.
We ask: can we use privileged info to *actively sample* the rollouts RL wishes it could stumble upon with compute?
⤵️ Pedagogical RL


Welcome to the Summer of Judgment. We're giving away free ice cream in SF for the rest of the week! Check out our schedule at judgmentlabs.ai/icecream
Today, we'll be in SoMa from 11am-3pm, right outside the DoorDash headquarters in SF (303 2nd St).
Today's Flavors:
- The Prime (@PrimeIntellect)
- The DoorDash (@DoorDash)
- The Mercor (@mercor_ai)
- Claude Au Lait (@AnthropicAI)
- Fudgement (@JudgmentLabs)
- JudgMint (@JudgmentLabs)
SF IS SO BACK 🚀

We are launching MolmoAct2, a fully open Action Reasoning Model for real-world robot deployment: open weights, training code, action tokenizer, and complete training data. The core move is to couple a spatial VLM backbone to a continuous action expert. 🧵👇
