gordana neskovic retweetledi

📢 EnterpriseOps-Gym is now accepted to ICML 2026 🇰🇷✨
website: enterpriseops-gym.github.io
🧩 1,150 expert-curated tasks
🏢 8 enterprise domains
🧰 512 tools
✅ Deterministic verifiers (Outcome + Integrity + Compliance)
📦 Fully containerized, no enterprise instance required
📊 𝗙𝗿𝗲𝘀𝗵 𝘂𝗽𝗱𝗮𝘁𝗲: GPT-5.5 numbers are out (alongside the strongest open and closed baselines). We will keep updating as new models drop because long-horizon reliability is moving fast, and we want to stay current.
One exciting (and humbling) signal: we’ve already seen frontier lab teams experimenting with EnterpriseOps-Gym to stress-test and improve their agents: including folks at OpenAI, Mistral AI, and NVIDIA AI. 🙏
📈 𝗘𝗮𝗿𝗹𝘆 𝗿𝗲𝘀𝘂𝗹𝘁𝘀 𝗮𝗿𝗲 𝗽𝗿𝗼𝗺𝗶𝘀𝗶𝗻𝗴: Top open models including NVIDIA Nemotron Super are showing strong performance, in some cases competing with frontier models.
@shiva_malay @sagardavasam @PShravannayak @turingcom @jonsidd @ServiceNowRSRCH @Mila_Quebec

English































