
Flowith
1.9K posts

Flowith
@flowith
Flowith is an agentic AI workspace that connects your knowledge, creation, and execution in a single flow. Canvas: https://t.co/XfKrOFlIHt OS: https://t.co/qSfnxWeY3i



Introducing Claude Fable 5: a Mythos-class model that we’ve made safe for general use. Its capabilities exceed those of any model we’ve ever made generally available.

Introducing MiniMax M3: The First Open-Weights Model to Combine Three Frontier Capabilities - Coding & Agentic Frontier: 59.0% SWE-Bench Pro, 66.0% Terminal Bench 2.1, 34.8% SWE-fficiency, 28.8% KernelBench Hard, 74.2% MCP Atlas - MiniMax Sparse Attention scales context to 1M - Natively Multimodal from Step Zero API: platform.minimax.io Token Plan: platform.minimax.io/subscribe/toke… 🚀New! MiniMax Code: code.minimax.io Weights & Tech Report in ~10 Days

turns out a lot of you noticed the same thing. don't expect an official response, so i had my agents build something: nerfed.watch → independent benchmarks for codex & claude code every 2 days → problems sourced from TerminalBench2 (easy ones filtered out) → subscribe (free) to get alerted when something gets nerfed first batch is already running. scores drop in 2 days. if this gets 3k subscribers i'll keep it running long-term (benchmarks burn a lot of tokens and that's not free). RT appreciated. let's keep them honest.










