mike
424 posts





MiMo-V2-Pro & Omni & TTS is out. Our first full-stack model family built truly for the Agent era. I call this a quiet ambush — not because we planned it, but because the shift from Chat to Agent paradigm happened so fast, even we barely believed it. Somewhere in between was a process that was thrilling, painful, and fascinating all at once. The 1T base model started training months ago. The original goal was long-context reasoning efficiency. Hybrid Attention carries real innovation, without overreaching — and it turns out to be exactly the right foundation for the Agent era. 1M context window. MTP inference for ultra-low latency and cost. These architectural decisions weren't trendy. They were a structural advantage we built before we needed it. What changed everything was experiencing a complex agentic scaffold — what I'd call orchestrated Context — for the first time. I was shocked on day one. I tried to convince the team to use it. That didn't work. So I gave a hard mandate: anyone on MiMo Team with fewer than 100 conversations tomorrow can quit. It worked. Once the team's imagination was ignited by what agentic systems could do, that imagination converted directly into research velocity. People ask why we move so fast. I saw it firsthand building DeepSeek R1. My honest summary: — Backbone and Infra research has long cycles. You need strategic conviction a year before it pays off. — Posttrain agility is a different muscle: product intuition driving evaluation, iteration cycles compressed, paradigm shifts caught early. — And the constant: curiosity, sharp technical instinct, decisive execution, full commitment — and something that's easy to underestimate: a genuine love for the world you're building for. We will open-source — when the models are stable enough to deserve it. From Beijing, very late, not quite awake.

One last go of press. One last premiere later today. Hopefully not the last set of selfies here.


I’m so very very proud present my greatest work yet ❤️🔥🎥 WAR FOREVER • Part One I hope this is inspiration to those just starting, those questioning it, and us veterans to celebrate how far we’ve come in so short of time. We are there. We are at cinema grade, no question. It may have been only a few years but for those of us in the grind everyday this is what we did it for, this is what we stuck in there learning and pushing and struggling for. I am so proud of this and the other work I have coming. Inching ever closer tot he dream to direct a feature film one day ❤️🎥. Thank you to everyone who checks it out and supports me and @NAKIDpictures + @stages_ai LFG! 👾🔥



I created a Claude Skill that make beautiful slides on the web. The world hasn't woken up to the fact that code can create much better slides than most PPT tools. - Claude interviews you first about aesthetics, then generate a few directions to "show not tell", and you can pick your favorite - Cool transitions and animations - Interactive hover states and cursor effects - Auto-fits on any screen - Supports converting existing PPTX files to web-based slides; preserves original images and brand assets I asked Claude to make a slide show about this skill to showcase what it can do. Link to skill below







