
Dustin Ogle
1.4K posts

Dustin Ogle
@DustinOgle33
Formerly $45k/mo online business owner. Building an Agent Harness inside a game. Watch AI debate and work. Average run ~2hr, ~1M tokens. https://t.co/8OkuSSMN73


GPT-5.6 rumors: • Beats Fable at agentic coding • 3x cheaper • Less censored • Flips sentiment from Anthropic to Opus tiny chance it arrives tomorrow next week is where things get interesting












Intelligence should be open, accessible, and ready to build with, empowering every developer, everywhere. GLM-5.2 is now available to all GLM Coding Plan users, including Lite, Pro, Max, and Team plans. docs.z.ai/devpack/latest… As our new flagship model, GLM-5.2 delivers powerful coding capabilities, usable 1M-context support, and continued strengths in long-horizon tasks. API and Chatbot services will launch next week. The model will also be officially open-sourced next week under the MIT License. The future of AI is open, and it belongs to the people.








MiniMax M3 is now open source! The model combines native multimodal understanding, ultra-long context, and Agent capabilities in one.🚀 New MSA architecture: up to 1M context at 1/20 the per-token compute of the previous gen. 9x faster prefilling, 15x faster decoding, on par with full attention on most tasks. Two versions 👇: MiniMax-M3 (full precision) and MiniMax-M3-MXFP8 (quantized, lower VRAM). 🤖 modelscope.ai/models/MiniMax… 🤖 modelscope.ai/models/MiniMax… 🧠 12hrs autonomous: reproduced an ICLR 2025 Outstanding Paper end to end, 18 commits + 23 experiment plots ⚡ 147 iterations, 9.4x CUDA speedup: FP8 matmul kernel on Hopper, peak utilization 7.6% → 71.3%, zero human intervention 🛠️ PostTrainBench: scored 37.1, ranking 3rd behind Opus 4.7 (42.4) and GPT-5.5 (39.3)



I’ve been doing “loops” for a while now. I don’t do much traditional prompting. Most of my prompts are barely a sentence expressing an outcome. - my orchestrator prompts parallel agents - my computer use verifier gives it feedback - my security, production, and SEO agents generate prompts for fixes The industry is typically 3-6 months behind what we’re doing at Replit.




Hiring Fable on API pricing Full time (40 hrs / wk) is: $1,248,000 / year wow.


Big day for Mac Studio 512GB players! 🥳 MiniMax M3 and Kimi K2.7 Code just dropped as two powerful new open weights models. Both will still need some quantization to run comfortably on a single machine, but believe the community is already working on it! 😻 (btw GLM-5.2 is on the way 😸)

MiniMax M3, Open-Weight, Now On Hugging Face , with only ~428B parameters and ~23B activated parameters Weights: huggingface.co/MiniMaxAI/Mini… MiniMax Sparse Attention: huggingface.co/papers/2606.13…








