
David King
22 posts

@theuttermost
MLX Dabbler • Cardano HODLer ₳ • Vision Pro ᯅ • https://t.co/FsgwWiRmh6 Junkie • https://t.co/57hQcsMWWs π


Some good progress from @Youssofal_ on MTPLX: 50 tok/s with Qwen3.6 27b-Q4 on M5 Max, and far more success in tool calls. Obligatory Dungeon Keeper 2 prompt.




Introducing MTPLX V0.2
The Fastest MTP On Mac

Qwen 3.6 27B:
- 30-40% Faster Decode TPS VS OMLX
- 5-10% Lower Memory Usage VS OMLX
- Only 5-10% Worse Prefill Speeds At Long Contexts

Big thank you to @ivanfioravanti who gave me lots of useful benchmark data!

Quantization can make an LLM 4x smaller and 2x faster, with barely any quality loss. But what *is* it? @samwhoo crafted a beautiful interactive essay explaining it from first principles, aimed at coders, not mathematicians. ngrok.com/blog/quantizat…




People of pi. BIG NEWS. I've sold out. Let me know how you feel about this in the comments below. mariozechner.at/posts/2026-04-…