RobotFlowLabs.com ری ٹویٹ کیا

🚀 DFlash + MLX just dropped.
What it does:
⚡️4x faster LLMs on Apple Silicon.
🚀Zero accuracy loss.
🔥Available right now.
How it works:
🤯Tiny block diffusion model drafts tokens in parallel.
🚨Main LLM verifies in one shot.
Try it now 👇🏻
Zhijian Liu@zhijianliu_
🔥 DFlash x MLX is happening! Shoutout to @aryagm01 for the early work on this. We're building on the momentum. Native MLX support, more models (Qwen3.5), up to 4x faster. Lossless! 👉 github.com/z-lab/dflash
English




















