Miguel Miranda retweeted

Stop buying more VRAM.
Everyone’s posting Qwen 3.6 configs running insanely fast on 12GB cards.
But do you actually understand the flags making it possible? Weights are only half the story. KV cache is eating your VRAM alive.
The secret isn’t just 4-bit weights; it’s the KV cache sorcery everyone’s missing.
Here’s the annotated command & real tricks explained:
@elonmusk @grok #Ai
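
To put a rough number on the "KV cache is eating your VRAM" point, here is a minimal back-of-the-envelope sketch. It uses the standard estimate for a transformer with grouped-query attention: two tensors (keys and values) per layer, each of size KV heads × head dimension × context length. The model dimensions below are hypothetical placeholders chosen for illustration, not the real Qwen configuration, and the 1-byte case ignores the small per-block overhead a quantized cache carries.

```python
def kv_cache_bytes(n_layers, n_kv_heads, head_dim, context_len, bytes_per_elem):
    """Estimate KV cache size in bytes for a single sequence."""
    # Factor of 2: one tensor for keys and one for values in every layer.
    return 2 * n_layers * n_kv_heads * head_dim * context_len * bytes_per_elem


GIB = 1024 ** 3

# Hypothetical model dimensions (assumptions for illustration only).
N_LAYERS = 32
N_KV_HEADS = 8
HEAD_DIM = 128
CONTEXT_LEN = 32768

# 2 bytes per element ~ a 16-bit cache; 1 byte ~ an 8-bit cache.
for label, bytes_per_elem in [("16-bit cache", 2), ("8-bit cache", 1)]:
    size = kv_cache_bytes(N_LAYERS, N_KV_HEADS, HEAD_DIM, CONTEXT_LEN, bytes_per_elem)
    print(f"{label}: {size / GIB:.1f} GiB")
```

Under these assumed dimensions the cache alone takes about 4 GiB at 16 bits and about 2 GiB at 8 bits for a 32K context, which is why cache precision and context length matter as much as weight quantization on a 12GB card.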
