
Mamba-3 just dropped: SSMs are catching Transformers.
At 1.5B params, beats Llama-3.2-1B on latency across ALL sequence lengths.
Secret: complex-valued states + MIMO — more accuracy without sacrificing decode speed.
Transformer monopoly is ending. #AI #OpenSource
English











