Sabitlenmiş Tweet

We introduce Orthrus, a dual-architecture that unifies AR-level fidelity with parallel diffusion-style decoding, addressing the memory-bandwidth bottleneck in autoregressive generation.
Paper: arxiv.org/abs/2605.12825
Code: github.com/chiennv2000
Thread🧵
English







