mlfeed.tech đã retweet

NVIDIA Nemotron 3: Efficient and Open Intelligence
NVIDIA introduces a family of models (Nano, Super, Ultra) using hybrid Mamba-Transformer MoE architecture with up to 1M token context and state-of-the-art reasoning performance.
📝 arxiv.org/abs/2512.20856
English


































