Ivan Rocha retweetledi

Introducing Nucleus-Image: the first sparse Mixture-of-Experts diffusion model
17B parameters. Only 2B active. 10x more parameter-efficient than leading diffusion models.
Toe-to-toe with GPT Image 1, Imagen 4, and Qwen-Image: from pure pre-training alone. No DPO. No RL. No preference tuning.
Day 0 support in 🤗 Hugging Face diffusers. Fully open-source under Apache 2.0.
Weights, training code, and dataset recipe - we're not holding anything back <3



English





















