
We’re happy to support Guanghan and team in their research as they push dLLM RL forward.
Check out d2 now.
Guanghan Wang (@Guanghan__Wang)
🚀 Introducing d2 — a principled and efficient RL framework for improving reasoning in diffusion language models (DLMs). RL works well for autoregressive LLMs. But for DLMs? It’s fundamentally harder. We show how to do it right. 👇 📖 arxiv.org/abs/2509.21474 🌐 guanghanwang.com/d2 💻 github.com/kuleshov-group… 🧵1/12

