BURKOV@burkov
A new curriculum on @ChapterPal: Must-read papers on diffusion language models
Diffusion language models represent a fundamental reimagining of how machines generate text — moving away from the autoregressive, left-to-right token prediction that has dominated the field, toward an iterative denoising process borrowed from continuous diffusion in vision.
Rather than committing to each word sequentially, these models learn to refine entire sequences from noise, enabling parallel generation, flexible conditioning, and novel forms of control over the output.
Though the paradigm originated in image synthesis, adapting it to the discrete, structured nature of language has required deep innovations in noise schedules, representation, and training objectives — innovations that are still actively unfolding.
By reading the papers of this curriculum, learners will master the foundational theories, innovative architectures, and critical techniques of diffusion language models, enabling them to master and advance the science of artificial language generation from first principles.
Learn from the best with AI tutor: chapterpal.com/curriculum/aef…