Chunsan Hong

2 posts

Chunsan Hong

Chunsan Hong

@ChunsanHong

I focus on improving generative models.

Seoul, Korea Katılım Şubat 2024
30 Takip Edilen9 Takipçiler
Chunsan Hong retweetledi
Sam Acquaviva
Sam Acquaviva@Sam_Acqua·
Flow models are a promising alternative to autoregression. But the current standard of evaluating flow models is broken. The reported 3x improvement in 1024-step PPL since 2023 is closer to 1.1x if you control for sample entropy. (1/12)
English
7
30
163
48.2K
Chunsan Hong retweetledi
Discrete Diffusion Reading Group
Discrete Diffusion Reading Group@diffusion_llms·
📢 May 11 (Mon): Unifying Masked Diffusion Models with Various Generation Orders and Beyond 🤔AR generates left-to-right; masked diffusion generates in any order; and block diffusion generates block-wise left-to-right, with random order within each block. Can we unify all these frameworks and further learn the generation order jointly with token prediction? 💡The authors propose OeMDM, a unified masked diffusion framework that can express various generation orders, and LoMDM, which jointly learns the generation order and the diffusion model. 🔍Everything comes down to the scheduler: by making the forward and reverse schedulers maximally flexible, it becomes possible to describe all generation orders, even learnable generation orders, within the masked diffusion framework. 📈LoMDM achieves SOTA among discrete diffusion models across all benchmarks, and even outperforms block diffusion models, which strongly benefit from left-to-right bias! This Monday, Chunsan Hong (@ChunsanHong) will present his paper, which received Spotlight at ICML 2026. Collaborators of this work include: Sanghyun Lee, Jong Chul Ye (bispl.weebly.com/professor.html) Paper link: arxiv.org/abs/2602.02112
Discrete Diffusion Reading Group tweet media
English
2
5
23
7.4K