Discrete Diffusion Reading Group (@diffusion_llms) - ملف تويتر

تغريدة مثبتة

Discrete Diffusion Reading Group@diffusion_llms·31 Eki

Drowning in the sea of Discrete Diffusion papers? 🌊 We got you. Join our Reading Group! From theory → empirics, and language → molecules — we’ll decode the chaos together 💫 Join the cult—uh, I mean community 😇 👉 Google Group: groups.google.com/g/diffusion-ll… (1 / 2)

Discrete Diffusion Reading Group tweet media

English

2

7

36

8.1K

Discrete Diffusion Reading Group أُعيد تغريده

Justin Deschenaux@jdeschena·4h

Interested in our work on Ψ-samplers? Make sure to join on Monday!

Discrete Diffusion Reading Group@diffusion_llms

📢 Mar 23 (Mon): The Diffusion Duality, Chapter II: Ψ-Samplers and Efficient Curriculum ☯️The Diffusion Duality (Duo) (ICML 2025) showed that uniform-state discrete diffusion arises from Gaussian diffusion. 🔮The new Chapter II paper (ICLR 2026) introduces Ψ-samplers: non-Markovian predictor-corrector samplers for arbitrary noise priors! Unlike ancestral sampling which plateaus, Ψ-samplers exhibit improved test-time scaling, beating MDLM on language generation (OpenWebText) and image generation (CIFAR-10). ⚡️The authors also reformulated the Gaussian curriculum from Duo, reducing its training time by 25% while matching perplexity and downstream accuracy. This Monday, Justin Deschenaux (@jdeschena) will present his paper, published with collaborators Caglar Gulcehre (@caglarml) and Subham Sahoo (@ssahoo_) Paper link: arxiv.org/abs/2602.21185

English

0

2

16

1.2K

Discrete Diffusion Reading Group أُعيد تغريده

Zhihan Yang@zhihanyang_·4h

Join our reading group next Monday! Paper: The Diffusion Duality, Chapter II: Ψ-Samplers and Efficient Curriculum Presenter: Justin Deschenaux (EPFL) @jdeschena

Discrete Diffusion Reading Group@diffusion_llms

📢 Mar 23 (Mon): The Diffusion Duality, Chapter II: Ψ-Samplers and Efficient Curriculum ☯️The Diffusion Duality (Duo) (ICML 2025) showed that uniform-state discrete diffusion arises from Gaussian diffusion. 🔮The new Chapter II paper (ICLR 2026) introduces Ψ-samplers: non-Markovian predictor-corrector samplers for arbitrary noise priors! Unlike ancestral sampling which plateaus, Ψ-samplers exhibit improved test-time scaling, beating MDLM on language generation (OpenWebText) and image generation (CIFAR-10). ⚡️The authors also reformulated the Gaussian curriculum from Duo, reducing its training time by 25% while matching perplexity and downstream accuracy. This Monday, Justin Deschenaux (@jdeschena) will present his paper, published with collaborators Caglar Gulcehre (@caglarml) and Subham Sahoo (@ssahoo_) Paper link: arxiv.org/abs/2602.21185

English

0

2

6

309

Discrete Diffusion Reading Group@diffusion_llms·4h

📢 Mar 23 (Mon): The Diffusion Duality, Chapter II: Ψ-Samplers and Efficient Curriculum ☯️The Diffusion Duality (Duo) (ICML 2025) showed that uniform-state discrete diffusion arises from Gaussian diffusion. 🔮The new Chapter II paper (ICLR 2026) introduces Ψ-samplers: non-Markovian predictor-corrector samplers for arbitrary noise priors! Unlike ancestral sampling which plateaus, Ψ-samplers exhibit improved test-time scaling, beating MDLM on language generation (OpenWebText) and image generation (CIFAR-10). ⚡️The authors also reformulated the Gaussian curriculum from Duo, reducing its training time by 25% while matching perplexity and downstream accuracy. This Monday, Justin Deschenaux (@jdeschena) will present his paper, published with collaborators Caglar Gulcehre (@caglarml) and Subham Sahoo (@ssahoo_) Paper link: arxiv.org/abs/2602.21185

English

0

4

9

1.8K

Discrete Diffusion Reading Group أُعيد تغريده

Subham Sahoo@ssahoo_·3d

Great minds think alike! The weighted cross-entropy term now standard in diffusion LLM training also appeared in three papers at once [1,2,3]. MDLM is winning the citation race, likely because its language modeling experiments were the most compelling. [1] MDLM: Sahoo et al., Neurips 2025 [2] MD4: @thjashin et al., Neurips 2025 [3] RADD: @Jingyang_Ou et al., ICLR 2026

Julia Turc@juliarturc

I'm learning that Flow Matching is not only a bonkers idea, but it was proposed by 3 different groups simultaneously. Goes to show that, when ideas are ripe, they surface. We are mere passive vessels. That's my Monday afternoon nihilistic rant.

English

4

6

125

12.8K

Discrete Diffusion Reading Group@diffusion_llms·3d

Missed today's session? Make sure to check the recording on YouTube: youtube.com/watch?v=_-3VwB…

YouTube

Discrete Diffusion Reading Group@diffusion_llms

📢Mar 16 (Mon): Discrete Feynman-Kac Correctors 🤔Discrete diffusion models are powerful, but out of the box they give little control over the target distribution!! 🔑Discrete Feynman-Kac Correctors fix this by using Sequential Monte Carlo (SMC) to modify the distribution by - Annealing - Composing multiple models, or - Tilting with external reward functions. All at inference time with no retraining needed! 💡This unlocks things like boosting coding performance, sampling across a range of temperatures in the Ising model, and generating higher quality protein sequences. This Monday, Mohsin Hasan (Université de Montréal, Mila) (hasanmohsin.github.io) and Viktor Ohanesian (Imperial College London) (@OhanesianViktor, scholar.google.com/citations?user…) will co-present their jointly led paper Discrete Feynman-Kac Correctors. Collaborators: Artem Gazizov (Harvard), @Yoshua_Bengio, @A_Aspuru_Guzik, Roberto Bondesan (Imperial College London), @martoskreto, @k_neklyudov Paper link: arxiv.org/abs/2601.10403

English

0

2

10

1.8K

Discrete Diffusion Reading Group أُعيد تغريده

Subham Sahoo@ssahoo_·5d

These emails melt my heart. I was once that PhD student: lost, isolated, and unsure where to turn. Not everyone gets access to the right rooms, so we created the @diffusion_llms reading group. If that’s you, join our Discord and say hi: d-llms.com

English

5

27

493

35.9K

Discrete Diffusion Reading Group أُعيد تغريده

Zhihan Yang@zhihanyang_·13 Mar

Join our reading group next Monday! Paper: Discrete Feynman-Kac Correctors Presenters: Mohsin Hasan (Mila), Viktor Ohanesian (ICL)

Discrete Diffusion Reading Group@diffusion_llms

📢Mar 16 (Mon): Discrete Feynman-Kac Correctors 🤔Discrete diffusion models are powerful, but out of the box they give little control over the target distribution!! 🔑Discrete Feynman-Kac Correctors fix this by using Sequential Monte Carlo (SMC) to modify the distribution by - Annealing - Composing multiple models, or - Tilting with external reward functions. All at inference time with no retraining needed! 💡This unlocks things like boosting coding performance, sampling across a range of temperatures in the Ising model, and generating higher quality protein sequences. This Monday, Mohsin Hasan (Université de Montréal, Mila) (hasanmohsin.github.io) and Viktor Ohanesian (Imperial College London) (@OhanesianViktor, scholar.google.com/citations?user…) will co-present their jointly led paper Discrete Feynman-Kac Correctors. Collaborators: Artem Gazizov (Harvard), @Yoshua_Bengio, @A_Aspuru_Guzik, Roberto Bondesan (Imperial College London), @martoskreto, @k_neklyudov Paper link: arxiv.org/abs/2601.10403

English

0

3

10

1.5K

Discrete Diffusion Reading Group@diffusion_llms·13 Mar

📢Mar 16 (Mon): Discrete Feynman-Kac Correctors 🤔Discrete diffusion models are powerful, but out of the box they give little control over the target distribution!! 🔑Discrete Feynman-Kac Correctors fix this by using Sequential Monte Carlo (SMC) to modify the distribution by - Annealing - Composing multiple models, or - Tilting with external reward functions. All at inference time with no retraining needed! 💡This unlocks things like boosting coding performance, sampling across a range of temperatures in the Ising model, and generating higher quality protein sequences. This Monday, Mohsin Hasan (Université de Montréal, Mila) (hasanmohsin.github.io) and Viktor Ohanesian (Imperial College London) (@OhanesianViktor, scholar.google.com/citations?user…) will co-present their jointly led paper Discrete Feynman-Kac Correctors. Collaborators: Artem Gazizov (Harvard), @Yoshua_Bengio, @A_Aspuru_Guzik, Roberto Bondesan (Imperial College London), @martoskreto, @k_neklyudov Paper link: arxiv.org/abs/2601.10403

English

0

11

39

10.7K

Discrete Diffusion Reading Group@diffusion_llms·11 Mar

First-ever discrete diffusion tutorial by @ssahoo_ @JCJesseLai 🔥

Subham Sahoo@ssahoo_

📢@CVPR 2026: first-ever tutorial dedicated to DISCRETE DIFFUSION 🔥 Part I: Consistency Models + Flow Maps - @JCJesseLai Part II: Discrete Diffusion - by me. ✨Few-step gen + inference-time scaling + live demos Co-orgs: @StefanoErmon @DrYangSong @mittu1204 @gimdong58085414 Full schedule + details👇 (1/3)

English

0

5

19

1.5K

Discrete Diffusion Reading Group أُعيد تغريده

Subham Sahoo@ssahoo_·11 Mar

📢@CVPR 2026: first-ever tutorial dedicated to DISCRETE DIFFUSION 🔥 Part I: Consistency Models + Flow Maps - @JCJesseLai Part II: Discrete Diffusion - by me. ✨Few-step gen + inference-time scaling + live demos Co-orgs: @StefanoErmon @DrYangSong @mittu1204 @gimdong58085414 Full schedule + details👇 (1/3)

English

5

41

329

20.1K

Discrete Diffusion Reading Group@diffusion_llms·9 Mar

Meeting link: teams.live.com/meet/933660122…

English

0

122

Discrete Diffusion Reading Group@diffusion_llms·5 Mar

📢Mar 9 (Mon): CANDI: Hybrid Discrete-Continuous Diffusion Models 🤔Continuous diffusion dominates image generation. LLMs process text through continuous embeddings. So why does discrete diffusion still win for language? 🍬CANDI explains why — it’s a “temporal dissonance”: at large vocabulary sizes, Gaussian noise destroys token identity way before it meaningfully degrades the continuous signal. The model can either learn discrete conditional structure or continuous geometry, but not both simultaneously. 🔑The fix? Keep some tokens clean as anchors for discrete structure, corrupt the rest with Gaussian noise. Decoupling the two lets the model learn both simultaneously — enabling off-the-shelf classifier guidance and better low-NFE generation. This Monday, Patrick Pynadath (Purdue) (patrickpynadath1.github.io, @PatrickPyn35903) will present his paper CANDI: Hybrid Discrete-Continuous Diffusion Models. Collaborators: @thjashin and @ruqi_zhang Paper link: arxiv.org/abs/2510.22510

English

1

4

23

6.2K

Discrete Diffusion Reading Group@diffusion_llms·5 Mar

⚠️To folks in CET and IST (along with many other time zones): Daylight Saving in US ends on Mar 8, so the reading group will happen 1h earlier than the usual time in your time zone from now on! Old: 7 PM CET, 11:30 PM IST New: 6 PM CET, 10:30 PM IST

Discrete Diffusion Reading Group@diffusion_llms

📢Mar 9 (Mon): CANDI: Hybrid Discrete-Continuous Diffusion Models 🤔Continuous diffusion dominates image generation. LLMs process text through continuous embeddings. So why does discrete diffusion still win for language? 🍬CANDI explains why — it’s a “temporal dissonance”: at large vocabulary sizes, Gaussian noise destroys token identity way before it meaningfully degrades the continuous signal. The model can either learn discrete conditional structure or continuous geometry, but not both simultaneously. 🔑The fix? Keep some tokens clean as anchors for discrete structure, corrupt the rest with Gaussian noise. Decoupling the two lets the model learn both simultaneously — enabling off-the-shelf classifier guidance and better low-NFE generation. This Monday, Patrick Pynadath (Purdue) (patrickpynadath1.github.io, @PatrickPyn35903) will present his paper CANDI: Hybrid Discrete-Continuous Diffusion Models. Collaborators: @thjashin and @ruqi_zhang Paper link: arxiv.org/abs/2510.22510

English

0

3

7

872

Discrete Diffusion Reading Group أُعيد تغريده

Patrick Pynadath@PatrickPyn35903·5 Mar

Very excited to chat about continuous and discrete diffusion for language!

Discrete Diffusion Reading Group@diffusion_llms

📢Mar 9 (Mon): CANDI: Hybrid Discrete-Continuous Diffusion Models 🤔Continuous diffusion dominates image generation. LLMs process text through continuous embeddings. So why does discrete diffusion still win for language? 🍬CANDI explains why — it’s a “temporal dissonance”: at large vocabulary sizes, Gaussian noise destroys token identity way before it meaningfully degrades the continuous signal. The model can either learn discrete conditional structure or continuous geometry, but not both simultaneously. 🔑The fix? Keep some tokens clean as anchors for discrete structure, corrupt the rest with Gaussian noise. Decoupling the two lets the model learn both simultaneously — enabling off-the-shelf classifier guidance and better low-NFE generation. This Monday, Patrick Pynadath (Purdue) (patrickpynadath1.github.io, @PatrickPyn35903) will present his paper CANDI: Hybrid Discrete-Continuous Diffusion Models. Collaborators: @thjashin and @ruqi_zhang Paper link: arxiv.org/abs/2510.22510

English

0

3

14

992

Discrete Diffusion Reading Group@diffusion_llms·3 Mar

📢 Missed the talk? Make sure to check the recording on YouTube! youtu.be/OYgq_3zf3IE 👀

YouTube

Discrete Diffusion Reading Group@diffusion_llms

📢Mar 2 (Mon): Reasoning with Latent Tokens in Diffusion Language Models ❓Diffusion language models outperform AR on synthetic reasoning tasks, but why? 🔑This paper traces the answer to a surprising mechanism: diffusion models naturally maintain "latent tokens" -- joint predictions over positions they won't immediately decode -- that enable planning and lookahead! Latent tokens control a smooth tradeoff between inference speed and quality, and that this mechanism yields large gains in AR models on the same reasoning tasks where they've traditionally struggled. This Monday, Andre He (LTI @ CMU) (andrehe02.github.io, @Andre3035858461) will present his recent paper Reasoning with Latent Tokens in Diffusion Language Models! Collaborators: Sean Welleck (@wellecks), Daniel Fried (@dan_fried) Paper link: arxiv.org/abs/2602.03769

English

0

4

18

2.6K

Discrete Diffusion Reading Group أُعيد تغريده

Subham Sahoo@ssahoo_·3 Mar

🔥Duo Chapter-2 is trending on scholar-inbox.com/home 👇Check the following thread for more details about the paper

Subham Sahoo@ssahoo_

🔥New Paper drop: The Diffusion Duality (Ch. 2): 𝚿-Samplers #ICLR2026 🚀 Inference‑time scaling for uniform diffusion‑LLMs (Duo) 🥊 Beats Masked diffusion on text + image generation 🔖 openreview.net/forum?id=RSIoY… 🌐 s-sahoo.com/duo-ch2/ 🖥️ github.com/s-sahoo/duo w/ @jdeschena @caglarml (1 / 3)

English

0

3

27

3.3K

Discrete Diffusion Reading Group@diffusion_llms·26 Şub

📢Mar 2 (Mon): Reasoning with Latent Tokens in Diffusion Language Models ❓Diffusion language models outperform AR on synthetic reasoning tasks, but why? 🔑This paper traces the answer to a surprising mechanism: diffusion models naturally maintain "latent tokens" -- joint predictions over positions they won't immediately decode -- that enable planning and lookahead! Latent tokens control a smooth tradeoff between inference speed and quality, and that this mechanism yields large gains in AR models on the same reasoning tasks where they've traditionally struggled. This Monday, Andre He (LTI @ CMU) (andrehe02.github.io, @Andre3035858461) will present his recent paper Reasoning with Latent Tokens in Diffusion Language Models! Collaborators: Sean Welleck (@wellecks), Daniel Fried (@dan_fried) Paper link: arxiv.org/abs/2602.03769

English

0

6

72

26.2K

Discrete Diffusion Reading Group

اكتشف