

Tons of papers re diffusion/flow matching at ML confs these days, but to my surprise very few of them consider learning the prior🤔 Am I missing any important work here? 🙏 for suggestions
Zhuo Sun
111 posts

@JasonSun10
Assistant Professor@SUFE, PhD in Comp. Stats & Machine Learning@University College London


Tons of papers re diffusion/flow matching at ML confs these days, but to my surprise very few of them consider learning the prior🤔 Am I missing any important work here? 🙏 for suggestions








Our paper "Multilevel Control Functional" with score 8,8,8 accepted at ICLR 2026, is not recommended 'oral' at ICLR, which ranks top 20 in over 19000 submissions #iclr


new preprint! turns out, if your model is confident on _any_ long enough input, we can find other inputs where the model is wrong, yet its perplexity won't really tell you it's wrong 📉 work with @fedzbar @ccperivol @sindero and Razvan


A new robust solution to goodness-of-fit (GOF) testing for unnormalized models! Joint work with @fx_briol. ❌Existing kernel GOF tests based on KSD are NOT robust. ✅We introduce a simple, provably robust extension that adds no extra computational cost! arxiv.org/abs/2408.05854

Diffusion serving is expensive: dozens of timesteps per image, and a lot of redundant compute between adjacent steps. ⚡vLLM-Omni now supports diffusion cache acceleration backends (TeaCache + Cache-DiT) to reuse intermediate Transformer computations — no retraining, minimal quality impact! 🚀Benchmarks (NVIDIA H200, Qwen-Image 1024x1024): TeaCache 1.91x, Cache-DiT 1.85x. For Qwen-Image-Edit, Cache-DiT hits 2.38x! Blog: blog.vllm.ai/2025/12/19/vll… Docs: docs.vllm.ai/projects/vllm-… #vLLM #vLLMOmni #DiffusionModels #AIInference
