Molei Tao

363 posts

@MoleiTaoMath

Georgia Tech Prof; Tsinghua, Caltech, NYU Courant * deep learning theory * (diffusion) generative model, probabilistic ML * AI4Science * applied & comput. math

Joined October 2021
204 Following, 1.8K Followers
Pinned Tweet
Molei Tao
Molei Tao@MoleiTaoMath·
Does GenAI create new knowledge? arxiv.org/abs/2602.06021 gives * 1st explicit characterization of diffusion model's generalization * more precise than offered by classical stat. learning theory * systematic integration of various inductive biases (training+architecture+inference)
3
26
165
11.1K
Molei Tao retweeted
Paata Ivanisvili
Paata Ivanisvili@PI010101·
Every subgaussian is a sum of Gaussians -- Antoine Song resolves Talagrand's conjectures arxiv.org/pdf/2602.22342
6
65
399
38.6K
Nika Haghtalab
Nika Haghtalab@nhaghtal·
This week I was promoted to the rank of Associate Professor at @Berkeley_EECS ! In a remarkable show of enthusiasm, the committee apparently tore a hole in spacetime to make me an Associate Professor 9 months ago!
58
14
717
53.9K
Molei Tao
Molei Tao@MoleiTaoMath·
@ottogin1 Thank you for the very interesting question! I feel analysis with cfg is possible, but we haven't done it yet. I also feel I can understand and agree with your excellent intuition. Hopefully I can reply with something more scientific in the future. Really appreciate your reply.
1
0
2
64
Artem Lukoianov
Artem Lukoianov@ottogin1·
Hi @MoleiTaoMath ! Thank you for sharing your work, I actually read it in February, very interesting. I was wondering how does CFG plug into your analysis. It seems that CFG should play a crucial role in the alignment phase, and almost feels like it should be bad for alignment (but in practice it is not?). This reminds me of some thoughts from "Guiding a Diffusion Model with a Bad Version of Itself".
1
0
2
124
Jiaxin Shi
Jiaxin Shi@thjashin·
@liyzhen2 Haven’t been following lately - would be good to understand the connection!
2
0
1
846
Molei Tao
Molei Tao@MoleiTaoMath·
An interesting connection between drifting models and score-based generative models
Chieh-Hsin (Jesse) Lai@JCJesseLai

[1/D] 🤔 What are drifting models really connected to?

📢 Our new paper, A Unified View of Drifting and Score-Based Models, shows that the bridge to score-based models is clear and precise (w/ team and @mittu1204, @StefanoErmon, @MoleiTaoMath)!

✍️ Main takeaway: drifting is more closely connected to score-based (diffusion) modeling than it may first appear!

🔗 arxiv.org/abs/2603.07514

🎯 Here’s why: Drifting’s mean-shift moves a sample toward the kernel-weighted average of nearby samples. The score function points toward regions of higher density. So both describe local directions that push samples toward where data is denser.

We show that this link is exact for Gaussian kernels (Section 4.1):

📌 Drifting’s mean-shift = a rescaled score-matching field between the Gaussian-smoothed data and model distributions, i.e., the vector field underlying score matching (Tweedie!).

📌 This also clarifies the bridge to Distribution Matching Distillation (DMD): both use score-based transport directions and differ only in how the score is realized. Drifting does so nonparametrically through kernel neighborhoods, whereas DMD relies on a pretrained diffusion teacher.

🤔 So what happens for the default Laplace kernel used in drifting models? Let’s look below 👇
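The Gaussian-kernel claim in the quoted post can be sanity-checked numerically for a single distribution. The sketch below is not the paper's code. It checks only the classical identity that, for a Gaussian kernel of bandwidth `h`, the mean-shift vector equals `h**2` times the score of the Gaussian-smoothed empirical density. The paper's statement concerns a field between data and model distributions, which this one-sided, one-dimensional check does not cover. All function names, the sample set, and the values of `x` and `h` are illustrative assumptions.

```python
import math
import random


def mean_shift(x, samples, h):
    """Gaussian-kernel mean-shift at x: weighted average of samples minus x."""
    weights = [math.exp(-((x - s) ** 2) / (2 * h * h)) for s in samples]
    total = sum(weights)
    return sum(w * s for w, s in zip(weights, samples)) / total - x


def log_smoothed_density(x, samples, h):
    """Log of the Gaussian-smoothed empirical density, up to a constant in x."""
    return math.log(sum(math.exp(-((x - s) ** 2) / (2 * h * h)) for s in samples))


def score(x, samples, h, eps=1e-5):
    """Central finite-difference estimate of d/dx log p_h(x)."""
    upper = log_smoothed_density(x + eps, samples, h)
    lower = log_smoothed_density(x - eps, samples, h)
    return (upper - lower) / (2 * eps)


# Illustrative setup (assumed values, not taken from the paper).
random.seed(0)
samples = [random.gauss(0.0, 1.0) for _ in range(200)]
x, h = 0.5, 0.7

# If the identity holds, the gap is at the level of finite-difference error.
gap = abs(mean_shift(x, samples, h) - h * h * score(x, samples, h))
print(gap)
```

The identity follows from differentiating the log of a sum of Gaussians: each term contributes `(s - x) / h**2` weighted by its normalized kernel weight, which is exactly the mean-shift vector divided by `h**2`.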

1
7
47
5K
Molei Tao retweeted
Wei Guo
Wei Guo@WeiGuo01·
How to bring the speed & precision of continuous adjoint matching (AM) to discrete neural samplers? Introducing discrete adjoint Schrödinger bridge sampler (DASBS): a unified framework for authentic discrete AM! 🎲✨ Joint work with @JaemooChoi et al.: 📄arxiv.org/abs/2602.08243
1
10
39
3.5K
Molei Tao retweeted
Lenka Zdeborova
Lenka Zdeborova@zdeborova·
🚀 The Applied Maths department at École Polytechnique is hiring a Monge Assistant Professor (Tenure track) in Statistical Learning & AI for Mathematics and Science. 📌 For all details and to apply: candidatures-calliope.polytechnique.fr/calliope-fo/re…
0
19
68
6.3K
Molei Tao retweeted
Statistics (Machine Learning) Papers
How Does the ReLU Activation Affect the Implicit Bias of Gradient Descent on High-dimensional Neural Network Regression? Kuo-Wei Lai, Guanghui Wang, Molei Tao, Vidya Muthukumar arxiv.org/abs/2603.04895 [stat.ML cs.LG math.OC]
0
2
13
873
Molei Tao retweeted
Francis Bach
Francis Bach@BachFrancis·
Looking for alternatives to quadratic functions for closed-form analysis in optimization? This post explores matrix Riccati dynamics and their applications to neural networks. francisbach.com/closed-form-dy…
0
23
158
9.1K
Molei Tao retweeted
Leonardo de Moura
Leonardo de Moura@Leonard41111588·
AI is writing a growing share of the world's software. No one is formally verifying any of it. New essay: "When AI Writes the World's Software, Who Verifies It?" leodemoura.github.io/blog/2026/02/2…
41
246
1.6K
421.1K
Molei Tao
Molei Tao@MoleiTaoMath·
@josephdviviano Thank you Joseph for the kind words! Glad that we are interested in similar (and important, I think) problems.
0
0
1
87
Joseph Viviano
Joseph Viviano@josephdviviano·
@MoleiTaoMath I've been looking high and low for something like this, thank you
1
0
1
349