Chenyang Yuan

13 posts

Chenyang Yuan

Chenyang Yuan

@yuancy

Optimization and ML researcher at Toyota Research Institute, MIT PhD, Berkeley CS

Cambridge, MA Katılım Temmuz 2009
39 Takip Edilen45 Takipçiler
Chenyang Yuan
Chenyang Yuan@yuancy·
4/4 The model I used was GPT 5.4 xhigh on Codex. The code, prompts, harness, verification scripts, generated proof blueprint and Lean formalization are in this GitHub repo: github.com/yuanchenyang/n…
English
0
0
0
47
Chenyang Yuan
Chenyang Yuan@yuancy·
3/4 The harness gave the agent a computational toolkit to autonomously search for counterexamples using optimization solvers, so it can infer structure from dual certificates, write a blueprint, formalize the proof, and keep going until Lean accepted the final theorem.
Chenyang Yuan tweet media
English
1
0
0
84
Chenyang Yuan
Chenyang Yuan@yuancy·
I wrote a blog post about a Codex harness/workflow I built to autonomously prove a new mathematical result after 3 days of continuous work producing ~60k lines of Lean, where the input is a Lean theorem statement and output is a fully formalized proof. 1/4 chenyang.co/blog/automatic…
English
2
6
18
1.9K
Chenyang Yuan
Chenyang Yuan@yuancy·
I'm helping to organize this CVPR tutorial on analytic understanding of diffusion models, join us if you are interested in learning more about how diffusion models generalize!
Artem Lukoianov@ottogin1

Join us at @CVPR in Denver for a full-day tutorial about Analytic Understanding of Diffusion Models. The training objective of diffusion models has a closed-form solution -- yet it only memorizes. How do real models generalize? We'll unpack this paradox and the emerging analytical theory behind it. @yuancy @CScarvelis @MasonKamb @WangBinxu @vincesitzmann @JustinMSolomon @SuryaGanguli

English
0
0
3
105
Chenyang Yuan retweetledi
Artem Lukoianov
Artem Lukoianov@ottogin1·
Why do diffusion models produce new images instead of just memorizing the dataset? We show that they learn pixel correlation patterns from the data and therefore denoise locally, which promotes generalization. To test this idea, we compare trained diffusion models with a training-free algorithm that mixes local patches from the dataset. Surprisingly, this simple procedure already reproduces many properties of the trained models. 🧵 Check out this thread for more details about our Spotlight NeurIPS paper with @yuancy, @JustinMSolomon and @vincesitzmann.
Artem Lukoianov tweet media
English
8
28
258
24.7K
Chenyang Yuan retweetledi
Artem Lukoianov
Artem Lukoianov@ottogin1·
At #NeurIPS today in San Diego? Come check out poster #4409 (4:30–7:30 PM) today. We’re excited to share our spotlight paper on the generalization properties of diffusion models. Looking forward to great research conversations! @yuancy @JustinMSolomon @vincesitzmann
Artem Lukoianov tweet media
English
3
7
36
11.1K
Chenyang Yuan
Chenyang Yuan@yuancy·
I played a small part in the production of this video, and I'm really happy with how it addressed common misconceptions about diffusion models, as well as the beautiful visualizations and animations!
Grant Sanderson@3blue1brown

New video on the details of diffusion models: youtu.be/iv-5mZ_9CPY Produced by @welchlabs, this is the first in a small series of 3b1b this summer. I enjoyed providing editorial feedback throughout the last several months, and couldn't be happier with the result.

English
0
0
0
65
Alec Helbling
Alec Helbling@alec_helbling·
This smalldiffusion project is really fun. It is a tiny library for training and sampling from diffusion models. I love efforts like this, they are much easier to play around with than big libraries like diffusers. Link: github.com/yuanchenyang/s…
English
6
67
444
18.8K
Chenyang Yuan
Chenyang Yuan@yuancy·
In the problem sets, we use the library introduced in the first lecture (github.com/yuanchenyang/s…) to train diffusion models on custom data, as well as using pretrained models as building blocks for a variety of downstream tasks (see examples above) (4/4)
English
0
0
1
86
Chenyang Yuan
Chenyang Yuan@yuancy·
Using score distillation for 3D shape generation (L5) and wrapping up with a summary of the latest research making diffusion models better and faster (L6) The lecture recordings can be found here: youtube.com/playlist?list=… (3/4)
English
1
0
1
178
Chenyang Yuan
Chenyang Yuan@yuancy·
Last month I cotaught a class on diffusion models at MIT during the IAP term: practical-diffusion.org In the lectures, we first introduced diffusion models from a practitioner's perspective, showing how to build a simple but powerful implementation from the ground up (L1) (1/4)
English
1
4
14
1.5K