Gabe Guo 🦄

60 posts

Gabe Guo 🦄 banner
Gabe Guo 🦄

Gabe Guo 🦄

@therealgabeguo

PhD Student in CS (generative AI) @Stanford Funded by @ENERGY Formerly @Columbia

Buffalo, NY Katılım Ocak 2024
161 Takip Edilen135 Takipçiler
Gabe Guo 🦄 retweetledi
Xavier Gonzalez
Xavier Gonzalez@xavierjgonzalez·
Fixed point iterations for parallelizing nonlinear dynamics is all the rage: - Newton for RNNs - Picard for diffusion models - Jacobi for parallel decode of LLMs But how do these techniques relate, and when should you use them? We show you how in our new paper 🧵
English
6
27
169
19.9K
Gabe Guo 🦄 retweetledi
evo-devo
evo-devo@Xiaojie_Qiu·
I am thrilled to share a paradigm-changing work in generative modeling: Flux Matching by the very brilliant graduate student Peter @peterpaohuang (co-mentored with @StefanoErmon). By extending beyond the score functions used in diffusion models to a broader class of vector fields, Flux Matching enables structural priors in dynamics, faster sampling, more interpretable generation, and many new possibilities. In biology, Peter shows that replacing the EM algorithm in scVelo with Flux Matching can dramatically improve RNA velocity accuracy, including cross-boundary correctness and consistency. Its ability to train on large-scale single-cell and perturbation data makes it especially exciting for building better causal virtual cell and virtual embryo models. I am deeply grateful for the support from Laude Institute @LaudeInstitute , Pantas and Ting Sutardja Foundation, the Wu Tsai Neurosciences Institute Big Ideas in Neuroscience Program, NIH DP2 grant 1DP2OD037052-01, and NIH K99/R00 grant 4K99HG012887-02 @NIH_CommonFund. Most importantly, I am deeply honored to have Peter as the first graduate student in the lab! I want to congratulate Peter on this outstanding achievement. He developed this idea independently, drawing on his background in causal learning, diffusion models, and Perturb-seq, and pushed through many technical challenges with remarkable creativity, persistence, and diligence. I cannot wait to see the impact this work will have in both machine learning and biology! See more information from Peter below:
Peter Pao-Huang@peterpaohuang

Introducing Flux Matching, a generative modeling paradigm that generalizes diffusion models to vector fields that need not be the score function. Enables structural priors in the dynamics, faster sampling, interpretable generation, and more! w/ @StefanoErmon @Xiaojie_Qiu 🧵⤵️

English
0
6
52
10.2K
Gabe Guo 🦄 retweetledi
Peter Pao-Huang
Peter Pao-Huang@peterpaohuang·
Introducing Flux Matching, a generative modeling paradigm that generalizes diffusion models to vector fields that need not be the score function. Enables structural priors in the dynamics, faster sampling, interpretable generation, and more! w/ @StefanoErmon @Xiaojie_Qiu 🧵⤵️
English
17
159
963
120.9K
Gabe Guo 🦄 retweetledi
Gilad
Gilad@giladturok·
1/ 🚨 New paper! DUEL: Exact Likelihood for Masked Diffusion via Deterministic Unmasking We give masked diffusion models (MDMs) proper likelihood — and therefore proper perplexity — for the first time. Turns out MDMs are closer to autoregressive models than previously thought.
Gilad tweet media
English
1
16
136
22.3K
Siddharth Ancha
Siddharth Ancha@siddancha·
Really cool work @therealgabeguo! 👏👏 Looking forward to reading all the fun details in the paper! We did something similar for robot action sequences in continuous time and space: x.com/siddancha/stat… (streaming-flow-policy.github.io) that unifies autoregressive and diffusion-based generation, but using ODEs instead of SDEs, and has some of the advantages you stated.
Siddharth Ancha@siddancha

Diffusion/flow policies 🤖 sample a “trajectory of trajectories” — a diffusion/flow trajectory of action trajectories. Seems wasteful? Presenting Streaming Flow Policy that simplifies and speeds up diffusion/flow policies by treating action trajectories as flow trajectories! 🌐 streaming-flow-policy.github.io 🧵 1/15

English
1
2
6
928
Gabe Guo 🦄
Gabe Guo 🦄@therealgabeguo·
🔥Empirically, ABC generates high quality videos, weather forecasts, & more. We look forward to unlocking its potential for scientific applications at scale. 🙏Thanks to @ENERGY for funding this research, and @StanfordHAI, @nvidia, & @NERSC for generous compute donations.🇺🇸🌲
English
0
0
2
284
Gabe Guo 🦄
Gabe Guo 🦄@therealgabeguo·
🌟Furthermore, ABC is a unification of the two dominant generative modeling paradigms: autoregressive and diffusion models. It extends autoregressive modeling to continuous time and space, and extends diffusion models to the non-Markovian case.
English
1
0
4
293
Gabe Guo 🦄 retweetledi
Elon Litman
Elon Litman@elon_lit·
We developed a unified theory of generalization in deep learning. It explains grokking, double descent, benign overfitting, and implicit bias. But theory is only half the story. It turns out that optimizing the population risk of any neural network amounts to a small change to your optimizer. 🧵
Elon Litman tweet media
English
21
128
1K
74.9K
Gabe Guo 🦄 retweetledi
Elon Litman
Elon Litman@elon_lit·
GOAT has been accepted to ICML! See you in Seoul🔥🐐🐐
Elon Litman tweet media
English
8
94
1.1K
109K
Gabe Guo 🦄 retweetledi
Xavier Gonzalez
Xavier Gonzalez@xavierjgonzalez·
Parallelizing nonlinear RNNs is gaining traction! More efficient than transformers; more expressive than linear RNNs. My PhD thesis provides an intro guide to the math (Newton's method) behind the parallelization. Great as a quick-start if you want to explore this new field!
Xavier Gonzalez tweet media
English
7
46
369
34.8K
Gabe Guo 🦄 retweetledi
Owen Dugan
Owen Dugan@OwenDugan·
Happy 🦃 Thanksgiving weekend! 🍂 This year, we cooked up a new recipe for juicy fact-storing MLPs. Instead of picking apart trained models, we asked: Can we construct fact-storing MLPs from scratch? 🤔 Spoiler: we can & we figured out how to slot these hand-crafted MLPs into Transformer blocks as modular fact stores! 🧩 New work with @garctrob @ronnygjunkins @jerrywliu @dylan_zinsley @EyubogluSabri Atri Rudra @HazyResearch! 🧵👇
Owen Dugan tweet media
English
8
47
340
64.6K