Giosue Migliorini

17 posts

Giosue Migliorini

@joh_sweh

Ph.D. student in stats @UCIrvine | former AI research intern @FlagshipPioneer, @LosAlamosNatLab, @UniBocconi

CA Katılım Haziran 2023

567 Takip Edilen52 Takipçiler

Giosue Migliorini retweetledi

Felix Draxler@FelixDrRelax·22 Nis

LLMs are autoregressive and slow? No! Parallel Token Prediction decodes multiple consistent tokens in one model call. PTP allows arbitrary dependencies in one call, unlike discrete diffusion. Practical: 2.4x speedup github.com/mandt-lab/ptp ICLR: Apr 23, morning poster P3-#608

English

22.7K

Giosue Migliorini@joh_sweh·3 Ara

@JIRIGESI Hi Jiri, I am a fourth year PhD candidate at UCI interested in probabilistic modeling, RL, and multi modal generative models. I’d love to grab a coffee if you are available!

English

Jiri@JIRIGESI·27 Kas

I’ll be at NeurIPS, if you’re interested in a 2026 PhD research internship with Amazon Store Foundation AI and want to work on agents, RL, and multi-modal, I’d love to connect at the conference.

English

1.6K

Giosue Migliorini retweetledi

Statistics (Machine Learning) Papers@StatsPapers·16 Eki

Efficient Inference for Coupled Hidden Markov Models in Continuous Time and Discrete Space. arxiv.org/abs/2510.12916

English

391

Giosue Migliorini@joh_sweh·23 Eki

What a great event at @AIHealthMIT , happy to be a part of #MoML2025 and thanks to the organizers!

MIT Jameel Clinic for AI & Health@AIHealthMIT

Next up in the Top 5: @joh_sweh presents mechanistic interpretability for pair representations in protein co-folding! #MoML2025

English

179

Giosue Migliorini@joh_sweh·31 Oca

@cloneofsimo Ideally we should sample with replacement from the dataset (potentially repeated datapoints in a single batch!). Epochs & not shuffling introduce periodic behaviors

English

180

Simo Ryu@cloneofsimo·31 Oca

Guys do we REALLY need to shuffle at the end of epoch? like REALLY REALLY ?

English

169

19.9K

Giosue Migliorini@joh_sweh·15 Oca

@jxmnop Amazing. It is probably a combination of all of these things.

English

178

dr. jack morris@jxmnop·15 Oca

posted the other day about model distillation. pretty much everyone responded with their theories professors, leading lab researchers, students, pseudoanonymous anime-profile posters seems there's no clear consensus why it works, but here are the theories 🧵

dr. jack morris@jxmnop

it's a baffling fact about deep learning that model distillation works method 1 - train small model M1 on dataset D method 2 (distillation) - train large model L on D - train small model M2 to mimic output of L - M2 will outperform M1 no theory explains this; it's magic

English

565

71.5K

Giosue Migliorini@joh_sweh·8 Oca

@historyinmemes This has been debunked. youtu.be/ikg3-GQLg3g?si…

YouTube

English

174

Historic Vids@historyinmemes·8 Oca

There were about 180 towers in Bologna in the 12th century. The tallest, 97 meters high, still stands.

English

461

5.4K

426.9K

Giosue Migliorini retweetledi

Keenan Crane@keenanisalive·14 Kas

We often think of an "equilibrium" as something standing still, like a scale in perfect balance. But many equilibria are dynamic, like a flowing river which is never changing—yet never standing still. These dynamic equilibria are nicely described by so-called "detailed balance"

English

106

1.7K

10.1K

638.6K

Giosue Migliorini retweetledi

Gabriel Peyré@gabrielpeyre·1 Kas

Bregman divergences are convex distance-like functionals that are locally Euclidean. Most algorithms handling Euclidean distances generalize to Bregman divergences. en.wikipedia.org/wiki/Bregman_d…

English

439

16.9K

Giosue Migliorini retweetledi

Andrej Karpathy@karpathy·26 Ağu

Future be like tab tab tab

Eesti

377

529

7.4K

722.4K

Giosue Migliorini@joh_sweh·27 Tem

@BalintMucsanyi @mkirchhof_ @coallaoh Mine might be a naive question. Why try to measure uncertainty through the predictive entropy, when the predictive variance admits such a nice and interpretable decomposition? I have never seen a confidence interval based on entropy.

English

203

Bálint Mucsányi@BalintMucsanyi·26 Tem

If you've been using formulas like this to split aleatoric from epistemic uncertainty: They don't work. To find out why, come visit our poster #68 at the #ICML2024 #SPIGM workshop at 15:10! @mkirchhof_ @coallaoh

English

203

25.2K

Giosue Migliorini@joh_sweh·27 Haz

@PreetumNakkiran Flow matching should recover the optimal vector field in the Benamou-Brenier perspective of optimal transport if data is sampled from the optimal coupling (in the static ot problem). In that case it would recover the identity

English

313

Preetum Nakkiran@PreetumNakkiran·27 Haz

easiest way to see that Flow Matching does not always produce an optimal transport: observe that the marginal flow from a distribution *to itself* is not the Identity (eg for linear flows & independent coupling)

English

4.9K

Giosue Migliorini retweetledi

Machine Learning (ML) Papers@Memoirs·8 Nis

Dynamic Conditional Optimal Transport through Simulation-Free Flows. arxiv.org/abs/2404.04240

English

249

Giosue Migliorini retweetledi

Sam Altman@sama·15 Şub

FXTUREVESCENT@fxturevescent

@sama Two golden retrievers podcasting on top of a mountain

ZXX

856

4.3K

52.6K

7.8M

Giosue Migliorini retweetledi

Jascha Sohl-Dickstein@jaschasd·12 Şub

Have you ever done a dense grid search over neural network hyperparameters? Like a *really dense* grid search? It looks like this (!!). Blueish colors correspond to hyperparameters for which training converges, redish colors to hyperparameters for which training diverges.

English

298

2.2K

11.3K

1.8M

Giosue Migliorini retweetledi

AI at Meta@AIatMeta·16 Haz

Introducing Voicebox, a new breakthrough generative speech system based on Flow Matching, a new method proposed by Meta AI. It can synthesize speech across six languages, perform noise removal, edit content, transfer audio style & more. More details on this work & examples ⬇️

English

420

1.8K

445.1K

Giosue Migliorini retweetledi

Stat.ML Papers@StatMLPapers·30 May

Functional Flow Matching. (arXiv:2305.17209v1 [cs.LG]) ift.tt/BmXJcbn

English

2.3K

Keşfet

@JIRIGESI @AIHealthMIT @cloneofsimo @jxmnop @historyinmemes @BalintMucsanyi @mkirchhof_ @coallaoh