Noam Elata
@NoamElata
PhD candidate @TechnionLive, Generative AI researcher @Apple


A picture is worth a thousand words, but can an LLM get the picture if it has never seen images before? 🧵 MIT CSAIL researchers quantify how much visual knowledge LLMs trained purely on text have. They test a language model's visual aptitude by its ability to write, recognize, and correct drawing code that can be rendered into illustrations. Starting with language models trained on text alone, they show it is possible to train a preliminary vision system that can make judgments about real images: bit.ly/4cmkBaq
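For concreteness, a minimal sketch of this kind of probe, assuming a hypothetical `query_llm` chat-completion helper and my own prompt wording (not the paper's exact setup): the model writes drawing code for a concept, the code is rendered off-screen, and the model is separately asked to recognize what a given drawing program depicts.

```python
# Minimal sketch of a text-only visual-aptitude probe. `query_llm` is a
# hypothetical stand-in for any chat-completion API; prompts are assumptions.
import matplotlib
matplotlib.use("Agg")            # render without a display
import matplotlib.pyplot as plt

def query_llm(prompt: str) -> str:
    """Hypothetical LLM call; swap in a real chat-completion client."""
    raise NotImplementedError

def draw(concept: str, out_path: str) -> str:
    """Test *writing*: ask the model for drawing code, then render it."""
    code = query_llm(
        f"Write matplotlib code that draws a {concept}. "
        "Draw only on the provided Axes object `ax`. Return code only."
    )
    fig, ax = plt.subplots()
    exec(code, {"ax": ax})       # run the generated drawing program
    fig.savefig(out_path)
    return code

def recognize(code: str) -> str:
    """Test *recognition*: ask the model what a drawing program depicts."""
    return query_llm(f"What does this drawing code depict?\n\n{code}")
```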

Accelerate your transformer model with the new Block-Sparse-Flash-Attention! github.com/Danielohayon/B… This training-free, drop-in replacement extends FlashAttention-2 with minimal code changes (CUDA Kernels Included). Paper: arxiv.org/abs/2512.07011
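The repo's CUDA kernels aside, the core idea is easy to state in plain PyTorch: tile the sequence into blocks and compute attention only for (query-block, key-block) pairs that the sparsity mask marks as live. A rough sketch under my own assumed shapes (this illustrates block sparsity in general, not the repository's API):

```python
# Plain-PyTorch sketch of block-sparse attention (illustrative only; the
# linked repo implements this as a fused FlashAttention-2-style CUDA kernel).
import torch
import torch.nn.functional as F

def block_sparse_attention(q, k, v, block_mask, block=64):
    # q, k, v: (seq, dim) with seq divisible by `block`;
    # block_mask: (seq//block, seq//block) bool, True = compute this tile pair
    n, d = q.shape
    scale = d ** -0.5
    out = torch.zeros_like(q)
    for i in range(n // block):
        qi = q[i * block:(i + 1) * block]               # one query tile
        live = block_mask[i].nonzero(as_tuple=True)[0]  # key tiles to keep
        if live.numel() == 0:
            continue                                    # skip empty rows entirely
        ks = torch.cat([k[j * block:(j + 1) * block] for j in live])
        vs = torch.cat([v[j * block:(j + 1) * block] for j in live])
        attn = F.softmax((qi @ ks.T) * scale, dim=-1)   # dense within live tiles
        out[i * block:(i + 1) * block] = attn @ vs
    return out
```

The speedup comes from never computing the masked-out tiles, and with an all-True mask the computation reduces to standard dense attention.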

What is the probability of an image? What do the highest and lowest probability images look like? Do natural images lie on a low-dimensional manifold? In a new preprint with @ZKadkhodaie @EeroSimoncelli, we develop a novel energy-based model in order to answer these questions: 🧵
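A hedged sketch of the energy-based framing, with an illustrative toy network (names and architecture are my assumptions, not the authors' model): an energy network defines log p(x) = -E(x) - log Z, and since log Z cancels in ratios, relative probabilities of two images are computable from energies alone.

```python
# Toy energy-based model: log p(x) = -E(x) - log Z. The architecture is an
# illustrative assumption; only the energy-to-probability bookkeeping matters.
import torch
import torch.nn as nn

class EnergyNet(nn.Module):
    def __init__(self):
        super().__init__()
        self.net = nn.Sequential(
            nn.Conv2d(3, 32, 3, padding=1), nn.SiLU(),
            nn.Conv2d(32, 32, 3, padding=1), nn.SiLU(),
            nn.Flatten(), nn.LazyLinear(1),
        )

    def forward(self, x):        # x: (batch, 3, H, W) -> energy, shape (batch,)
        return self.net(x).squeeze(-1)

energy = EnergyNet()
x1 = torch.rand(1, 3, 32, 32)
x2 = torch.rand(1, 3, 32, 32)
# log p(x1) - log p(x2) = E(x2) - E(x1): the intractable log Z cancels, so
# "which image is more probable?" needs no normalization constant.
log_ratio = energy(x2) - energy(x1)
```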

A key challenge for interpretability agents is knowing when they’ve understood enough to stop experimenting. Our @NeurIPSConf paper introduces a self-reflective agent that measures the reliability of its own explanations and stops once its understanding of models has converged.
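In spirit (this is my illustrative pseudocode made runnable, not the paper's algorithm), the stopping rule looks like a convergence test on successive explanations:

```python
# Illustrative stop-when-converged interpretability loop. The three hooks are
# hypothetical: `run_experiment` probes the target model, `explain` summarizes
# the evidence, and `agreement` scores similarity of two explanations in [0, 1].
def investigate(run_experiment, explain, agreement,
                max_steps=50, tol=0.95, patience=3):
    evidence, prev, stable = [], None, 0
    for _ in range(max_steps):
        evidence.append(run_experiment())      # gather one more observation
        current = explain(evidence)            # re-derive the explanation
        if prev is not None and agreement(prev, current) >= tol:
            stable += 1                        # explanation barely changed
            if stable >= patience:
                return current                 # understanding converged: stop
        else:
            stable = 0                         # still learning; keep going
        prev = current
    return prev                                # budget exhausted
```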

Nested Diffusion Processes for Anytime Image Generation proposes an anytime diffusion-based method that can generate viable images when stopped at arbitrary times before completion. Using existing pretrained diffusion models, we show that the generation scheme can be recomposed as two nested diffusion processes, enabling fast iterative refinement of a generated image. We use this Nested Diffusion approach to peek into the generation process and enable flexible scheduling based on the instantaneous preference of the user. In experiments on ImageNet and Stable Diffusion-based text-to-image generation, we show, both qualitatively and quantitatively, that our method's intermediate generation quality greatly exceeds that of the original diffusion model, while the final slow generation result remains comparable. paper page: huggingface.co/papers/2305.19…
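For intuition, a heavily simplified sketch of the anytime loop (an illustration of the nesting idea, not the paper's exact recomposition): each outer step runs a short inner diffusion all the way to a complete image, so interrupting after any outer step still returns a viable sample.

```python
# Illustrative nested sampler. `denoise_step` stands in for one call to a
# pretrained diffusion model's sampler; the schedules here are assumptions.
import torch

def nested_diffusion(denoise_step, shape, outer_steps=10, inner_steps=5):
    x = torch.randn(shape)                    # outer chain starts at pure noise
    preview = None
    for o in reversed(range(1, outer_steps + 1)):
        z = x
        for i in range(inner_steps):          # inner diffusion: noise -> image
            z = denoise_step(z, o, i)
        preview = z                           # a complete image: the "anytime"
                                              # output, usable if we stop here
        noise_level = (o - 1) / outer_steps   # re-noise less as o decreases,
        x = preview + noise_level * torch.randn(shape)  # refining the outer chain
    return preview
```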
