Assaf Shocher
@AssafShocher
Assistant Professor @TechnionLive
80 posts · Joined July 2020 · 231 Following · 382 Followers

Pinned Tweet
Assaf Shocher @AssafShocher
Most of you probably heard about Invertible Neural Networks. But have you heard of *Pseudo*-Invertible Networks? And what does Pseudo-Inverse (PInv) of a non-linear function even mean? 👇
[image attached]
Assaf Shocher @AssafShocher
@geopavlakos And this is even without them knowing how good you are at getting agents to locate soccer balls! Well deserved, congrats!
Georgios Pavlakos @geopavlakos
I'm honored to receive the NSF CAREER award! 🎉 Thank you to my students, mentors, and colleagues. Very grateful to NSF for their support.
Qixing Huang@qixing_huang

Congratulations to @geopavlakos on winning an NSF career award (nsf.gov/awardsearch/sh…). When George was hired two years ago, we did expect him to do well. Yet, his performance has exceeded our expectations: NSF medium + Career + a paper award at ICCV + Best thesis co-mentor

Lior Yariv @YarivLior
Why pay full compute for pixels you're not even looking at? In our new work, Foveated Diffusion, we introduce a new concept for efficient image and video generation, motivated by how the human visual system works. (See full thread below)
Gordon Wetzstein@GordonWetzstein

High-resolution image and video generation is hitting a wall because attention in DiTs scales quadratically with token count. But does every pixel need to be in full resolution? Introducing Foveated Diffusion: a new approach for efficient diffusion-based generation that allocates compute where it matters most. 1/7🧵

Assaf Shocher retweeted
Nikola Georgiev @nickinpractice
@huskydogewoof I recall reading a paper called Idempotent Generative Network. The idea of training a model such that the data distribution is a fixed point isn't new...
Benhao Huang @huskydogewoof
“In contrast, our work presents a conceptually different paradigm and does not rely on SDE or ODE formulations as in diffusion or flow models.” Reading it gave me a very particular feeling: an equilibrium-flavored sweet spot that drifts away from, yet stays spiritually adjacent to (ranked from farther to closer), diffusion and flow matching, GAN-style one-shot generation, and DEQ-style fixed-point thinking. Instead of defining sampling as explicit time dynamics, they learn a drifting field that moves samples, while training itself evolves the pushforward distribution. The “equilibrium” is not a metaphor; it is literally the point where the model distribution matches the data distribution. The punchline is delightfully clean: once training reaches parameters that achieve the equilibrium (V, the measure of distance between the data and network distributions, being zero), the model naturally supports one-step inference; and the ImageNet 256 numbers are strong (FID 1.54 in latent space, 1.61 in pixel space). ❓Question: are we all secretly chasing the same fixed point, just in different coordinate systems?
[image attached]
Zhengyang Geng@ZhengyangGeng

A new paradigm & a new member in the push toward 1-step & e2e generative modeling! Great work by @Goodeat258 Mingyang!!! Cannot be more excited to read this: learning to drift with my spindrift. arxiv.org/abs/2602.04770

Assaf Shocher retweeted
Ethan Weber @ethanjohnweber
I made a Claude Code skill that generates conference posters 🛠️ Instead of a static PDF, it outputs a single HTML file — drag to resize columns, swap sections, adjust fonts, then give your layout back to Claude. 🔁 🔗 Skill 👉 github.com/ethanweber/pos…
Assaf Shocher retweeted
Amil Dravid @_AmilDravid
Consider submitting to our workshop How Do Vision Models Work @CVPR 2026! We have both a non-proceedings and a proceedings track. More info at sites.google.com/view/how-cvpr-….
How Do Vision Models Work? @ CVPR2026 (Prev: MIV)@how_cvpr2026

📢 CVPR decisions are out. Some of you are celebrating. Some of you are "contemplating"🫠 We got you all: do you study how a vision model works? Submit to the HOW workshop @CVPR 2026! New Deadline: March 7, AoE (for both proceedings and non) Link: tinyurl.com/vuk2kysz

Assaf Shocher retweeted
Amir Bar @_amirbar
An interesting connection between Drifting models @Goodeat258 and Idempotent Generative Networks by @AssafShocher et al. Start from the drifting loss: L = E_z ‖f(z) − sg(f(z) + V(f(z)))‖². Set the drift as the generator residual: V(x) = f(x) − x. This recovers idempotence: L = E_z ‖f(z) − sg(f(f(z)))‖². I.e., a sample that is already on the data manifold should not change!
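For readers skimming, the substitution in the tweet above, spelled out (sg denotes stop-gradient; notation follows the tweet):

```latex
\begin{align*}
L &= \mathbb{E}_z \left\| f(z) - \mathrm{sg}\!\left(f(z) + V(f(z))\right) \right\|^2
  && \text{drifting loss} \\
V(x) &= f(x) - x
  && \text{drift as generator residual} \\
f(z) + V(f(z)) &= f(z) + \big(f(f(z)) - f(z)\big) = f(f(z)) && \\
\Rightarrow\ L &= \mathbb{E}_z \left\| f(z) - \mathrm{sg}\!\left(f(f(z))\right) \right\|^2
  && \text{idempotence objective}
\end{align*}
```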
Alexia Jolicoeur-Martineau@jm_alexia

Byebye diffusion, say hello to Drifting models. Drifting models will take over diffusion models within the next year. I was told many times that we figured it all out, that there was nothing else to invent in generative AI and it was just about scaling. Wrong again and again.

Assaf Shocher @AssafShocher
@isosnovik @Yamitehr @NimrodBe Thank you for pointing this out! This is true and we should have cited it, sorry. It also has a similar structure to SurVAE; the difference is in the use of the degree of freedom. Anyway, we will update and add a citation to PIE, thanks again.
Assaf Shocher @AssafShocher
Solving inverse problems with diffusion models is a hot topic (DDRM, DDNM, etc). We combine this with our PInv. We take a frozen diffusion model and apply our NLBP after every single timestep. The diffusion is responsible for the data prior, while our back-projection strictly forces the image to stay on the "manifold" of the target class (e.g., "Must be Smiling"). This is just one demo of what you can do with it and we have many plans for the future.
[image attached]
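A toy linear analogue of the interleaving described above, not the tweet's NLBP (which handles non-linear constraints): a stand-in "denoiser" alternates with classical back-projection x ← x + A⁺(y − Ax), which restores Ax = y after every prior step. All names here (`toy_denoise`, `A`, `y`) are illustrative, not the paper's API.

```python
import numpy as np

rng = np.random.default_rng(0)
A = rng.standard_normal((3, 8))   # linear "constraint" operator
x_true = rng.standard_normal(8)
y = A @ x_true                    # measurements the sample must satisfy
A_pinv = np.linalg.pinv(A)        # Moore-Penrose pseudo-inverse

def toy_denoise(x, t):
    # Stand-in for one frozen diffusion step: shrink toward the origin
    # plus a little noise (NOT a real diffusion model).
    return 0.9 * x + 0.05 * rng.standard_normal(x.shape)

x = rng.standard_normal(8)
for t in range(50):
    x = toy_denoise(x, t)             # prior step (data term)
    x = x + A_pinv @ (y - A @ x)      # back-projection: re-enforce A x = y

print(np.linalg.norm(A @ x - y))      # ≈ 0: constraint holds exactly
```

Because A has full row rank here, A·A⁺ = I, so the projection satisfies the constraint exactly; the non-linear version in the tweet plays the same role for a classifier-style constraint ("Must be Smiling").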
Jonas @LoosJonas
@AssafShocher Then the question arises: Do neural networks operate basically on the surface of a hypersphere?
Assaf Shocher @AssafShocher
What do you see when you imagine a high-dimensional standard Gaussian? There is a known anecdote that I'm embarrassed to say I wasn't aware of. Apparently, my mental image was wrong for years and I just got my mind blown 🤯. Just in case I can save someone else the embarrassment: a high-dim Gaussian doesn't look like a blob, but like a spherical shell! The curse of dimensionality dictates an image different from our intuition. In the illustration below, I visualized a 2D slice of the distribution for an increasing number of dimensions. The blue rings highlight the shell where 99.7% of the data actually lives (mean ± 3σ). Notice the shift from a solid "blob" at N=2 to a hollow, razor-thin skin at N=100K. 👇
[image attached]
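The shell picture is easy to check empirically: for x ~ N(0, Iₙ), ‖x‖ concentrates around √n with a standard deviation of roughly 1/√2 regardless of n, so the shell's *relative* thickness shrinks as n grows. A minimal sketch (dimensions chosen for memory, not matching the tweet's N=100K):

```python
import numpy as np

rng = np.random.default_rng(0)
for n in (2, 100, 10_000):
    x = rng.standard_normal((5000, n))     # 5000 samples from N(0, I_n)
    norms = np.linalg.norm(x, axis=1)
    # mean radius ~ sqrt(n); spread stays O(1), so relative width -> 0
    print(n, norms.mean() / np.sqrt(n), norms.std())
```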
Assaf Shocher @AssafShocher
@MaziyarPanahi My guess: Loss is calculated by dividing by hard-coded batch size, but the last batch in the epoch is smaller.
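The guessed bug in one picture: per-sample losses summed and divided by a hard-coded batch size, so the smaller final batch of an epoch gets a deflated value. `BATCH` and the loss values below are made up for illustration.

```python
import numpy as np

BATCH = 32
losses = np.ones(100)                    # 100 samples, per-sample loss = 1.0
for start in range(0, len(losses), BATCH):
    chunk = losses[start:start + BATCH]
    wrong = chunk.sum() / BATCH          # hard-coded divisor
    right = chunk.mean()                 # actual batch size
    print(len(chunk), wrong, right)      # last batch: 4 samples, wrong dips
```

Full batches agree (1.0 vs 1.0); the final 4-sample batch reports 4/32 = 0.125 instead of 1.0, producing the per-epoch dip. (In PyTorch, `DataLoader(..., drop_last=True)` is the usual way to sidestep this.)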
Maziyar PANAHI @MaziyarPanahi
repeat after me, it's ALWAYS: dataset, dataset, dataset!
[image attached]
Peyman Milanfar @docmilanfar
Because a research paper is expected to provide answers. More specifically, a question title:
- hides the result; it's inefficient
- teases the reader
- shows lack of authority
- invites Betteridge's law (answer: "No")
E.g.: "Does sleep affect memory?" vs. "The effects of sleep on memory"
Anand Bhattad@anand_bhattad

@docmilanfar @CSProfKGD I am still trying to understand why a question should be avoided as a title
