Michal Stary (@MichalStaryy) - Twitter Profili | Zamantika Mersobahis Locabet

Michal Stary@MichalStaryy·24 Nis

If you are at #ICLR2026 🇧🇷, come to check our poster tomorrow (Sat) morning at Pavillon 4 PA-#3016.

Check out our #ICLR2026 paper Generative View Stitching! I unfortunately couldn’t attend but @MichalStaryy will be presenting our poster tomorrow (Sat) morning at Pavillon 4 PA-#3016. Shoutout to my other collaborators @BoyuanChen0, @gkopanas, and @vincesitzmann!

English

0

3

231

Michal Stary retweetledi

Chonghyuk (ND) Song@ndsong95·24 Nis

Check out our #ICLR2026 paper Generative View Stitching! I unfortunately couldn’t attend but @MichalStaryy will be presenting our poster tomorrow (Sat) morning at Pavillon 4 PA-#3016. Shoutout to my other collaborators @BoyuanChen0, @gkopanas, and @vincesitzmann!

Chonghyuk (ND) Song@ndsong95

Introducing Generative View Stitching (GVS), a non-autoregressive sampling method for length extrapolation of video diffusion models. GVS enables collision-free camera-guided video generation for predefined trajectories, including Oscar Reutersvärd's Impossible Staircase (1/9).

English

1

9

43

6.1K

Michal Stary retweetledi

Kwang Moo Yi@kwangmoo_yi·28 Mar

Zhu et al., "GaussFusion: Improving 3D Reconstruction in the Wild with Geometry-Informed Video Generator" Another diffusion "fixer" BUT with now a geometry buffer (providing all the info about what IS being rendered)

English

3

6

82

6K

Michal Stary retweetledi

Andrew Davison@AjdDavison·28 Kas

Makes sense. Matching and 3D reconstruction are inherently iterative computations; this nice paper gives hints on how DUSt3R's transformer achieves that. Now we can get on with figuring out how to do it tens or hundreds of times better/faster with a more specific architecture.

Dmytro Mishkin 🇺🇦@ducha_aiki

Understanding Multi-View Transformers Michal Stary @jgaubil @_atewari @vincesitzmann tl;dr: DUSt3R self-attention is it secretly a diffusion model, and cross-attention is matching. arxiv.org/abs/2510.24907

English

4

24

194

21.6K

Michal Stary retweetledi

Dmytro Mishkin 🇺🇦@ducha_aiki·27 Kas

Understanding Multi-View Transformers Michal Stary @jgaubil @_atewari @vincesitzmann tl;dr: DUSt3R self-attention is it secretly a diffusion model, and cross-attention is matching. arxiv.org/abs/2510.24907

English

2

50

253

37.2K

Michal Stary retweetledi

George Kopanas@gkopanas·2 Kas

Did you ever want to navigate an 18 story mansion? Well, now you can do that without even retraining your Diffusion Forcing video model. Check the excellent work we did with @ndsong95 @MichalStaryy @BoyuanChen0 @vincesitzmann andrewsonga.github.io/gvs/

Chonghyuk (ND) Song@ndsong95

GVS also stably scales to longer videos (1080 frames) given more test-time compute, establishing itself as a promising alternative to autoregression for long video generation. Note that this video is generated without any keyframe interpolation! (8/9)

English

1

10

58

12.7K

Michal Stary retweetledi

Kwang Moo Yi@kwangmoo_yi·30 Eki

Stary and Gaubil et al., "Understanding multi-view transformers" We use Dust3r as a black box. This work looks under the hood at what is going on. The internal representations seem to "iteratively" refine towards the final answer. Quite similar to what goes on in point cloud net

English

2

14

79

6.7K

Michal Stary@MichalStaryy·9 Kas

Check out our work that illucidates hidden mechanisms behind multi-view transformers for 3D reconstruction like DUSt3R!

Julien Gaubil@jgaubil

DUSt3R et al. are impressive, but how do they actually work? We explored this, and share insights on iterative reconstruction, the roles of cross- and self-attention, and emerging correspondences across the network [1/8] ⬇️

English

0

1

43

Michal Stary@MichalStaryy·2 Kas

Checkout our latest work on long-video generation! Contrary to autoregressive rollouts, GVS respects a predefined long-horizon camera trajectory and generates worlds that never collide. Unlocking generation of 18-floor houses without running through the walls and much more!

Chonghyuk (ND) Song@ndsong95

Introducing Generative View Stitching (GVS), a non-autoregressive sampling method for length extrapolation of video diffusion models. GVS enables collision-free camera-guided video generation for predefined trajectories, including Oscar Reutersvärd's Impossible Staircase (1/9).

English

0

65

Michal Stary retweetledi

Chonghyuk (ND) Song@ndsong95·31 Eki

Introducing Generative View Stitching (GVS), a non-autoregressive sampling method for length extrapolation of video diffusion models. GVS enables collision-free camera-guided video generation for predefined trajectories, including Oscar Reutersvärd's Impossible Staircase (1/9).

English

8

40

206

73.2K

Michal Stary

Keşfet