
Code for VSTAR is released 🚀 It enables longer video synthesis w/o re-training and allows control over the dynamics of the synthesised video. Check it out 👇
Anna Khoreva

@anna_khoreva
Head of Applied Science @Zalando, previously Senior Research Manager @Bosch_AI, PhD @cvml_mpiinf. GenAI and Computer Vision enthusiast. Opinions are my own.

July has been a big month for Viser!
- Released v1.0.0 😊
- We did some writing
Some demos 👇



Have you ever been bothered by the constraints of fixed-sized 2D-grid tokenizers? We present FlexTok, a flexible-length 1D tokenizer that enables autoregressive models to describe images in a coarse-to-fine manner. flextok.epfl.ch arxiv.org/abs/2502.13967 🧵 1/n


FlexiDiT: Your Diffusion Transformer Can Easily Generate High-Quality Samples with Less Compute

🤗 Code is out: github.com/boschresearch/…
🚀 TL;DR: VSTAR generates longer videos with dynamic visual evolution in a single pass. No fine-tuning is needed!
🙌 Check out our project page: yumengli007.github.io/VSTAR/
Bill Beluch @margret_keuper @isDanZhang @anna_khoreva @Bosch_AI ❤️


