Willi Menapace @WilliMenapace

33 posts

PhD Student - University of Trento, Italy
Trento, Trentino-South Tyrol · Joined June 2021
86 Following · 192 Followers
Willi Menapace retweeted
Alexander Pondaven @alexpondaven
Introducing ActionParty: the first video world model that controls up to 7 players simultaneously on the same screen across 46 game environments. We tackle the action binding problem in video diffusion, ensuring each player's action is applied to the right subject. 🧵
[media attachment]
6 replies · 10 reposts · 51 likes · 9.1K views
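The "action binding" problem the tweet names can be pictured with a toy sketch. ActionParty's actual mechanism is not described in the tweet, so the per-player masks and action embeddings below are purely hypothetical, a minimal illustration of routing each player's action to the right subject on screen.

```python
import numpy as np

def bind_actions(masks, action_embs):
    """Route each player's action embedding to the pixels that player occupies.

    masks:       (P, H, W) binary per-player masks (hypothetical inputs).
    action_embs: (P, D) one action embedding per player.
    Returns an (H, W, D) conditioning map; background pixels stay zero.
    """
    P, H, W = masks.shape
    cond = np.zeros((H, W, action_embs.shape[1]))
    for p in range(P):
        cond[masks[p] > 0] = action_embs[p]
    return cond
```

With non-overlapping masks, each pixel carries exactly one player's action, which is the binding guarantee the tweet describes.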
Willi Menapace @WilliMenapace
Bringing real-time egocentric video editing to #CVPR2026! 🚀 It was a pleasure to supervise this fantastic collaboration at @Snapchat. We've just open-sourced our video editing dataset; check out the amazing work below! 👇👏
Runjia Li@RunjiaLi

🎉EgoEdit @Snapchat has been accepted to CVPR 2026! 🏆👻 We are bringing high-quality, real-time editing to egocentric videos. Our massive 100k video dataset and benchmark are ALREADY PUBLIC! 🔓🚀 🏠 Project Page: snap-research.github.io/EgoEdit/ 🤗 Dataset: huggingface.co/datasets/ligua…

0 replies · 1 repost · 15 likes · 939 views
Willi Menapace @WilliMenapace
Why allocate compute uniformly when not all pixels are equally hard? 🤔 Our new work, ELIT, solves DiT compute waste by focusing on hard regions. It also acts as a runtime knob, letting you easily dial your inference budget up or down. See you at CVPR 2026! 👇
Moayed Haji Ali@moayedhajiali

Not all pixels are equally hard, but DiTs still allocate compute uniformly across pixels, wasting efforts on easy regions. ELIT adds two lightweight cross-attention layers to focus compute where it matters, cutting FID by 53%. ELIT: snap-research.github.io/elit

0 replies · 0 reposts · 5 likes · 469 views
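The "runtime knob" ELIT's announcement describes can be sketched as top-k routing by a difficulty score. This is only an illustration under assumed interfaces (ELIT's real mechanism is two cross-attention layers inside a DiT, not shown here): the `budget` fraction decides how many tokens take the heavy path, which is the dial-up/dial-down inference control.

```python
import numpy as np

def route_compute(tokens, difficulty, budget, heavy_fn, light_fn):
    """Spend heavy compute only on the hardest tokens.

    tokens:     (N, D) token features.
    difficulty: (N,) per-token hardness scores (assumed given by a predictor).
    budget:     fraction of tokens sent through the expensive path.
    """
    n_heavy = max(1, int(round(len(tokens) * budget)))
    hard = np.argsort(difficulty)[-n_heavy:]      # indices of hardest tokens
    out = light_fn(tokens.copy())                 # cheap path for everyone
    out[hard] = heavy_fn(tokens[hard])            # heavy path for the hard subset
    return out, set(hard.tolist())
```

Raising `budget` trades compute for quality; lowering it does the reverse, with no retraining in this sketch.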
Willi Menapace @WilliMenapace
Why is progressive generation so complex? 🤔 It doesn't have to be. Our Decomposable Flow Matching (DFM) simplifies the process into a single, straightforward flow model, 🚀 beating prior work in image and video synthesis. #AI #Research #MachineLearning
Moayed Haji Ali@moayedhajiali

Where are the good old progressive diffusion models? 🤔 Breaking generation into multiple resolution scales is a great idea, but complexity (multiple models, a custom diffusion process, etc.) stalled scaling. Our Decomposable Flow Matching packs the multi-scale perks into one scalable model.

0 replies · 1 repost · 6 likes · 728 views
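A minimal sketch of the two ingredients named above, under heavy simplification: a fixed two-scale decomposition (standing in for DFM's multi-scale one) and the standard flow-matching path with its velocity target, applied per component. How DFM actually couples the scales inside one model is not shown here.

```python
import numpy as np

def decompose(x):
    """Split a 1-D signal into a coarse scale plus residual detail
    (a stand-in for a multi-scale decomposition)."""
    coarse = x.reshape(-1, 2).mean(axis=1).repeat(2)  # 2x down, then nearest up
    return coarse, x - coarse

def fm_path(x0, x1, t):
    """Linear flow-matching path x_t = (1-t)*x0 + t*x1 and its
    velocity target x1 - x0, applied to each decomposed component."""
    return (1 - t) * x0 + t * x1, x1 - x0
```

Because the decomposition is exact (coarse + detail reconstructs the input), a single model predicting velocities for all components sees the full signal.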
Willi Menapace retweeted
Ashkan Mirzaei @ashmrz10
[1/9] 🚀 We introduce 4Real-Video-V2, a method that can generate 4D scenes from a simple text prompt, viewable from any angle at any moment in time. It’s fast, photorealistic, and works on full scenes. Here's how it works and why it matters. 👇 snap-research.github.io/4Real-Video-V2/
2 replies · 25 reposts · 90 likes · 10K views
Willi Menapace retweeted
Snap Inc. @Snap
Heading to @CVPR 2025 in Nashville this week? So are we! We’re proud to have 12 papers accepted — including SnapGen and 4Real-Video, both highlighted among the top 3% of submissions. Come find us to learn more about the cutting edge work we’re doing in AI and computer vision. 📍 See you in Nashville! Learn more: newsroom.snap.com/snap-research-…
2 replies · 2 reposts · 17 likes · 4.4K views
Willi Menapace retweeted
Ziyi Wu @Dazitu_616
📢 Introducing DenseDPO: Fine-Grained Temporal Preference Optimization for Video Diffusion Models Compared to vanilla DPO, we improve paired data construction and preference label granularity, leading to better visual quality and motion strength with only 1/3 of the data. 🧵
2 replies · 35 reposts · 179 likes · 35.2K views
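The finer preference granularity DenseDPO advertises can be sketched as the standard DPO objective averaged over temporal segments instead of a single whole-clip label. The segment log-prob interface below is an assumption for illustration, not DenseDPO's actual formulation.

```python
import numpy as np

def dense_dpo_loss(lp_w, lp_l, ref_w, ref_l, beta=0.1):
    """DPO loss with one preference label per temporal segment.

    lp_w, lp_l:   (S,) policy log-probs of the preferred/rejected segments.
    ref_w, ref_l: (S,) reference-model log-probs of the same segments.
    Vanilla DPO corresponds to S = 1 (a single clip-level label).
    """
    margins = beta * ((lp_w - ref_w) - (lp_l - ref_l))
    # -log(sigmoid(m)) == log(1 + exp(-m)), averaged over segments
    return float(np.mean(np.log1p(np.exp(-margins))))
```

Denser labels mean more gradient signal per clip, which is one plausible reading of the tweet's "1/3 of the data" claim.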
Willi Menapace retweeted
Ivan Skorokhodov @isskoro
In the past 1.5 weeks, two papers from two different research groups have appeared that develop exactly the same (and embarrassingly simple) trick to improve convergence of image/video diffusion models by 20-100+% (sic!) arxiv.org/abs/2502.14831 arxiv.org/abs/2502.09509
[media attachments]
9 replies · 58 reposts · 402 likes · 39.2K views
Willi Menapace retweeted
Kfir Aberman @AbermanKfir
We discovered that imposing a spatio-temporal weight space via LoRAs on DiT-based video models unlocks powerful customization! It captures dynamic concepts with precision and even enables composition of multiple videos together! 🎥✨
15 replies · 85 reposts · 605 likes · 59.5K views
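The weight-space idea in the tweet rests on the basic LoRA identity W' = W + BA; composing concepts then amounts, in this simplified sketch, to summing the low-rank deltas. The actual spatio-temporal structure the authors impose on that weight space is not shown.

```python
import numpy as np

def lora_delta(B, A, alpha=1.0):
    """Low-rank weight update: delta_W = alpha * B @ A, rank = B.shape[1]."""
    return alpha * (B @ A)

def compose_loras(W, loras):
    """Merge several LoRAs into one weight matrix by summing their deltas
    (a naive stand-in for composing customized video concepts)."""
    return W + sum(lora_delta(B, A, a) for B, A, a in loras)
```

Each `(B, A, alpha)` tuple is one customized concept; `alpha` scales its contribution when several are merged.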
Willi Menapace retweeted
Rameen Abdal @AbdalRameen
What if you could compose videos, merging multiple clips and even capturing complex athletic moves where video models struggle, all while preserving motion and context? And yes, you can still edit them with text afterwards! Stay tuned for more results. #AI #VideoGeneration #SnapResearch
7 replies · 24 reposts · 147 likes · 17.9K views
Willi Menapace @WilliMenapace
Check out Video Alchemist! Our latest work enables multi-subject open-set personalization with no need for inference-time tuning 👇👇👇
Tsai-Shien Chen@tsaishien_chen

Introducing ⚗️ Video Alchemist Our new video model supporting 👪 Multi-subject open-set personalization 🏞️ Foreground & background personalization 🚀 Without the need of inference-time tuning snap-research.github.io/open-set-video… [Results] 1. Sora girl rides a dinosaur on a savanna 🧵👇

0 replies · 0 reposts · 7 likes · 345 views
Willi Menapace @WilliMenapace
Video-to-Audio and Audio-to-Video models struggle with temporal alignment. AV-Link solves the problem by conditioning on diffusion model features. Great collaboration with @moayedhajiali, @siarohin9013, @isskoro, @alpercanbe, Kwot Sin Lee, Vicente Ordonez and @SergeyTulyakov
Moayed Haji Ali@moayedhajiali

Can pretrained diffusion models connect for cross-modal generation? 📢 Introducing AV-Link ♾ Bridging unimodal diffusion models in one framework to enable: 📽️ ➡️ 🔊 Video-to-Audio 🔊 ➡️ 📽️ Audio-to-Video 🌐: snap-research.github.io/AVLink/ 📄: hf.co/papers/2412.15… ⤵️ Results

0 replies · 3 reposts · 10 likes · 843 views
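Conditioning one modality's generator on the other's diffusion features, as described above, can be sketched with a single cross-attention head: the generating modality's tokens query intermediate features of the other model. The shapes and single-head setup are illustrative assumptions, not AV-Link's architecture.

```python
import numpy as np

def cross_attend(q_tokens, kv_feats, Wq, Wk, Wv):
    """Single-head cross-attention from generator tokens to the other
    modality's intermediate diffusion features."""
    Q, K, V = q_tokens @ Wq, kv_feats @ Wk, kv_feats @ Wv
    scores = Q @ K.T / np.sqrt(Q.shape[1])
    scores -= scores.max(axis=1, keepdims=True)   # numerical stability
    attn = np.exp(scores)
    attn /= attn.sum(axis=1, keepdims=True)       # softmax over kv positions
    return attn @ V
```

Because the keys and values come from time-stamped diffusion features rather than a separate encoder, alignment information is available to the generator directly.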
Willi Menapace retweeted
Ziyi Wu @Dazitu_616
MinT beats Sora in multi-event generation! One week after the release of MinT, Sora also released a *storyboard* tool that targets the same task (sequential events + time control). Below are a few comparisons, where MinT shows better event transition and timing: (1/N)
Ziyi Wu@Dazitu_616

📢MinT: Temporally-Controlled Multi-Event Video Generation📢 mint-video.github.io TL;DR: We identify a fundamental failure mode of existing video generators: they cannot produce videos with sequential events. MinT unlocks this capability with temporal grounding of events. 🧵

1 reply · 12 reposts · 48 likes · 7.7K views
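The "temporal grounding of events" in MinT's announcement can be pictured as a frame-to-event attention mask: each frame may only attend to the text of the event scheduled at that time. The (start, end) span interface below is an assumption for illustration.

```python
import numpy as np

def event_mask(num_frames, spans):
    """Binary mask binding frames to temporally grounded events.

    spans: one (start, end) frame range per event, end exclusive.
    mask[f, e] == 1 means frame f may attend to event e's text tokens.
    """
    mask = np.zeros((num_frames, len(spans)), dtype=int)
    for e, (s, t) in enumerate(spans):
        mask[s:t, e] = 1
    return mask
```

With contiguous, non-overlapping spans, every frame is bound to exactly one event, which is what enforces the sequential ordering existing generators miss.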
Willi Menapace retweeted
Ziyi Wu @Dazitu_616
📢MinT: Temporally-Controlled Multi-Event Video Generation📢 mint-video.github.io TL;DR: We identify a fundamental failure mode of existing video generators: they cannot produce videos with sequential events. MinT unlocks this capability with temporal grounding of events. 🧵
12 replies · 52 reposts · 189 likes · 33K views
Willi Menapace retweeted
Andrea Tagliasacchi 🇨🇦
📢📢📢 𝐀𝐂𝟑𝐃: Analyzing and Improving 3D Camera Control in Video Diffusion Transformers snap-research.github.io/ac3d TL;DR: for 3D camera control in generative video, it really helps knowing *which* part of your model you should mess with Internship by @sherwinbahmani at Snap
3 replies · 26 reposts · 128 likes · 23K views
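AC3D's takeaway, that *which* part of the model you condition matters, can be illustrated with a toy block stack where camera-pose information is injected only in a chosen subset of blocks. Everything here (tanh blocks, additive injection) is a stand-in, not AC3D's architecture.

```python
import numpy as np

def run_blocks(x, pose_emb, num_blocks=8, cond_blocks=(0, 1, 2, 3)):
    """Toy transformer stack with camera-pose conditioning injected
    only in `cond_blocks`."""
    for i in range(num_blocks):
        if i in cond_blocks:
            x = x + pose_emb            # conditioned block
        x = np.tanh(x)                  # stand-in for a transformer block
    return x
```

Injecting the same pose signal early versus late yields different outputs, the toy version of why choosing the injection site is worth analyzing.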
Vlad Golyanik @VGolyanik
Congratulations, Dr. @WilliMenapace, on today's successful thesis defence in the beautiful Trento! 🎾👏👏🙌 Co-supervising you was a great and enriching experience. Best of luck with your continuing scientific journey! ...in the photo with @eliricci_ and @lambertoballan
[media attachment]
2 replies · 1 repost · 24 likes · 1.7K views