Wei Yu (@GnosisYu) - Twitter Profile

Pinned Tweet

Wei Yu@GnosisYu·4d

Dropping an exciting new demo of MosaicMem! 👀🔥 A friend brought up a great question: why not combine long-horizon navigation video generation, promptable world events, and scene concatenation? Fair point — so we gave it a shot. 🎬✨ For more technical details, check this thread 🧵👇 x.com/GnosisYu/statu… #WorldModel #GenerativeAI #VideoGeneration #InteractiveAI #Genie3 #EmbodiedAI #GameAI

0

18

104

7.9K

Wei Yu retweeted

Andrea Tagliasacchi 🇨🇦@taiyasaki·3d

📢📢📢 Introducing "FullCircle: Effortless 3D Reconstruction from Casual 360° Captures"
TL;DR: 10x faster casual capture with clean reconstructions
Homepage: theialab.github.io/fullcircle
Code: github.com/theialab/fullc…
arXiv: arxiv.org/abs/2603.22572
Led by Yalda Foroutan & Ipek Oztas

2

70

366

32.3K

Wei Yu retweeted

Yumeng Li@YumengLi_007·4d

🚀 Check out our new demo using MosaicMem! It combines long-horizon navigation, promptable events, and multi-scene concatenation 🌏

1

3

6

885

Wei Yu retweeted

Adina Yakup@AdinaYakup·27 Mar

Matrix-Game 3.0 🔥 real-time interactive world models from @Skywork_ai
huggingface.co/Skywork/Matrix…
✨ MIT license
✨ 720p @ 40FPS with a 5B model
✨ Minute-long memory consistency
✨ Unreal + AAA + real-world data
✨ Scales up to 28B MoE

10

105

626

42.6K

Wei Yu retweeted

Chris Paxton@chris_j_paxton·26 Mar

x.com/i/article/2037…

18

63

355

116.8K

Wei Yu retweeted

Samarth Sinha@_sam_sinha_·23 Mar

Uni-1 is finally here!! A state-of-the-art model that outperforms Nano Banana / GPT along many axes, and you can now try it for free 👀 🥰 Building this model brick by brick from scratch was the highlight of my life ❤️ The model is fully available now! We are COOKING rn @LumaLabsAI

Luma@LumaLabsAI

Uni-1 is here! A new kind of model that thinks and generates pixels simultaneously. Less artificial. More intelligent.

3

10

83

6.4K

Wei Yu retweeted

Danfei Xu@danfei_xu·23 Mar

Introducing EgoVerse: an ecosystem for robot learning from egocentric human data. Built and tested by 4 research labs + 3 industry partners, EgoVerse enables both science and scaling.
1300+ hrs, 240 scenes, 2000+ tasks, and growing.
Dataset design, findings, and ecosystem 🧵

31

159

812

221.4K

Wei Yu retweeted

Cristóbal Valenzuela@c_valenzuelab·22 Mar

8 years ago, it was just a toy. All things start as a toy. When they don’t have a name or feel strange, that’s where the most interesting things are happening.

Cristóbal Valenzuela@c_valenzuelab

Playing with text-to-image generation is really interesting. I made a small app to generate images in real time while you type. It's like having a visual translator. Next step, port it to @runwayml I'll add more videos in this thread. Some results are very surprising.

11

122

9.9K

Wei Yu retweeted

Animesh Garg@animesh_garg·20 Mar

x.com/i/article/2035…

4

15

133

30.8K

Wei Yu@GnosisYu·21 Mar

Adding a few tags for visibility 👇 #Genie3 #WorldModels #VideoGeneration #AI #Game

0

6

441

Wei Yu@GnosisYu·20 Mar

World models have made impressive progress in video generation, yet they still struggle with a fundamental challenge: memory. In long rollouts, the camera trajectory gradually drifts from the user-specified motion and revisited scenes no longer align with earlier observations. These errors accumulate over time, causing the generated world to steadily lose coherence. 🚀Excited to share our solution MosaicMem 🌍🧠 — our new hybrid spatial memory for video world models. Project Page: mosaicmem.github.io/mosaicmem/ Paper: huggingface.co/papers/2603.17…

18

52

240

185.1K

Wei Yu@GnosisYu·20 Mar

Huge thanks to all the amazing collaborators on this work 🙏 Runjia Qian (@MqquQ0nIwoXrU4M), Yumeng Li (@YumengLi_007), Liquan Wang (@LiCHO13676030), Songheng Yin (@zha_bu_duo_deLe), Sri Siddarth Chakaravarthy P, Dennis Anthony (@DennisAnthony__), Yang Ye (@YangYe20161101), Yidi Li, Weiwei Wan, Animesh Garg (@animesh_garg). Couldn’t have done this without you all. Truly appreciate the effort and collaboration! 🚀

0

11

698

Wei Yu@GnosisYu·20 Mar

@HuggingPapers Thanks so much @HuggingPapers for sharing our work! 🙏 We’ve just updated the post today with a more detailed demo video — would love for you to check it out! x.com/GnosisYu/statu…

0

1

29

Wei Yu retweeted

DailyPapers@HuggingPapers·19 Mar

MosaicMem A hybrid spatial memory for video world models bridging explicit 3D and implicit memory, enabling long-horizon navigation, memory-based editing, and dynamic scene generation with improved camera consistency.

4

9

47

3.4K

Wei Yu@GnosisYu·20 Mar

@kwangmoo_yi Huge thanks @kwangmoo_yi for the share!! 🙏 We just updated the post with a more detailed demo video today — would really appreciate you taking a look! x.com/GnosisYu/statu…

0

2

171

Wei Yu retweeted

Kwang Moo Yi@kwangmoo_yi·20 Mar

Yu et al., "MosaicMem: Hybrid Spatial Memory for Controllable Video World Models" A patch-based spatial memory that you raster into views + glues to make things work.

6

18

146

11.1K

Wei Yu@GnosisYu·20 Mar

A key strength of MosaicMem lies in enabling dynamic, evolving scenes 🎮🌍. It supports promptable world events while preserving a persistent background, effectively combining controllable dynamics with reliable camera control. In addition, it enables memory manipulation for scene editing—composing elements from different scenes into a unified generation, creating composite experiences that do not exist in reality while maintaining spatial coherence.

1

11

935

Wei Yu@GnosisYu·20 Mar

The core problem is memory. However, the puzzling part is that today’s memory designs perform well in some regimes yet fail in others, motivating a closer examination of prevailing paradigms and their limitations.
- Explicit 3D memory (reprojection-based): geometrically solid, but fails to preserve the prompt-following capability of pretrained models.
- Implicit latent memory: handles dynamics fine, but camera drift creeps in and compounds.
Instead of posing this as a tradeoff, MosaicMem combines the best of both worlds.
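
Editor's sketch: to make "explicit 3D memory (reprojection-based)" concrete, here is a toy illustration of the general idea, in which stored 3D points are projected into a new camera view with a pinhole model. This is not MosaicMem's implementation. The names `PinholeCamera`, `project`, and `reproject_memory` are hypothetical, and the camera is assumed to be axis-aligned (translation only, no rotation) to keep the example short.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple

Vec3 = Tuple[float, float, float]
Pixel = Tuple[float, float]


@dataclass
class PinholeCamera:
    """Hypothetical axis-aligned pinhole camera (translation only, no rotation)."""
    position: Vec3        # camera centre in world coordinates
    focal: float = 100.0  # focal length in pixels (assumed value)
    cx: float = 64.0      # principal point, x
    cy: float = 64.0      # principal point, y
    width: int = 128      # image width in pixels
    height: int = 128     # image height in pixels


def project(cam: PinholeCamera, point: Vec3) -> Optional[Pixel]:
    """Project a world-space point into the image; None if it is not visible."""
    x = point[0] - cam.position[0]
    y = point[1] - cam.position[1]
    z = point[2] - cam.position[2]
    if z <= 0:
        return None  # behind the camera
    u = cam.focal * x / z + cam.cx
    v = cam.focal * y / z + cam.cy
    if 0 <= u < cam.width and 0 <= v < cam.height:
        return (u, v)
    return None  # outside the image bounds


def reproject_memory(
    memory: List[Tuple[Vec3, str]], cam: PinholeCamera
) -> List[Tuple[Pixel, str]]:
    """Return (pixel, payload) for every stored point visible from `cam`."""
    hits = []
    for point, payload in memory:
        uv = project(cam, point)
        if uv is not None:
            hits.append((uv, payload))
    return hits


if __name__ == "__main__":
    # Toy memory: 3D points tagged with a label standing in for stored content.
    memory = [((0.0, 0.0, 2.0), "wall"), ((0.0, 0.0, -1.0), "door")]
    print(reproject_memory(memory, PinholeCamera(position=(0.0, 0.0, 0.0))))
    print(reproject_memory(memory, PinholeCamera(position=(1.0, 0.0, 0.0))))
```

Because the stored points are fixed in world space, revisiting a location with the same camera pose yields the same pixel coordinates, which is the geometric consistency the thread attributes to explicit memory. The sketch says nothing about how such memory interacts with a pretrained video model.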