Wei Yu

118 posts

Wei Yu

Wei Yu

@GnosisYu

PhD @ University of Toronto soon graduating. I work on controllable video generation & world models inspired by Bayesian Brain Theory. Open to AI/ML roles!

Toronto Katılım Mart 2018
1.6K Takip Edilen328 Takipçiler
Wei Yu retweetledi
Wei Yu retweetledi
Adina Yakup
Adina Yakup@AdinaYakup·
Matrix-Game 3.0🔥real-time interactive world models from @Skywork_ai huggingface.co/Skywork/Matrix… ✨ MIT license ✨ 720p @ 40FPS with a 5B model ✨ Minute-long memory consistency ✨ Unreal + AAA + real-world data ✨ Scales up to 28B MoE
English
10
105
626
42.6K
Wei Yu retweetledi
Danfei Xu
Danfei Xu@danfei_xu·
Introducing EgoVerse: an ecosystem for robot learning from egocentric human data. Built and tested by 4 research labs + 3 industry partners, EgoVerse enables both science and scaling 1300+ hrs, 240 scenes, 2000+ tasks, and growing Dataset design, findings, and ecosystem 🧵
English
31
159
812
221.4K
Wei Yu retweetledi
Cristóbal Valenzuela
Cristóbal Valenzuela@c_valenzuelab·
8 years ago, it was just a toy. All things start as a toy, when they don’t have a name or feel strange, that’s where the most interesting things are happening.
Cristóbal Valenzuela@c_valenzuelab

Playing with text-to-image generation is really interesting. I made a small app to generate images in real time while you type. It's like having a visual translator. Next step, port it to @runwayml I'll add more videos in this thread. Some results are very surprising.

English
11
11
122
9.9K
Wei Yu
Wei Yu@GnosisYu·
World models have made impressive progress in video generation, yet they still struggle with a fundamental challenge: memory. In long rollouts, the camera trajectory gradually drifts from the user-specified motion and revisited scenes no longer align with earlier observations. These errors accumulate over time, causing the generated world to steadily lose coherence. 🚀Excited to share our solution MosaicMem 🌍🧠 — our new hybrid spatial memory for video world models. Project Page: mosaicmem.github.io/mosaicmem/ Paper: huggingface.co/papers/2603.17…
English
18
52
240
185.1K
Wei Yu retweetledi
DailyPapers
DailyPapers@HuggingPapers·
MosaicMem A hybrid spatial memory for video world models bridging explicit 3D and implicit memory, enabling long-horizon navigation, memory-based editing, and dynamic scene generation with improved camera consistency.
English
4
9
47
3.4K
Wei Yu retweetledi
Kwang Moo Yi
Kwang Moo Yi@kwangmoo_yi·
Yu et al., "MosaicMem: Hybrid Spatial Memory for Controllable Video World Models" A patch-based spatial memory that you raster into views + glues to make things work.
English
6
18
146
11.1K
Wei Yu
Wei Yu@GnosisYu·
A key strength of MosaicMem lies in enabling dynamic, evolving scenes 🎮🌍. It supports promptable world events while preserving a persistent background, effectively combining controllable dynamics with reliable camera control. In addition, it enables memory manipulation for scene editing—composing elements from different scenes into a unified generation, creating composite experiences that do not exist in reality while maintaining spatial coherence.
English
1
1
11
935
Wei Yu
Wei Yu@GnosisYu·
The core problem is memory. However, the puzzling part is that today’s memory designs perform well in some regimes yet fail in others, motivating a closer examination of prevailing paradigms and their limitations. - explicit 3D memory (reprojection-based): geometrically solid, but fails to preserve the prompt-following capability of pretrained models. - implicit latent memory: handles dynamics fine, but camera drift creeps in and compounds Instead of posing this as a tradeoff, MosaicMem combines the best of both worlds.
English
0
1
16
1.5K
Wei Yu
Wei Yu@GnosisYu·
@HuggingPapers Thanks @HuggingPapers for sharing! I’ll be posting a more detailed breakdown of our Mosaic Memory tomorrow—stay tuned 👀
English
0
0
1
39
Wei Yu
Wei Yu@GnosisYu·
@lily_goli Same here! This was probably the best reviewing experience I’ve had so far — lots of genuinely enjoyable papers.
English
0
0
1
105
Lily Goli
Lily Goli@lily_goli·
wow I guess I was lucky with this year's CVPR reviews, more than half of mine were actually nice papers I enjoyed reading.
English
3
0
28
3.2K