
Wei Yu
@GnosisYu
PhD @ University of Toronto, graduating soon. I work on controllable video generation & world models inspired by Bayesian Brain Theory. Open to AI/ML roles!



Dropping an exciting new demo of MosaicMem! 👀🔥 A friend brought up a great question: why not combine long-horizon navigation video generation, promptable world events, and scene concatenation? Fair point — so we gave it a shot. 🎬✨ For more technical details, check this thread 🧵👇 x.com/GnosisYu/statu… #WorldModel #GenerativeAI #VideoGeneration #InteractiveAI #Genie3 #EmbodiedAI #GameAI


Uni-1 is here! A new kind of model that thinks and generates pixels simultaneously. Less artificial. More intelligent.

Playing with text-to-image generation is really interesting. I made a small app to generate images in real time while you type. It's like having a visual translator. Next step: port it to @runwayml. I'll add more videos in this thread. Some results are very surprising.



World models have made impressive progress in video generation, yet they still struggle with a fundamental challenge: memory. In long rollouts, the camera trajectory gradually drifts from the user-specified motion, and revisited scenes no longer align with earlier observations. These errors accumulate over time, causing the generated world to steadily lose coherence. 🚀 Excited to share our solution, MosaicMem 🌍🧠, a new hybrid spatial memory for video world models. Project Page: mosaicmem.github.io/mosaicmem/ Paper: huggingface.co/papers/2603.17…










