Ben Poole

3.1K posts

@poolio

research scientist at google brain. phd in neural nonsense from stanford.

Stanford, CA Joined March 2008
1.5K Following 21.5K Followers
Pinned Tweet
Ben Poole
Ben Poole@poolio·
knock knock #veo3
3
18
172
18.4K
SpAItial AI
SpAItial AI@SpAItial_AI·
We wired 3D Gaussian Splatting into the 1999 Quake 3 engine - it's fully playable! Echo-2 generates the game levels, and it's rendering spaces directly inside the open source version of id Tech 3. Anyone want to try it?
14
20
183
15.2K
SpAItial AI
SpAItial AI@SpAItial_AI·
Echo-2 can generate a diverse set of environments. Results are spatially-persistent by design, a critical difference from many video models. We use 3DGS for extremely fast, real-time rendering; visual quality achieves state-of-the-art using the world score evaluation.
3
0
25
3K
SpAItial AI
SpAItial AI@SpAItial_AI·
🚀Echo-2 is here - our new world model! These aren’t videos. These are 𝟑𝐃 𝐬𝐜𝐞𝐧𝐞𝐬. Generated from a single image. - Stunning visual quality. - Real-time rendering. - Interactive camera control. - Physically grounded. 🧵More details👇
20
87
523
93.8K
Ben Poole retweeted
Radu Soricut
Radu Soricut@RSoricut·
Meet Vision Banana 🍌 from @GoogleDeepMind! We provide strong evidence that image generators are generalist vision learners. Traditional computer vision tasks (segmentation, depth estimation, normal prediction) can now be performed at/near SOTA with a single generalist model derived from an image generation model. 🖼️ Explore the results: vision-banana.github.io 📄 See details at: arxiv.org/abs/2604.20329
28
93
630
77.1K
Ben Poole retweeted
Peter Holderrieth
Peter Holderrieth@peholderrieth·
We release Diamond Maps💎 unlocking accurate and efficient guidance for diffusion models. Our experiments show that our methods scale incredibly well. Excited to see what people will build with this! Accurate guidance has been a notoriously hard problem, but in this work, we’re bringing TWO (!) solutions to the table. The recipe for success: 1️⃣ Speed: Use distilled models (flow maps, mean flows, consistency models). 2️⃣ Exploration: Inject stochasticity to properly explore your search space. Because this fundamentally improves anything using flow matching and diffusion, we see a lot of potential for applications across audio, robotics, molecules, and beyond. Paper: arxiv.org/abs/2602.05993 Code: github.com/PeterHolderrie… Huge thanks to an amazing team: Douglas Chen, @LucaEyring, @ishin_shah, Giri Anantharaman, @electronickale, @zeynepakata, Tommi Jaakkola, @nmboffi, and @max_simchowitz. It was awesome bringing this to life together!
2
43
247
57.3K
Tengfei Wang
Tengfei Wang@DylanTFWang·
Genie3 generates videos. We generate 𝟯𝗗 𝘄𝗼𝗿𝗹𝗱𝘀 you can actually use. Launching tomorrow — Tencent #HYWorld 2.0, an engine-ready World Model🚀 This isn't a video. It's a real 3D scene, all generated & editable. One image in. A whole 3D world out. 🔥Open-source tomorrow
193
469
4.2K
441.7K
Ben Poole retweeted
Jesse Engel
Jesse Engel@jesseengel·
Today, we're open sourcing the code behind "The Infinite Crate," our VST/AU plugin that lets you play with Lyria RealTime directly inside your favorite DAW! 🎧🎶 Since we released The Infinite Crate last year, it’s been used by some of our favorite artists, including a wonderful showcase with @daitomanabe in Tokyo, and was featured as a top new music tool at NAMM 2026. We’re now fully open sourcing the plugin for developers to fork, modify, and make their own under the permissive Apache 2.0 license. 💾 Get it here: github.com/magenta/the-in…
9
43
206
20.2K
Robin Rombach
Robin Rombach@robrombach·
New paper out! We present a training method for multimodal generative models, called Self-Flow, which combines classic flow matching and representation learning. Why? Unlike most representation alignment methods, our new approach does not require external, pretrained models and thus scales gracefully to joint multimodal training on images, videos and audio. How? It combines per-timestep flow matching with dual-timestep representation learning, improving the models' internal representations. This approach outperforms prior methods and shows promising scaling behavior in multimodal pretraining. It also enables downstream applications such as action prediction for embodied AI. webpage+paper: bfl.ai/research/self-… code: github.com/black-forest-l… Credit to @hila_chefer, @pess_r, Dominik, @dustin_podell, Vikash, @Vinh_Suhi and Antonio. If you enjoy doing open research like this, come and join BFL! We are actively hiring🌲
5
36
310
27.5K
Ben Poole retweeted
Jonathan Heek
Jonathan Heek@JonathanHeek·
1/6 Introducing Unified Latents: what if your diffusion model's latents were measured in bits? Instead of relying on dimensionality reduction, we learn a latent AE with explicit bitrate control. Paper: arxiv.org/abs/2602.17270 @emiel_hoogeboom, @TimSalimans
10
53
404
68K
Ben Poole retweeted
Kording Lab 🦖
Kording Lab 🦖@KordingLab·
neuroAI comparisons of ANNs to brains do have a range of problems. Even more than I had realized. And I was worried before: biorxiv.org/content/10.110…
6
24
101
11K
Ben Poole
Ben Poole@poolio·
World models for the real world! Awesome work with incredible colleagues at Waymo 🤖🚙
Google DeepMind@GoogleDeepMind

Genie 3 🤝 @Waymo The Waymo World Model generates photorealistic, interactive environments to train autonomous vehicles. This helps the cars navigate rare, unpredictable events before encountering them in reality. 🧵

3
2
65
7K
Ben Poole retweeted
Waymo
Waymo@Waymo·
We’re excited to introduce the Waymo World Model, a frontier generative model for large-scale, hyper-realistic autonomous driving simulation built on @GoogleDeepMind’s Genie 3. By simulating the “impossible”, we proactively prepare the Waymo Driver for some of the most rare and complex scenarios, from tornadoes to planes landing on freeways, long before it encounters them in the real world. waymo.com/blog/2026/02/t…
129
478
3.9K
999.4K
Ben Poole retweeted
Google Labs
Google Labs@GoogleLabs·
🚨NEW LABS EXPERIMENT🚨 Introducing Project Genie, an experimental prototype that lets you create and explore infinitely diverse worlds! Prompt with images or text to create a living, expanding world that builds itself in real-time around you. Access is rolling out today to Google AI Ultra subscribers (US only, 18+) Learn more: labs.google/projectgenie
100
306
2.4K
170.9K
Ben Poole retweeted
Google
Google@Google·
Introducing Project Genie: An experimental research prototype powered by Genie 3, our world model, that lets you prompt an interactive world into existence — and then step inside 🌎
393
671
4.7K
1.6M
Ben Poole retweeted
Justine Moore
Justine Moore@venturetwins·
I got early access to Project Genie from @GoogleDeepMind ✨ It's unlike any realtime world model I've tried - you generate a scene from text or a photo, and then design the character who gets to explore it. I tested dozens of prompts. Here are the standout features 👇
81
171
1.8K
229.3K
Ben Poole retweeted
Logan Kilpatrick
Logan Kilpatrick@OfficialLoganK·
🧞‍♂️🧞‍♂️🧞‍♂️
171
102
2K
939.1K
Ben Poole retweeted
Adam Roberts
Adam Roberts@ada_rob·
I'm proud to have had @JeffDean as my lead at Google for the last 10 years. He is one of the few people of his stature in our industry always willing to stand up and say the obvious. We can't let this administration scare us away from acknowledging the reality we are living in. It's time to stand up and use our voices to end this.
Jeff Dean@JeffDean

This is absolutely shameful. Agents of a federal agency unnecessarily escalating, and then executing a defenseless citizen whose offense appears to be using his cell phone camera. Every person regardless of political affiliation should be denouncing this.

36
226
3.2K
152.5K
Ben Poole
Ben Poole@poolio·
While generative AI continues to impress, it takes a dedicated group of incredible artists, storytellers, and researchers to create a compelling film. Check out the awesome work from Connie and team on Dear Upstairs Neighbors:
Google DeepMind@GoogleDeepMind

Our short film Dear Upstairs Neighbors is previewing at @sundancefest. 🎬 It’s a story about noisy neighbors, but behind the scenes, it’s about solving a huge challenge in generative AI: control. Developed by Pixar alumni, an Academy Award winner, researchers, and engineers, here’s how it came together. 🎨

1
0
22
2.8K
Ben Poole retweeted
Kyle Sargent
Kyle Sargent@KyleSargentAI·
Vision-language models are getting better every day. Can we use them to improve image compression? Yes! For my internship, working w/ @GoogleDeepMind, @GoogleResearch, we designed VLIC, a diffusion autoencoder post-trained with VLM preferences. Our preprint is out today! A🧵:
5
39
314
44.9K