Yulia Rubanova
@YuliaRubanova
Staff Research Scientist on the Veo team at DeepMind. Veo Ingredients-to-Video (I/O 2025). Controllable video generation, learning physics in 3D, world models



We’re updating Veo 3.1 Ingredients to Video to help create more expressive and dynamic clips, produce better visual consistency and more. 📽️ Here’s what’s new 🧵

Generalist robots need a generalist evaluator. But how do you test safety without breaking things? 💥 🌎 Introducing our new work from @GoogleDeepMind: Evaluating Gemini Robotics Policies in a Veo World Simulator veo-robotics.github.io 🧵👇

You went 🍌🍌 for Nano Banana. Now, meet Nano Banana Pro. It’s SOTA for image generation + editing with more advanced world knowledge, text rendering, precision + controls. Built on Gemini 3, it’s really good at complex infographics - much like how engineers see the world:)

We’re back with another update to Veo 3.1: Rolling out now on mobile and desktop, you can upload multiple reference images alongside your video prompts, to create entirely new worlds and more nuanced videos that are true to your vision.


🖼️ Ingredients to video: Give multiple reference images with different people and objects, and watch how Veo integrates these into a fully-formed scene, complete with sound.

🚨🎬 Big news from Video Arena! @GoogleDeepMind’s latest Veo 3.1 now ranks #1 in both Text-to-Video and Image-to-Video leaderboards. 🏆 This is a +30-point leap from Veo 3.0 → 3.1, making it the first model to break 1400 in Video Arena history! Huge congrats to the @GoogleDeepMind team for pushing the frontier of video generation forward! More details in the thread 🧵

K-Pop performance @corl_conf banquet by @geeniusofficial #CoRL2025

🔥Veo 3 has emergent zero-shot learning and reasoning capabilities! This multitalented model can do a huge range of interesting tasks. It understands physical properties, can manipulate objects, and can even reason. Check out more examples in this thread!

We posed a creative challenge to our Discord community: start with the exact same ingredients and a prompt to create something in Flow. The results... delightful! ✨

Google just built the craziest AI photo editor ever. Nano Banana in Gemini is wild. People are already dropping insane use cases. 10 wild examples: 1. Model pose like the sketch

Image generation with Gemini just got a bananas upgrade and is the new state-of-the-art image generation and editing model. 🤯 From photorealistic masterpieces to mind-bending fantasy worlds, you can now natively produce, edit and refine visuals with new levels of reasoning, control and creativity. A quick dive into Gemini 2.5 Flash’s capabilities 🧵

Keeping characters consistent in your AI films can be difficult. Flow's Ingredients to Video feature makes character consistency a breeze - and better yet, it's now available to both Ultra AND Pro users! 🔥

In filmmaking, consistency is key to a strong narrative. Using Flow, you can build stories with consistent characters, scenes, and objects across multiple clips, giving you more creative control.

Veo Ingredients are now available in @FlowbyGoogle to all Google AI Pro plan subscribers! We've been working hard on this capability over the past couple of months. Excited to get this into the hands of everyone! Get consistent characters, objects and scenes into your worlds, like in this fun little lunar exploration project I put together in just a few minutes 🧑‍🚀