Ali Razavi

180 posts

@catamorphist

Research Scientist @GoogleDeepmind Working on generative models (Veo, Imagen,...) in the GenMedia team.

San Francisco, CA · Joined July 2010

964 Following · 562 Followers

Ali Razavi retweeted

Google DeepMind@GoogleDeepMind·2d

We’re dropping Gemini Omni: our first step towards a model that can create anything from anything - starting with video. It combines Gemini’s intelligence with our generative media systems - representing a leap forward in world understanding, multimodality, and editing 🧵

390

1.3K

8.3K

1.1M

Ali Razavi retweeted

koray kavukcuoglu@koraykv·Nov 20

Gemini 3 Pro 🤝 Nano Banana Pro New SOTA image generation and editing. 🍌Production-ready visuals with improved precision and control 🍌Images grounded in Gemini’s real world understanding 🍌Superior text rendering 🍌Localization in multiple languages 🍌Physically accurate lighting and dynamics. And character consistency across 14 (fluffy) inputs! blog.google/technology/ai/…

567

103.7K

Ali Razavi retweeted

Demis Hassabis@demishassabis·Nov 14

also this is amazingly cool: SIMA 2 playing in the mind of Genie 3: x.com/GoogleDeepMind…

Google DeepMind@GoogleDeepMind

SIMA 2 🤝 Genie 3 We tested SIMA 2’s abilities in simulated 3D worlds created by our world model Genie 3. It demonstrated unprecedented adaptability by navigating its surroundings and took meaningful steps toward goals.

175

38K

Ali Razavi retweeted

Stefano Ermon@StefanoErmon·Oct 29

Tired of chasing references across dozens of papers? This monograph distills it all: the principles, intuition, and math behind diffusion models. Thrilled to share!

Chieh-Hsin (Jesse) Lai@JCJesseLai

Tired to go back to the original papers again and again? Our monograph: a systematic and fundamental recipe you can rely on! 📘 We’re excited to release 《The Principles of Diffusion Models》— with @DrYangSong, @gimdong58085414, @mittu1204, and @StefanoErmon. It traces the core ideas that shaped diffusion modeling and explains how today’s models work, why they work, and where they’re heading. 🧵You’ll find the link and a few highlights in the thread. We’d love to hear your thoughts and join some discussions! ⚡ Stay tuned for our markdown version, where you can drop your comments!

132

1.1K

126.8K

Ali Razavi@catamorphist·Oct 28

@jacobaustin132 AI can have the opposite effect as well? economist.com/finance-and-ec…

110

Jacob Austin@jacobaustin132·Oct 28

The idea that our economy basically exists to enable corporate extraction of rents from ordinary people has gone from far-fetched to highly plausible in the last decade, and AI plays a core role in this story: both as a mechanism for consolidation and an excuse for it

1.2K

Ali Razavi retweeted

Demis Hassabis@demishassabis·Oct 21

Awesome to see Veo 3.1 top the LMArena video leaderboards by a large distance with big improvements over Veo 3.0 for text-to-video (+30) and image-to-video (+70)! 🔥Huge congrats to the team! Try it for yourself in flow.google and the @GeminiApp

Arena.ai@arena

🚨🎬 Big news from Video Arena! @GoogleDeepMind’s latest Veo 3.1 now ranks #1 in both Text-to-Video and Image-to-Video leaderboards. 🏆 This is a +30-point leap from Veo 3.0 → 3.1, making it the first model to break 1400 in Video Arena history! Huge congrats to the @GoogleDeepMind team for pushing the frontier of video generation forward! More details in the thread 🧵

107

198.4K

Ali Razavi retweeted

Google DeepMind@GoogleDeepMind·Oct 20

Veo is getting new precision editing capabilities that let you easily add or remove elements from a scene - all while preserving the integrity of your original video. 🎥

267

2.2K

266.7K

Ali Razavi retweeted

Alex Kontorovich@AlexKontorovich·Sep 19

Congratulations to Jesse, Jared @jdlichtman, and Christian @ChrSzegedy on this great result! (They told me and Terry about it weeks ago, but released it while I was giving a lecture series in Italy last week, followed by speaking at a conference this week at Harvard -- where I got to chat some more with Jared; so I’m only now getting around to perusing the blueprint+code.) What I’m impressed by: (cont'd)

Math, Inc.@mathematics_inc

Today we're announcing Gauss, our first autoformalization agent that just completed Terry Tao & Alex Kontorovich's Strong Prime Number Theorem project in 3 weeks—an effort that took human experts 18+ months of partial progress.

194

33.2K

Ali Razavi retweeted

Google DeepMind@GoogleDeepMind·Sep 18

We’re announcing a major advance in the study of fluid dynamics with AI 💧 in a joint paper with researchers from @BrownUniversity, @nyuniversity and @Stanford.

180

720

1.1M

Ali Razavi retweeted

Aleksander Holynski@holynski_·Aug 8

Something fun we discovered: you can use #Genie3 to step into and explore your favorite paintings. Here's a short visit to Edward Hopper's "Nighthawks".

1.2K

1.1K

11.1K

9.7M

Ali Razavi retweeted

Emma Wang@yu_emma_wang·Aug 5

It's one of the most interesting serving problems to solve. Thanks so much @jparkerholder @shlomifruchter and Stephen Spencer for the opportunity to work on Genie 3. It's an amazing team and I had so much fun ❤️ Also thanks to @tink_expo_ @ShanHan3290 @cip_baetu @vadikrobot

Demis Hassabis@demishassabis

Genie 3 is here - it can generate an entire world simulation that you can interact with in real-time, just from a text prompt! It's pretty mind-blowing really when you stop to think about it, and it's rapidly improving - one day we will be able to build the Holodeck for real!

3.3K

Ali Razavi retweeted

Jakob Bauer@jkbr_ai·Aug 6

Yesterday we announced Genie 3. One feature of the model that's especially fun to play with is starting worlds from existing videos. Here's a drone shot generated by Veo 3, with me taking control mid-flight.

Google DeepMind@GoogleDeepMind

What if you could not only watch a generated video, but explore it too? 🌐 Genie 3 is our groundbreaking world model that creates interactive, playable environments from a single text prompt. From photorealistic landscapes to fantasy realms, the possibilities are endless. 🧵

219

2.5K

702.7K

Ali Razavi retweeted

Agrim Gupta@agrimgupta92·Aug 5

Introducing Genie 3, our state-of-the-art world model that generates interactive worlds from text, enabling real-time interaction at 24 fps with minutes-long consistency at 720p. 🧵👇

171

1.4K

458.5K

Ali Razavi retweeted

Shlomi Fruchter@shlomifruchter·Aug 5

Excited to introduce Genie 3, our general purpose world model that creates interactive, playable environments from any text prompt. It can generate dynamic worlds at 720p and 24 FPS, with each frame created in response to user actions in *real-time*.

Google DeepMind@GoogleDeepMind

604

92.5K

Ali Razavi@catamorphist·Aug 1

So sad to hear we lost the legendary theatre icon Robert Wilson. Einstein on the Beach 13 yrs ago was a truly formative experience. I went for Philip Glass' music (loved) but Wilson's stage design blew my mind! Sharing a few scenes from works I had the privilege to see. RIP.

8.6K

Ali Razavi retweeted

Jim Fan@DrJimFan·Jul 25

I'm observing a mini Moravec's paradox within robotics: gymnastics that are difficult for humans are much easier for robots than "unsexy" tasks like cooking, cleaning, and assembling. It leads to a cognitive dissonance for people outside the field, "so, robots can parkour & breakdance, but why can't they take care of my dog?" Trust me, I got asked by my parents about this more than you think ...

The "Robot Moravec's paradox" also creates the illusion that physical AI capabilities are way more advanced than they truly are. I'm not singling out Unitree, as it applies widely to all recent acrobatic demos in the industry. Here's a simple test: if you set up a wall in front of the side-flipping robot, it will slam into it at full force and make a spectacle. Because it's just overfitting that single reference motion, without any awareness of the surroundings.

Here's why the paradox exists: it's much easier to train a "blind gymnast" than a robot that sees and manipulates. The former can be solved entirely in simulation and transferred zero-shot to the real world, while the latter demands extremely realistic rendering, contact physics, and messy real-world object dynamics - none of which can be simulated well. Imagine you can train LLMs not from the internet, but from a purely hand-crafted text console game.

Roboticists got lucky. We happen to live in a world where accelerated physics engines are so good that we can get away with impressive acrobatics using literally zero real data. But we haven't yet discovered the same cheat code for general dexterity. Till then, we'll still get questioned by our confused parents.

144

549

2.5K

397.7K

Ali Razavi@catamorphist·Jul 22

The dream job. Literally.

Dumitru Erhan@doomie

Want to be part of a team redefining SOTA for generative video models? Excited about building models that can reach billions of users? The Veo team is hiring! We are looking for amazing researchers and engineers, in North America and Europe. Details below:

523

Ali Razavi retweeted

Demis Hassabis@demishassabis·Jul 21

Btw as an aside, we didn’t announce on Friday because we respected the IMO Board's original request that all AI labs share their results only after the official results had been verified by independent experts & the students had rightly received the acclamation they deserved

121

292.1K

Ali Razavi retweeted

Google DeepMind@GoogleDeepMind·Jul 21

An advanced version of Gemini with Deep Think has officially achieved gold medal-level performance at the International Mathematical Olympiad. 🥇 It solved 5️⃣ out of 6️⃣ exceptionally difficult problems, involving algebra, combinatorics, geometry and number theory. Here’s how 🧵

152

691

4.3K

1.1M

Ali Razavi retweeted

Pauline Luc@paulineluc_·Jul 8

So pleased and proud to share with you what our team has been up to, on an ambitious journey to build a video foundation model for scientific domains ! ✨ 🚀 🎞️ 🧪 #ICCV2025 #AI4Science

Yana Hasson@yanahasson

Thrilled to share our latest work on SciVid, to appear at #ICCV2025! 🎉 SciVid offers cross-domain evaluation of video models in scientific applications, including medical CV, animal behavior, & weather forecasting 🧪🌍📽️🪰🐭🫀🌦️ #AI4Science #FoundationModel #CV4Science [1/5]🧵

463
