Ali Razavi

180 posts

Ali Razavi

Ali Razavi

@catamorphist

Research Scientist @GoogleDeepmind Working on generative models (Veo, Imagen,...) in the GenMedia team.

San Francisco, CA Katılım Temmuz 2010
964 Takip Edilen562 Takipçiler
Ali Razavi retweetledi
Google DeepMind
Google DeepMind@GoogleDeepMind·
We’re dropping Gemini Omni: our first step towards a model that can create anything from anything - starting with video. It combines Gemini’s intelligence with our generative media systems - representing a leap forward in world understanding, multimodality, and editing 🧵
English
390
1.3K
8.3K
1.1M
Ali Razavi retweetledi
koray kavukcuoglu
koray kavukcuoglu@koraykv·
Gemini 3 Pro 🤝 Nano Banana Pro New SOTA image generation and editing. 🍌Production-ready visuals with improved precision and control 🍌Images grounded in Gemini’s real world understanding 🍌Superior text rendering 🍌Localization in multiple languages 🍌Physically accurate lighting and dynamics. And character consistency across 14 (fluffy) inputs! blog.google/technology/ai/…
koray kavukcuoglu tweet media
English
37
75
567
103.7K
Ali Razavi retweetledi
Stefano Ermon
Stefano Ermon@StefanoErmon·
Tired of chasing references across dozens of papers? This monograph distills it all: the principles, intuition, and math behind diffusion models. Thrilled to share!
Chieh-Hsin (Jesse) Lai@JCJesseLai

Tired to go back to the original papers again and again? Our monograph: a systematic and fundamental recipe you can rely on! 📘 We’re excited to release 《The Principles of Diffusion Models》— with @DrYangSong, @gimdong58085414, @mittu1204, and @StefanoErmon. It traces the core ideas that shaped diffusion modeling and explains how today’s models work, why they work, and where they’re heading. 🧵You’ll find the link and a few highlights in the thread. We’d love to hear your thoughts and join some discussions! ⚡ Stay tuned for our markdown version, where you can drop your comments!

English
13
132
1.1K
126.8K
Jacob Austin
Jacob Austin@jacobaustin132·
The idea that our economy basically exists to enable corporate extraction of rents from ordinary people has gone from far-fetched to highly plausible in the last decade, and AI plays a core role in this story: both as a mechanism for consolidation and an excuse for it
English
1
0
7
1.2K
Ali Razavi retweetledi
Demis Hassabis
Demis Hassabis@demishassabis·
Awesome to see Veo 3.1 top the LMArena video leaderboards by a large distance with big improvements over Veo 3.0 for text-to-video (+30) and image-to-video (+70)! 🔥Huge congrats to the team! Try it for yourself in flow.google and the @GeminiApp
Arena.ai@arena

🚨🎬 Big news from Video Arena! @GoogleDeepMind’s latest Veo 3.1 now ranks #1 in both Text-to-Video and Image-to-Video leaderboards. 🏆 This is a +30-point leap from Veo 3.0 → 3.1, making it the first model to break 1400 in Video Arena history! Huge congrats to the @GoogleDeepMind team for pushing the frontier of video generation forward! More details in the thread 🧵

English
49
107
1K
198.4K
Ali Razavi retweetledi
Google DeepMind
Google DeepMind@GoogleDeepMind·
Veo is getting new precision editing capabilities that let you easily add or remove elements from a scene - all while preserving the integrity of your original video. 🎥
English
94
267
2.2K
266.7K
Ali Razavi retweetledi
Alex Kontorovich
Alex Kontorovich@AlexKontorovich·
Congratulations to Jesse, Jared @jdlichtman, and Christian @ChrSzegedy on this great result! (They told me and Terry about it weeks ago, but released it while I was giving a lecture series in Italy last week, followed by speaking at a conference this week at Harvard -- where I got to chat some more with Jared; so I’m only now getting around to perusing the blueprint+code.) What I’m impressed by: (cont'd)
Math, Inc.@mathematics_inc

Today we're announcing Gauss, our first autoformalization agent that just completed Terry Tao & Alex Kontorovich's Strong Prime Number Theorem project in 3 weeks—an effort that took human experts 18+ months of partial progress.

English
8
28
194
33.2K
Ali Razavi retweetledi
Aleksander Holynski
Aleksander Holynski@holynski_·
Something fun we discovered: you can use #Genie3 to step into and explore your favorite paintings. Here's a short visit to Edward Hopper's "Nighthawks".
English
1.2K
1.1K
11.1K
9.7M
Ali Razavi retweetledi
Ali Razavi retweetledi
Jakob Bauer
Jakob Bauer@jkbr_ai·
Yesterday we announced Genie 3. One feature of the model that's especially fun to play with is starting worlds from existing videos. Here's a drone shot generated by Veo 3, with me taking control mid-flight.
Google DeepMind@GoogleDeepMind

What if you could not only watch a generated video, but explore it too? 🌐 Genie 3 is our groundbreaking world model that creates interactive, playable environments from a single text prompt. From photorealistic landscapes to fantasy realms, the possibilities are endless. 🧵

English
92
219
2.5K
702.7K
Ali Razavi retweetledi
Agrim Gupta
Agrim Gupta@agrimgupta92·
Introducing Genie 3, our state-of-the-art world model that generates interactive worlds from text, enabling real-time interaction at 24 fps with minutes-long consistency at 720p. 🧵👇
English
66
171
1.4K
458.5K
Ali Razavi retweetledi
Shlomi Fruchter
Shlomi Fruchter@shlomifruchter·
Excited to introduce Genie 3, our general purpose world model that creates interactive, playable environments from any text prompt. It can generate dynamic worlds at 720p and 24 FPS, with each frame created in response to user actions in *real-time*.
Google DeepMind@GoogleDeepMind

What if you could not only watch a generated video, but explore it too? 🌐 Genie 3 is our groundbreaking world model that creates interactive, playable environments from a single text prompt. From photorealistic landscapes to fantasy realms, the possibilities are endless. 🧵

English
41
72
604
92.5K
Ali Razavi
Ali Razavi@catamorphist·
So sad to hear we lost the legendary theatre icon Robert Wilson. Einstein on the Beach 13 yrs ago was a truly formative experience. I went for Philip Glass' music (loved) but Wilson's stage design blew my mind! Sharing a few scenes from works I had the privilege to see. RIP.
Ali Razavi tweet mediaAli Razavi tweet mediaAli Razavi tweet mediaAli Razavi tweet media
English
0
3
10
8.6K
Ali Razavi retweetledi
Jim Fan
Jim Fan@DrJimFan·
I'm observing a mini Moravec's paradox within robotics: gymnastics that are difficult for humans are much easier for robots than "unsexy" tasks like cooking, cleaning, and assembling. It leads to a cognitive dissonance for people outside the field, "so, robots can parkour & breakdance, but why can't they take care of my dog?" Trust me, I got asked by my parents about this more than you think ... The "Robot Moravec's paradox" also creates the illusion that physical AI capabilities are way more advanced than they truly are. I'm not singling out Unitree, as it applies widely to all recent acrobatic demos in the industry. Here's a simple test: if you set up a wall in front of the side-flipping robot, it will slam into it at full force and make a spectacle. Because it's just overfitting that single reference motion, without any awareness of the surroundings. Here's why the paradox exists: it's much easier to train a "blind gymnast" than a robot that sees and manipulates. The former can be solved entirely in simulation and transferred zero-shot to the real world, while the latter demands extremely realistic rendering, contact physics, and messy real-world object dynamics - none of which can be simulated well. Imagine you can train LLMs not from the internet, but from a purely hand-crafted text console game. Roboticists got lucky. We happen to live in a world where accelerated physics engines are so good that we can get away with impressive acrobatics using literally zero real data. But we haven't yet discovered the same cheat code for general dexterity. Till then, we'll still get questioned by our confused parents.
English
144
549
2.5K
397.7K
Ali Razavi retweetledi
Demis Hassabis
Demis Hassabis@demishassabis·
Btw as an aside, we didn’t announce on Friday because we respected the IMO Board's original request that all AI labs share their results only after the official results had been verified by independent experts & the students had rightly received the acclamation they deserved
English
36
121
2K
292.1K
Ali Razavi retweetledi
Google DeepMind
Google DeepMind@GoogleDeepMind·
An advanced version of Gemini with Deep Think has officially achieved gold medal-level performance at the International Mathematical Olympiad. 🥇 It solved 5️⃣ out of 6️⃣ exceptionally difficult problems, involving algebra, combinatorics, geometry and number theory. Here’s how 🧵
Google DeepMind tweet media
English
152
691
4.3K
1.1M
Ali Razavi retweetledi
Pauline Luc
Pauline Luc@paulineluc_·
So pleased and proud to share with you what our team has been up to, on an ambitious journey to build a video foundation model for scientific domains ! ✨ 🚀 🎞️ 🧪 #ICCV2025 #AI4Science
Yana Hasson@yanahasson

Thrilled to share our latest work on SciVid, to appear at #ICCV2025! 🎉 SciVid offers cross-domain evaluation of video models in scientific applications, including medical CV, animal behavior, & weather forecasting 🧪🌍📽️🪰🐭🫀🌦️ #AI4Science #FoundationModel #CV4Science [1/5]🧵

English
0
1
15
463