

Mohammad Saffar

@msaffar3
Research Scientist @googledeepmind, Gemini multimodal | past: @reveimage, Google Brain





The long-awaited testing phase for @Midjourney V8 has officially begun, marking a massive leap forward for the generative art platform. This latest iteration promises a significant boost in efficiency, operating at five times the speed of its predecessors while maintaining a much tighter grip on complex prompt instructions. High-resolution creators will find the native 2K modes particularly useful for professional workflows. The update also brings more reliable text rendering and enhanced "sref" styling, allowing for a level of aesthetic consistency that was previously difficult to achieve. Personalization is a major focus of this release, with improved moodboard performance to help users fine-tune their unique visual language. It is an impressive step toward making AI-assisted design both faster and more intuitive.





Advanced Machine Intelligence (AMI) is building a new breed of AI systems that understand the world, have persistent memory, can reason and plan, and are controllable and safe. We’ve raised a $1.03B (~€890M) round from global investors who believe in our vision of universally intelligent systems centered on world models. This round is co-led by Cathay Innovation, Greycroft, Hiro Capital, HV Capital, and Bezos Expeditions, along with other investors and angels across the world. We are a growing team of researchers and builders, operating in Paris, New York, Montreal and Singapore from day one. Read more: amilabs.xyz AMI - Real world. Real intelligence.

Excited to introduce Uni-1, our new *unified* multimodal model that does both understanding and generation: lumalabs.ai/uni-1 TLDR: I think Uni-1 @LumaLabsAI is > GPT Image 1.5 in many cases, and toe-to-toe with Nano Banana Pro/2. (showcase below)


Create elaborate scenes with Nano Banana 2 using 14 input images!



Mercury 2 is live 🚀🚀 The world’s first reasoning diffusion LLM, delivering 5x faster performance than leading speed-optimized LLMs. Watching the team turn years of research into a real product never gets old, and I’m incredibly proud of what we’ve built. We’re just getting started on what diffusion can do for language.

Reve v1.5 is here. Our latest image model, now with 4K resolution.



@sama Really impressive model, huge congrats to everyone who worked on it at OpenAI! However, the calendar is wrong, I fixed it for you in Nano Banana Pro 😀


For years, RAW pixel space pretraining has been sidelined: too compute-expensive. Our new @GoogleDeepMind paper 📜 dives into the scaling trends of raw pixel models to answer the question "how far are we from scaling up next-pixel prediction?" arxiv.org/pdf/2511.08704 Forecast: raw next-pixel modeling will reach competitive ImageNet classification (>80% top-1 accuracy) and generation metrics (Fréchet Distance of 90) within five years! Thread 👇
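
For readers who want a concrete picture of what next-pixel prediction means here, below is a minimal, hypothetical sketch (not the paper's code): an image is flattened into a sequence of raw 8-bit intensities, and a small causal transformer is trained to predict each pixel value from the ones before it. All names and sizes are illustrative assumptions.

import torch
import torch.nn as nn

# Toy next-pixel model for illustration only; not the paper's architecture.
class NextPixelModel(nn.Module):
    def __init__(self, d_model=256, n_layers=4, n_heads=4, vocab=256, max_len=4096):
        super().__init__()
        self.tok = nn.Embedding(vocab, d_model)        # one token per 8-bit pixel intensity
        self.pos = nn.Embedding(max_len, d_model)      # learned positions over the flattened image
        layer = nn.TransformerEncoderLayer(d_model, n_heads, batch_first=True)
        self.backbone = nn.TransformerEncoder(layer, n_layers)
        self.head = nn.Linear(d_model, vocab)          # logits over the next pixel value

    def forward(self, pixels):                         # pixels: (B, T) ints in [0, 255]
        B, T = pixels.shape
        pos = torch.arange(T, device=pixels.device)
        h = self.tok(pixels) + self.pos(pos)
        causal = torch.triu(torch.full((T, T), float("-inf"), device=pixels.device), diagonal=1)
        h = self.backbone(h, mask=causal)              # each position attends only to earlier pixels
        return self.head(h)

# One training step: shift the sequence by one so the model predicts pixel t+1 from pixels <= t.
model = NextPixelModel()
imgs = torch.randint(0, 256, (2, 3 * 8 * 8))           # toy batch of flattened 8x8 RGB images
logits = model(imgs[:, :-1])
loss = nn.functional.cross_entropy(logits.reshape(-1, 256), imgs[:, 1:].reshape(-1))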

Thinking (test-time compute) in pixel space... 🍌 Pro tip: always peek at the thoughts if you use AI Studio. Watching the model think in pictures is really fun!

You went 🍌🍌 for Nano Banana. Now, meet Nano Banana Pro. It’s SOTA for image generation + editing with more advanced world knowledge, text rendering, precision + controls. Built on Gemini 3, it’s really good at complex infographics - much like how engineers see the world:)