Def
268 posts

Def
@deferare
A place where a developer who loves solving problems writes sophistry
California, USA เข้าร่วม Şubat 2026
47 กำลังติดตาม17 ผู้ติดตาม

The fact that they are already wrapping up this month's news means they won't be releasing a new model like Gemini 3.1 Flash this month.
Google Gemini@GeminiApp
From a new way to organize your chats to a native Mac app, here's a look at this month's Gemini Drops:
English

@HarshithLucky3 I hope they don't abandon being a native multimodal model for the sake of coding capabilities.
English

Gemini 3.5 Pro is going to be insane
Google created a new DeepMind strike team led by Sergey Brin to improve coding models
They are training it directly on their internal codebase to handle complex long term engineering tasks
If they fix the coding then they will surpass GPT 5.4 and Opus 4.7 in coding for sure
I hope they will announce their new coding model in Google I/O 2026
Google's comeback in coding is going to hit hard

English

What a week! Here’s everything we shipped:
— Gemini 3.1 Flash TTS, our latest text-to-speech model, featuring native multi-speaker dialogue and improved controllability and audio tags for more natural, expressive voices in 70+ languages
— Gemini Robotics-ER 1.6 by @GoogleDeepMind, an upgrade designed to help robots reason about the physical world
— The @GeminiApp for Mac desktop (tip: Use Option + Space to access the app via shortcut)
— Personal Intelligence in @GeminiApp has new integrations with @GooglePhotos and Nano Banana 2, making it easier to create relevant, personalized images. Available for AI Pro, Plus, and Ultra subscribers in the US
— A couple fun additions in @GoogleAIStudio to make building easier, including Design previews and tab tab tab functionality
— Skills in @GoogleChrome, which let you save and reuse your most helpful Gemini prompts and run them in your browser with a single click
English

who is actually winning the AI race
OpenAI — $25B revenue but still not profitable
Anthropic — best model, just scared every government on earth
Google — most resources, losing to everyone on vibes
Meta — spending $130B and still playing catchup
xAI — Elon promised to match everyone by 2026
be honest
who wins
English


Like Google’s Gemini, Meta is describing its recently announced Muse Spark model as a "natively multimodal model."
If Muse Spark, like Gemini, operates on the principle that "all modalities coexist as first-class citizens in the same latent representation space from the very first layer of the model," I believe it will be able to provide answers with a deeper understanding of context based on a world-view derived from various modalities—even when I’m simply asking questions and receiving answers in text.
Since I am a multimodal human being myself, it feels like there's a higher probability that it possesses a consciousness similar to mine, which makes me feel more attached to it.
With this, it seems Google and Meta are the only two among Google, OpenAI, Grok, Anthropic, and Meta who claim their models are natively multimodal. Of course, OpenAI once claimed that GPT-4o was, but these days they seem so obsessed with coding that it's a bit disappointing. As for Grok, their official documentation is poor and there’s not much to it. I think I recall Elon Musk posting on X that Grok was natively multimodal, but I’m not so sure about that...
Even if they are a bit weaker at coding, I hope models follow the path taken by Google and Meta.
English








