Tris Warkentin

140 posts

Tris Warkentin

@triswarkentin

Tech builder, Google DeepMind Product Management Director, and all-around happy guy. Launched Gemma, Bard, Imagen, and many other neat AI things.

SF Bay Area, CA Katılım Şubat 2010

136 Takip Edilen1.1K Takipçiler

Sabitlenmiş Tweet

Tris Warkentin@triswarkentin·12 Mar

Today, we launch Gemma 3 in Paris! Excited to share the strongest model in the world that can run on a single GPU -- outperforming Llama-405B, DeepSeek-v3, and even o3-mini. Multilingual in over 140 languages with a 128 context window and multimodality! 🔥

English

7.9K

Tris Warkentin retweetledi

NVIDIA Robotics@NVIDIARobotics·10 Mar

Open models are moving beyond the data center and into the physical world. 🤖 Models like @Alibaba_Qwen, @Google's Gemma, @MistralAI’s Mistral, @OpenAI’s GPT‑OSS, @physical_int’s Pi, and others now run locally on NVIDIA Jetson, powering everything from industrial machines to personal AI assistants built with OpenClaw. 🦞 See how developers are optimizing open models on Jetson to build real-time physical AI at the edge. Read the blog 🔗 nvda.ws/4bbKH2h

English

125

6.3K

Tris Warkentin@triswarkentin·27 Şub

Banana + Banana! (ft me + the amazing PMs behind Nano Banana 2)

English

391

Tris Warkentin retweetledi

Kiran Vodrahalli@kiranvodrahalli·19 Şub

I'm happy to announce that we are open sourcing the MRCR v2 datasets we report on Gemini releases! github.com/google-deepmin…

Kiran Vodrahalli@kiranvodrahalli

Happy to share Michelangelo (arxiv.org/abs/2409.12640), a long-context reasoning benchmark which measures performance beyond needle tasks up to arbitrary context lengths and remains challenging for frontier models. Stay tuned for more Michelangelo evals to come!

English

24.5K

Tris Warkentin@triswarkentin·19 Şub

@PreciousBa82157 @sundarpichai GA coming soon, see blog.google/innovation-and…

English

490

Precious Balogun@PreciousBa82157·19 Şub

@sundarpichai A new model is out while the previous model is still in preview,only Google can pull this off

English

134

15.6K

Sundar Pichai@sundarpichai·19 Şub

Gemini 3.1 Pro is here. Hitting 77.1% on ARC-AGI-2, it’s a step forward in core reasoning (more than 2x 3 Pro). With a more capable baseline, it’s great for super complex tasks like visualizing difficult concepts, synthesizing data into a single view, or bringing creative projects to life. We’re shipping 3.1 Pro across our consumer and developer products to bring this underlying leap in intelligence to your everyday applications right away. Rolling out now to: - Developers in preview via the Gemini API in @GoogleAIStudio - Enterprises in Vertex AI and Gemini Enterprise - Everyone through the @Geminiapp and @NotebookLM

English

566

1.1K

10.2K

1.3M

Tris Warkentin@triswarkentin·19 Şub

@yvesai0 @OfficialLoganK Yes, factuality was a major target of this release.

English

165

yves@yvesai0·19 Şub

@OfficialLoganK Looks really good. Has the hallucination rate and tool call usage been fixed as well ?

English

12.2K

Logan Kilpatrick@OfficialLoganK·19 Şub

Introducing Gemini 3.1 Pro, our new SOTA model across most reasoning, coding, and stem use cases!

English

568

591

7.5K

635.7K

Tris Warkentin@triswarkentin·19 Şub

Gemini 3.1 Pro launches today -- and sets a new state-of-the-art mark for a foundation model intelligence. Try it today across Gemini App, AI Studio, Antigravity, or Vertex!

Artificial Analysis@ArtificialAnlys

Google is once again the leader in AI: Gemini 3.1 Pro Preview leads the Artificial Analysis Intelligence Index, 4 points ahead of Claude Opus 4.6 while costing less than half as much to run @GoogleDeepMind gave us pre-release access to Gemini 3.1 Pro Preview. It leads 6 of the 10 evaluations that make up the Artificial Analysis Intelligence Index and improves significantly over Gemini 3 Pro Preview across capabilities, with the biggest gains in reasoning and knowledge, coding, and hallucination reduction. Gemini 3.1 Pro Preview also remains relatively token efficient, using ~57M tokens to run the Artificial Analysis Intelligence Index (+1M from Gemini 3 Pro Preview), lower than other frontier models at max reasoning settings such as Opus 4.6 (max) and GPT-5.2 (xhigh). Combined with lower per-token pricing, Gemini 3.1 Pro Preview is cost-efficient among frontier peers, costing less than half as much as Opus 4.6 (max) to run the full Intelligence Index, though still nearly 2x the leading open-weights model, GLM-5. Key Takeaways: ➤ State-of-the-art intelligence at lower costs: Gemini 3.1 Pro Preview is leading 6 of the 10 evaluations that make up the Artificial Analysis Intelligence Index at less than half the cost to run of frontier peers from @OpenAI and @AnthropicAI. It obtains the highest score in Terminal-Bench Hard (agentic coding), AA-Omniscience (knowledge & hallucination), Humanity’s Last Exam (reasoning & knowledge), GPQA-Diamond (scientific reasoning), SciCode (coding) and CritPt (research-level physics). The CritPt score is particularly notable, scoring 18% on unpublished, research-level physics reasoning problems, over 5 p.p. above the next best model ➤ Improved real-world agentic performance, but not leading: Gemini 3.1 Pro Preview shows an improvement in GDPval-AA, our agentic evaluation focusing on real-world tasks, but is still not the leading model in this area. The model increases its ELO score over 100 points to 1316 (up from Gemini 3 Pro Preview), however still sits behind Claude Sonnet 4.6, Opus 4.6, GPT-5.2 (xhigh), and GLM-5 ➤ Leading coding abilities: Gemini 3.1 Pro Preview leads the Artificial Analysis Coding Index, achieving the highest score in both Terminal-Bench Hard (54%) and SciCode (59%) ➤ Reduced hallucinations: Gemini 3.1 Pro Preview shows a major improvement in tendency to guess incorrectly when it doesn’t know the answer, reducing its AA-Omniscience hallucination rate by 38 p.p. from Gemini 3 Pro Preview ➤ Maintained token and cost efficiency: Gemini 3.1 Pro Preview improves without material increases in cost or token usage. It uses only ~2% more tokens to run the Artificial Analysis Intelligence Index than Gemini 3 Pro Preview, and keeps the same pricing ($2/$12 per 1M input/output tokens for ≤200k context). Its cost to run the Artificial Analysis Intelligence Index of $892 is less than half of frontier models such as Opus 4.6 (max) and GPT-5.2 (xhigh), though still ~2x the cost of leading open weights models such as GLM 5 ($547) ➤ Google takes top 3 spots in multi-modality: Gemini 3.1 Pro Preview ranks #1 on MMMU-Pro, our multimodal understanding and reasoning benchmark, ahead of Gemini 3 Pro Preview and Gemini 3 Flash, reinforcing Google’s leadership in multimodal reasoning ➤ Other model details: Gemini 3.1 Pro Preview retains the same 1 million token context window as its predecessor, and includes support for tool calling, structured outputs, and JSON mode

English

1.1K

Tris Warkentin@triswarkentin·12 Şub

Today, we take another big step towards safe, useful AGI -- Gemini Deep Think 3 sets new state-of-the-art marks for very difficult evals, including ARC-AGI-2 and Humanity's Last Exam. It's just another Thursday. Read more: blog.google/innovation-and…

English

1.7K

Tris Warkentin@triswarkentin·19 Ara

@GoogleResearch @ymatias Amazing to see 3 Gemma models on this infographic - VaultGemma, MedGemma, and C2S-Scale! Looking forward to more great research together in 2026 🥳

English

613

Google Research@GoogleResearch·19 Ara

In 2025, the magic cycle of research accelerated. Google Research teams delivered pioneering breakthroughs and brought our research to reality with impact on products, science, and society. More from @ymatias, VP and Head of Google Research →goo.gle/499MMJQ

English

127

735

178.9K

Tris Warkentin@triswarkentin·11 Ara

Absolutely amazing projects built for mobile with Gemma 3n -- take a look! I'm genuinely blown away.

Google AI Developers@googleaidevs

Introducing the winners of the Gemma 3n Impact Challenge on @kaggle 🏆(🧵) Learn how the global developer community built with Gemma to address critical challenges and make a difference in people’s lives through the potential of on-device, multimodal AI. blog.google/technology/dev…

English

456

Tris Warkentin retweetledi

Jeff Dean@JeffDean·18 Kas

Also, we're exploring new ways to create games using Gemini 3 - we've created and published a few experimental YouTube Playables. Each of the games shown in the video below was created by Gemini 3 from scratch w/a few natural language prompts/images. Try them out at @googledeepmind/playables" target="_blank" rel="nofollow noopener">youtube.com/@googledeepmin… 👾👾

English

201

2.1K

774.2K

Tris Warkentin retweetledi

ollama@ollama·18 Kas

ollama run gemini-3-pro-preview 🧠 State-of-the-art reasoning 🖼️ Deep multimodal understanding 💻 Powerful vibe coding so you can go from prompt to app in one shot ⭐ Improved agentic capabilities, so it can get things done on your behalf, at your direction Gemini 3 Pro is available on Ollama's Cloud Max plan. We are working on expanding availability across Ollama. Learn more about Ollama's cloud 👇👇👇

English

728

191.9K

Tris Warkentin@triswarkentin·17 Kas

@R__zon @vasuman That would be the inimitable Jeff Dean, no big deal.

English

dAvId@R__zon·17 Kas

@vasuman Props to the team and the member who came up with the name Gemini. Fits so eloquently.

English

1.4K

vas@vasuman·17 Kas

I lied, there is no Gemini 3.0 Welcome back, Bard

English

4.3K

363.5K

Tris Warkentin@triswarkentin·17 Kas

@ChannelRSL1 @vasuman It was not intended to be the long-term name of the product in the first place -- I came up with it as a project codename... (Though I did love the nerdy Shakespeare reference and even made a Shakespeare logo for the project)

English

366

H@Mr_Hy4·17 Kas

@vasuman Bard was such a bad name 😭 what was google thinking

English

201

27.7K

Tris Warkentin@triswarkentin·13 Kas

Ready to learn how to Prompt Like A Pro? Join me for the next Gemini Discord event today, 11/13, at 11:30am PT! I will spotlight real-life use cases to help you take your prompting skills to the next level. Join us on the Gemini Discord! discord.gg/gemini

English

290

Tris Warkentin retweetledi

AI at AMD@AIatAMD·7 Kas

"You have an important role to play, no matter who you are, in figuring out how it is that we roll out AI for everyone." Hear more from @triswarkentin, Director of Product Management at Google DeepMind 👇

English

Tris Warkentin@triswarkentin·16 Eki

@kevinweil @sundarpichai Thanks, Kevin -- one of my favorite projects in the Gemmaverse =)

English

Kevin Weil 🇺🇸@kevinweil·16 Eki

This is rad. Congrats @sundarpichai and team!

Sundar Pichai@sundarpichai

An exciting milestone for AI in science: Our C2S-Scale 27B foundation model, built with @Yale and based on Gemma, generated a novel hypothesis about cancer cellular behavior, which scientists experimentally validated in living cells. With more preclinical and clinical tests, this discovery may reveal a promising new pathway for developing therapies to fight cancer.

English

945

158.9K

Tris Warkentin retweetledi

prinz@deredleritt3r·15 Eki

Just to recap: We found out today that an LLM that fits on a high-end consumer GPU, when trained on specific biological data, can discover a novel method to make cancer tumors more responsive to immunotherapy. Confirmed novel discovery (not present in existing literature). Experimentally validated in living cells. This is AI generating novel science. The moment has finally arrived.

prinz@deredleritt3r

Google and Yale scientists have trained an LLM that has generated a novel hypothesis about cancer cellular behavior. This prediction was confirmed multiple times in vitro. - "What made this prediction so exciting was that it was a novel idea. Although CK2 has been implicated in many cellular functions, including as a modulator of the immune system, inhibiting CK2 via silmitasertib has not been reported in the literature to explicitly enhance MHC-I expression or antigen presentation. This highlights that the model was generating a new, testable hypothesis, and not just repeating known facts." The model that generated this prediction is a 27B-parameter LLM based on the Google Gemma open source models, and trained on a corpus comprising >1B tokens of transcriptomic data, biological text, and metadata. Quite remarkable that a small (just 27B) LLM trained on specialized data is able to make novel scientific discoveries. "Teams at Yale are now exploring the mechanism uncovered here and testing additional AI-generated predictions in other immune contexts. With further preclinical and clinical validation, such hypotheses may be able to ultimately accelerate the path to new therapies."

English

130

809

925.8K

Tris Warkentin@triswarkentin·15 Eki

One of my favorite launches of 2025 -- an unreasonably long list -- is a Gemma variant that shows promise for cancer treatment. Huge congratulations to @AziziShekoofeh and the broad research team!

Google AI Developers@googleaidevs

⚕️ Introducing C2S-Scale 27B, our new Gemma open model that can translate complex single-cell gene expression data into “cell sentences” that LLMs can understand.

English

1.7K

Tris Warkentin retweetledi

Sundar Pichai@sundarpichai·15 Eki

English

545

3.3K

21.9K

6.9M

Tris Warkentin@triswarkentin·10 Eyl

Benchmarking shouldn't be only multiple choice tests -- actual model behavior in resource-constrained environments will revolutionize model optimization. Excited to see these first steps; congratulations @kradleai !

Kradle@kradleai

Hello World, we’re Kradle.ai We eval frontier models by putting them in simulations. So what happens when 6 frontier models compete in #Minecraft for GPUs? Video and 🧵

English

289

Keşfet

@Alibaba_Qwen @Google @MistralAI @OpenAI @physical_int @PreciousBa82157 @sundarpichai @GoogleAIStudio