Tris Warkentin

140 posts

Tris Warkentin banner
Tris Warkentin

Tris Warkentin

@triswarkentin

Tech builder, Google DeepMind Product Management Director, and all-around happy guy. Launched Gemma, Bard, Imagen, and many other neat AI things.

SF Bay Area, CA Katılım Şubat 2010
136 Takip Edilen1.1K Takipçiler
Sabitlenmiş Tweet
Tris Warkentin
Tris Warkentin@triswarkentin·
Today, we launch Gemma 3 in Paris! Excited to share the strongest model in the world that can run on a single GPU -- outperforming Llama-405B, DeepSeek-v3, and even o3-mini. Multilingual in over 140 languages with a 128 context window and multimodality! 🔥
English
4
4
59
7.9K
Tris Warkentin retweetledi
NVIDIA Robotics
NVIDIA Robotics@NVIDIARobotics·
Open models are moving beyond the data center and into the physical world. 🤖 Models like @Alibaba_Qwen, @Google's Gemma, @MistralAI’s Mistral, @OpenAI’s GPT‑OSS, @physical_int’s Pi, and others now run locally on NVIDIA Jetson, powering everything from industrial machines to personal AI assistants built with OpenClaw. 🦞 See how developers are optimizing open models on Jetson to build real-time physical AI at the edge. Read the blog 🔗 nvda.ws/4bbKH2h
NVIDIA Robotics tweet media
English
4
17
125
6.3K
Tris Warkentin
Tris Warkentin@triswarkentin·
Banana + Banana! (ft me + the amazing PMs behind Nano Banana 2)
Tris Warkentin tweet media
English
0
0
16
391
Tris Warkentin retweetledi
Kiran Vodrahalli
Kiran Vodrahalli@kiranvodrahalli·
I'm happy to announce that we are open sourcing the MRCR v2 datasets we report on Gemini releases! github.com/google-deepmin…
Kiran Vodrahalli@kiranvodrahalli

Happy to share Michelangelo (arxiv.org/abs/2409.12640), a long-context reasoning benchmark which measures performance beyond needle tasks up to arbitrary context lengths and remains challenging for frontier models. Stay tuned for more Michelangelo evals to come!

English
2
9
60
24.5K
Precious Balogun
Precious Balogun@PreciousBa82157·
@sundarpichai A new model is out while the previous model is still in preview,only Google can pull this off
Precious Balogun tweet media
English
6
0
134
15.6K
Sundar Pichai
Sundar Pichai@sundarpichai·
Gemini 3.1 Pro is here. Hitting 77.1% on ARC-AGI-2, it’s a step forward in core reasoning (more than 2x 3 Pro). With a more capable baseline, it’s great for super complex tasks like visualizing difficult concepts, synthesizing data into a single view, or bringing creative projects to life. We’re shipping 3.1 Pro across our consumer and developer products to bring this underlying leap in intelligence to your everyday applications right away. Rolling out now to: - Developers in preview via the Gemini API in @GoogleAIStudio - Enterprises in Vertex AI and Gemini Enterprise - Everyone through the @Geminiapp and @NotebookLM
Sundar Pichai tweet media
English
566
1.1K
10.2K
1.3M
yves
yves@yvesai0·
@OfficialLoganK Looks really good. Has the hallucination rate and tool call usage been fixed as well ?
English
3
0
11
12.2K
Logan Kilpatrick
Logan Kilpatrick@OfficialLoganK·
Introducing Gemini 3.1 Pro, our new SOTA model across most reasoning, coding, and stem use cases!
Logan Kilpatrick tweet media
English
568
591
7.5K
635.7K
Tris Warkentin
Tris Warkentin@triswarkentin·
Gemini 3.1 Pro launches today -- and sets a new state-of-the-art mark for a foundation model intelligence. Try it today across Gemini App, AI Studio, Antigravity, or Vertex!
Artificial Analysis@ArtificialAnlys

Google is once again the leader in AI: Gemini 3.1 Pro Preview leads the Artificial Analysis Intelligence Index, 4 points ahead of Claude Opus 4.6 while costing less than half as much to run @GoogleDeepMind gave us pre-release access to Gemini 3.1 Pro Preview. It leads 6 of the 10 evaluations that make up the Artificial Analysis Intelligence Index and improves significantly over Gemini 3 Pro Preview across capabilities, with the biggest gains in reasoning and knowledge, coding, and hallucination reduction. Gemini 3.1 Pro Preview also remains relatively token efficient, using ~57M tokens to run the Artificial Analysis Intelligence Index (+1M from Gemini 3 Pro Preview), lower than other frontier models at max reasoning settings such as Opus 4.6 (max) and GPT-5.2 (xhigh). Combined with lower per-token pricing, Gemini 3.1 Pro Preview is cost-efficient among frontier peers, costing less than half as much as Opus 4.6 (max) to run the full Intelligence Index, though still nearly 2x the leading open-weights model, GLM-5. Key Takeaways: ➤ State-of-the-art intelligence at lower costs: Gemini 3.1 Pro Preview is leading 6 of the 10 evaluations that make up the Artificial Analysis Intelligence Index at less than half the cost to run of frontier peers from @OpenAI and @AnthropicAI. It obtains the highest score in Terminal-Bench Hard (agentic coding), AA-Omniscience (knowledge & hallucination), Humanity’s Last Exam (reasoning & knowledge), GPQA-Diamond (scientific reasoning), SciCode (coding) and CritPt (research-level physics). The CritPt score is particularly notable, scoring 18% on unpublished, research-level physics reasoning problems, over 5 p.p. above the next best model ➤ Improved real-world agentic performance, but not leading: Gemini 3.1 Pro Preview shows an improvement in GDPval-AA, our agentic evaluation focusing on real-world tasks, but is still not the leading model in this area. The model increases its ELO score over 100 points to 1316 (up from Gemini 3 Pro Preview), however still sits behind Claude Sonnet 4.6, Opus 4.6, GPT-5.2 (xhigh), and GLM-5 ➤ Leading coding abilities: Gemini 3.1 Pro Preview leads the Artificial Analysis Coding Index, achieving the highest score in both Terminal-Bench Hard (54%) and SciCode (59%) ➤ Reduced hallucinations: Gemini 3.1 Pro Preview shows a major improvement in tendency to guess incorrectly when it doesn’t know the answer, reducing its AA-Omniscience hallucination rate by 38 p.p. from Gemini 3 Pro Preview ➤ Maintained token and cost efficiency: Gemini 3.1 Pro Preview improves without material increases in cost or token usage. It uses only ~2% more tokens to run the Artificial Analysis Intelligence Index than Gemini 3 Pro Preview, and keeps the same pricing ($2/$12 per 1M input/output tokens for ≤200k context). Its cost to run the Artificial Analysis Intelligence Index of $892 is less than half of frontier models such as Opus 4.6 (max) and GPT-5.2 (xhigh), though still ~2x the cost of leading open weights models such as GLM 5 ($547) ➤ Google takes top 3 spots in multi-modality: Gemini 3.1 Pro Preview ranks #1 on MMMU-Pro, our multimodal understanding and reasoning benchmark, ahead of Gemini 3 Pro Preview and Gemini 3 Flash, reinforcing Google’s leadership in multimodal reasoning ➤ Other model details: Gemini 3.1 Pro Preview retains the same 1 million token context window as its predecessor, and includes support for tool calling, structured outputs, and JSON mode

English
0
1
20
1.1K
Tris Warkentin
Tris Warkentin@triswarkentin·
Today, we take another big step towards safe, useful AGI -- Gemini Deep Think 3 sets new state-of-the-art marks for very difficult evals, including ARC-AGI-2 and Humanity's Last Exam. It's just another Thursday. Read more: blog.google/innovation-and…
Tris Warkentin tweet media
English
12
2
14
1.7K
Tris Warkentin
Tris Warkentin@triswarkentin·
@GoogleResearch @ymatias Amazing to see 3 Gemma models on this infographic - VaultGemma, MedGemma, and C2S-Scale! Looking forward to more great research together in 2026 🥳
English
0
0
2
613
Google Research
Google Research@GoogleResearch·
In 2025, the magic cycle of research accelerated. Google Research teams delivered pioneering breakthroughs and brought our research to reality with impact on products, science, and society. More from @ymatias, VP and Head of Google Research →goo.gle/499MMJQ
Google Research tweet media
English
23
127
735
178.9K
Tris Warkentin retweetledi
Jeff Dean
Jeff Dean@JeffDean·
Also, we're exploring new ways to create games using Gemini 3 - we've created and published a few experimental YouTube Playables. Each of the games shown in the video below was created by Gemini 3 from scratch w/a few natural language prompts/images. Try them out at @googledeepmind/playables" target="_blank" rel="nofollow noopener">youtube.com/@googledeepmin… 👾👾
English
81
201
2.1K
774.2K
Tris Warkentin retweetledi
ollama
ollama@ollama·
ollama run gemini-3-pro-preview 🧠 State-of-the-art reasoning 🖼️ Deep multimodal understanding 💻 Powerful vibe coding so you can go from prompt to app in one shot ⭐ Improved agentic capabilities, so it can get things done on your behalf, at your direction Gemini 3 Pro is available on Ollama's Cloud Max plan. We are working on expanding availability across Ollama. Learn more about Ollama's cloud 👇👇👇
ollama tweet mediaollama tweet media
English
22
68
728
191.9K
dAvId
dAvId@R__zon·
@vasuman Props to the team and the member who came up with the name Gemini. Fits so eloquently.
English
1
0
2
1.4K
vas
vas@vasuman·
I lied, there is no Gemini 3.0 Welcome back, Bard
vas tweet media
English
85
61
4.3K
363.5K
Tris Warkentin
Tris Warkentin@triswarkentin·
@ChannelRSL1 @vasuman It was not intended to be the long-term name of the product in the first place -- I came up with it as a project codename... (Though I did love the nerdy Shakespeare reference and even made a Shakespeare logo for the project)
English
1
0
1
366
H
H@Mr_Hy4·
@vasuman Bard was such a bad name 😭 what was google thinking
English
6
0
201
27.7K
Tris Warkentin
Tris Warkentin@triswarkentin·
Ready to learn how to Prompt Like A Pro? Join me for the next Gemini Discord event today, 11/13, at 11:30am PT! I will spotlight real-life use cases to help you take your prompting skills to the next level. Join us on the Gemini Discord! discord.gg/gemini
English
0
0
1
290
Tris Warkentin retweetledi
AI at AMD
AI at AMD@AIatAMD·
"You have an important role to play, no matter who you are, in figuring out how it is that we roll out AI for everyone." Hear more from @triswarkentin, Director of Product Management at Google DeepMind 👇
English
0
1
4
3K
Kevin Weil 🇺🇸
Kevin Weil 🇺🇸@kevinweil·
This is rad. Congrats @sundarpichai and team!
Sundar Pichai@sundarpichai

An exciting milestone for AI in science: Our C2S-Scale 27B foundation model, built with @Yale and based on Gemma, generated a novel hypothesis about cancer cellular behavior, which scientists experimentally validated in living cells.  With more preclinical and clinical tests, this discovery may reveal a promising new pathway for developing therapies to fight cancer.

English
12
29
945
158.9K
Tris Warkentin retweetledi
prinz
prinz@deredleritt3r·
Just to recap: We found out today that an LLM that fits on a high-end consumer GPU, when trained on specific biological data, can discover a novel method to make cancer tumors more responsive to immunotherapy. Confirmed novel discovery (not present in existing literature). Experimentally validated in living cells. This is AI generating novel science. The moment has finally arrived.
prinz@deredleritt3r

Google and Yale scientists have trained an LLM that has generated a novel hypothesis about cancer cellular behavior. This prediction was confirmed multiple times in vitro. - "What made this prediction so exciting was that it was a novel idea. Although CK2 has been implicated in many cellular functions, including as a modulator of the immune system, inhibiting CK2 via silmitasertib has not been reported in the literature to explicitly enhance MHC-I expression or antigen presentation. This highlights that the model was generating a new, testable hypothesis, and not just repeating known facts." The model that generated this prediction is a 27B-parameter LLM based on the Google Gemma open source models, and trained on a corpus comprising >1B tokens of transcriptomic data, biological text, and metadata. Quite remarkable that a small (just 27B) LLM trained on specialized data is able to make novel scientific discoveries. "Teams at Yale are now exploring the mechanism uncovered here and testing additional AI-generated predictions in other immune contexts. With further preclinical and clinical validation, such hypotheses may be able to ultimately accelerate the path to new therapies."

English
130
809
9K
925.8K
Tris Warkentin retweetledi
Sundar Pichai
Sundar Pichai@sundarpichai·
An exciting milestone for AI in science: Our C2S-Scale 27B foundation model, built with @Yale and based on Gemma, generated a novel hypothesis about cancer cellular behavior, which scientists experimentally validated in living cells.  With more preclinical and clinical tests, this discovery may reveal a promising new pathway for developing therapies to fight cancer.
English
545
3.3K
21.9K
6.9M
Tris Warkentin
Tris Warkentin@triswarkentin·
Benchmarking shouldn't be only multiple choice tests -- actual model behavior in resource-constrained environments will revolutionize model optimization. Excited to see these first steps; congratulations @kradleai !
Kradle@kradleai

Hello World, we’re Kradle.ai We eval frontier models by putting them in simulations. So what happens when 6 frontier models compete in #Minecraft for GPUs? Video and 🧵

English
1
0
4
289