Mario Lucic

496 posts

Mario Lucic banner
Mario Lucic

Mario Lucic

@MarioLucic_

Pushing the frontier of multimodal intelligence in Gemini. Director, Research Scientist at https://t.co/osxwuzykCr.

Zurich, Switzerland Katılım Aralık 2018
253 Takip Edilen3K Takipçiler
Sabitlenmiş Tweet
Mario Lucic
Mario Lucic@MarioLucic_·
Massive advancements in video understanding with Gemini 2.5! ✨ Unlock new capabilities to process hours of video, summarize and retrieve key moments, generate animations, and even combine video with code for interactive experiences. Check the 🧵below for some cool use-cases.
Antoine Yang@AntoineYang2

Thrilled to share our latest advances in video understanding 📽️: Gemini 2.5 Pro is a truly magical model to play with, excelling in traditional video analysis and unlocking new use cases I could not imagine a few months ago🪄 More in 🧵 and @Google blog: developers.googleblog.com/en/gemini-2-5-…

English
2
2
28
2.9K
Tim Salimans
Tim Salimans@TimSalimans·
Touching down now in SF for onboarding at Anthropic! After 7 great years at Google, I'm excited to take on a new challenge and help make Claude even better. Grateful for everything I learned at Google DeepMind and Brain, looking forward to what's next.
English
65
18
1.2K
255.4K
Olivier Bousquet
Olivier Bousquet@obousquet·
I am excited to share that I have started a new adventure at @MistralAI, a leading frontier lab, where I am working on pushing further the agentic reasoning capabilities of LLMs.
English
31
22
718
74.3K
Ian Goodfellow
Ian Goodfellow@goodfellow_ian·
I'd like to thank @daniel_rossett for his help in my recovery from the POTS version of Long COVID. Daniel was key in bringing me back from highly disabled and suffering to being able to do what I want to again. This X account is mostly focused on ML / AI. From that point of view, many of you know that in December 2024, I wasn't able to do the test of time award talk at NeurIPS, even by video call. Daniel started working with me in March 2025. By April, I started to have days of no POTS symptoms, by June I was off all heart rate lowering medications, by September I was back to work. I'm back to full exercise, running, lifting weights, mountain biking, and have even done things I hadn't done before I got sick, like riding Whistler Mountain Bike Park. I'm now getting the word out to help Daniel build a company that will bring this approach to more people.
English
170
83
2.7K
202.5K
Mario Lucic retweetledi
Rohan Doshi
Rohan Doshi@RohanLikesAI·
🚀 Excited to officially launch 👁Agentic Vision via Gemini 3 Flash. Gemini can run code execution on image uploads to zoom, analyze, and annotate: 🔍 Zoom: 5-10% quality win across vision benchmarks 🧮 Analyze: do image math with code (e.g. calculate the tip for a receipt) ✏️ Annotate: Draw arrows or bounding boxes to answer questions Try via the Gemini API (AI Studio / Vertex) or via the Gemini App (rolling out to Thinking mode today). Learn more→ goo.gle/4bsKdFv Demo: goo.gle/3Z05KxK cc: @IoanaBica95 @anastasija56572 @jalayrac @bcaine @eisenjulian @weichengkuo @phillip_lippe @xf1280 @tulseedoshi @BiboXu @OfficialLoganK
Google AI Developers@googleaidevs

Try 👁 Agentic Vision with Gemini 3 Flash in @GoogleAIStudio or Vertex AI. This new capability enables the model to effectively use code and reasoning to improve performance for common vision tasks. See Agentic Vision in action: goo.gle/3Z05KxK

English
8
27
237
29.3K
Mario Lucic retweetledi
Google AI Developers
Google AI Developers@googleaidevs·
Try 👁 Agentic Vision with Gemini 3 Flash in @GoogleAIStudio or Vertex AI. This new capability enables the model to effectively use code and reasoning to improve performance for common vision tasks. See Agentic Vision in action: goo.gle/3Z05KxK
English
24
115
862
170K
Mario Lucic retweetledi
Fei Xia
Fei Xia@xf1280·
🚀Excited to share that #Gemini 3 Flash can do code execution on images to zoom, count, and annotate visual inputs! The model can choose when to write code to: 🔍 Zoom & Inspect: Detect when details are too small and zoom-in. 🧮 Compute Visually: Run multi-step calculations using code (e.g., summing line items on a receipt). ✏️ Annotate: Draw arrows or bounding boxes to answer questions or show relationships between objects.
English
6
16
89
18.8K
Mario Lucic retweetledi
Rohan Doshi
Rohan Doshi@RohanLikesAI·
🚀🚀🚀 Gemini 3 Flash is live⚡️⚡️⚡️ 🧠 Gemini 3 Pro-level intelligence, but 📉 4x Cheaper and 🏎️ 3x Faster. We pushed the Pareto frontier, unlocking a new generation of multimodal agents. 🤯 We’re already seeing @figma accelerate rapid prototyping, @Harveyaisol master complex legal reasoning, and @Bridgewater unlock investment insights from massive unstructured datasets. 🤖Try now at gemini.google.com or start building at aistudio.google.com. Read more at lnkd.in/gsPv542a 👏🏼Huge shoutout to the team! @jalayrac @RSoricut @bcaine @xf1280 @IoanaBica95 @MarioLucic_ @BiboXu @tulseedoshi @OfficialLoganK @joshwoodward @demishassabis
Rohan Doshi tweet media
English
1
15
119
5.2K
Mario Lucic retweetledi
Mario Lucic retweetledi
Google AI Developers
Google AI Developers@googleaidevs·
Gemini 3 Pro is the frontier of multimodal AI, delivering SOTA performance across document, screen, spatial, and video understanding. Read our deep dive on how we’ve pushed our core capabilities to power hero use cases across: + Docs: "derender" complex docs into structured code (HTML/LaTeX) + Screen: build robust computer agents that automate complex tasks + Spatial: generate collision-free trajectories for robotics & XR + Video: analyze sports footage using high-FPS processing with "thinking" mode See how these capabilities are transforming workflows in education, biomedical, and law/finance → goo.gle/3Mt3UlT
Google AI Developers tweet media
English
45
133
1.1K
328.6K
Mario Lucic retweetledi
Chubby♨️
Chubby♨️@kimmonismus·
Google cooked so hard. Not gonna lie, this feels like the future is here. Now develop Google Glasses with enough battery power, a good chip, and a look like Ray-Bans, and you'll have an instant hit. 100%.
English
481
2.1K
17.4K
3.1M
Mario Lucic retweetledi
Mostafa Dehghani
Mostafa Dehghani@m__dehghani·
Thinking (test-time compute) in pixel space... 🍌 Pro tip: always peek at the thoughts if you use AI Studio. Watching the model think in pictures is really fun!
Mostafa Dehghani tweet mediaMostafa Dehghani tweet mediaMostafa Dehghani tweet media
English
21
77
696
135.8K
Mario Lucic retweetledi
Google DeepMind
Google DeepMind@GoogleDeepMind·
Our first release is Gemini 3 Pro, which is rolling out globally starting today. It significantly outperforms 2.5 Pro across the board: 🥇 Tops LMArena and WebDev @arena leaderboards 🧠 PhD-level reasoning on Humanity’s Last Exam 📋 Leads long-horizon planning on Vending-Bench 2
Google DeepMind tweet media
English
17
108
914
266.1K
Mario Lucic retweetledi
Rohan Doshi
Rohan Doshi@RohanLikesAI·
🚀 We just launched Gemini 3 Pro — the strongest multimodal understanding model ever built. I lead product for Gemini’s multimodal vision capabilities, and I want to share more about the massive wins we are seeing across document, screen, spatial, and video understanding. 🧵
English
8
6
46
9.4K
Mario Lucic retweetledi
Logan Kilpatrick
Logan Kilpatrick@OfficialLoganK·
Introducing our latest Gemini Live model 🔊, built on all the things you love about Gemini, with significantly improved function calling and more natural feeling / sounding conversations (thanks to native audio)! Try out the new model at ai.studio/live
English
105
144
1.9K
205.8K
Dustin Tran
Dustin Tran@dustinvtran·
I departed Google DeepMind after 8 years. So many fond memories—from early foundational papers in Google Brain (w/ @noamshazeer @ashvaswani @lukaszkaiser on Image Transformer, Tensor2Tensor, Mesh TensorFlow) to lead Gemini posttraining evals to catch up & launch in 100 days, then leading the team to leapfrog to LMArena #1 (and stay there for over a year!), and finally working on the incredible reasoning innovations for Gemini’s IMO & ICPC gold medals (w/ @HengTze @quocleix). Gemini has been a wild journey from one paradigm to another: first, revamping our LaMDA model (the first instruction-like chatbot!) from an actual chatbot to long contentful responses with RLHF; then, reasoning and deep thinking by training over long thinking chains, novel environments, and reward heads. When we first started, public sentiment was bad. Everyone thought Google was doomed to fail due to its search legacy and organizational politics. Now, Gemini is consistently #1 in user preference and spearheading new scientific accomplishments, and everyone thinks Google winning is obvious. 😂 (It also used to be the case that OpenAI would jump the AI newscycle by announcing before us from a backlog of ideas for every new Google release; safe to say that backlog is empty.) I have since joined xAI. The recipe is well-known. Compute, data, and O(100) brilliant, hard-working people are all that’s needed to obtain a frontier-level LLM. xAI *really* believes in this. For compute, even at Google I have never experienced this # of chips per capita (& 100K+ GB200/300K’s are incoming with Colossus 2). For data, Grok 4 made the biggest bet in scaling RL & posttraining. xAI is making new bets to scale data, deep thinking, and the training recipe. And the team is quick. No company has gotten to where xAI is today in AI capabilities in as little as time. As @elonmusk says, a company’s first- and second-order derivatives are the most important: xAI’s acceleration is the highest. I’m excited to announce that in my first few weeks, we launched Grok 4 Fast. Grok 4 is an amazing reasoning model, still the top on ARC-AGI and new benchmarks like FinSearchComp. But it’s slow and was never really targeted for general-purpose user needs. Grok 4 Fast is the best mini-class model—on LMArena, it is #8 (Gemini 2.5 Flash is #18!), and on core reasoning evals like AIME, it is on par with Grok 4 while 15x cheaper. S/o to @LiTianleli @jinyilll @adityagupta @s_tworkowski @keirp1 @yuhu_ai_
English
386
502
7.9K
13.2M
Mislav Balunović
Mislav Balunović@mbalunovic·
Life update: Excited to share that I’ve recently moved to London and started at @GoogleDeepMind as Research Scientist!
English
22
7
706
67.4K
Mario Lucic retweetledi
Josh Woodward
Josh Woodward@joshwoodward·
Suddenly, college life just changed… Here's how to get the FREE PRO VERSIONS of @GeminiApp, @NotebookLM, and more if you’re a university student in the US, Japan, Korea, Indonesia, or Brazil ⬇️
English
55
103
714
103K
Mario Lucic retweetledi
Roberta Raileanu
Roberta Raileanu@robertarail·
I’m building a new team at @GoogleDeepMind to work on Open-Ended Discovery! We’re looking for strong Research Scientists and Research Engineers to help us push the frontier of autonomously discovering novel artifacts such as new knowledge, capabilities, or algorithms, in an open-ended self-improving loop. We aim to work on ambitious research projects in a fast-paced manner. If this sounds appealing to you, apply using the link below by Friday, August 1st EOD: job-boards.greenhouse.io/deepmind/jobs/…
English
90
256
2.5K
345K
Mario Lucic retweetledi
Demis Hassabis
Demis Hassabis@demishassabis·
Official results are in - Gemini achieved gold-medal level in the International Mathematical Olympiad! 🏆 An advanced version was able to solve 5 out of 6 problems. Incredible progress - huge congrats to @lmthang and the team! deepmind.google/discover/blog/…
English
194
736
6.3K
1.5M