Mario Lucic

496 posts

Mario Lucic

@MarioLucic_

Pushing the frontier of multimodal intelligence in Gemini. Director, Research Scientist at https://t.co/osxwuzykCr.

Zurich, Switzerland Katılım Aralık 2018

253 Takip Edilen3K Takipçiler

Sabitlenmiş Tweet

Mario Lucic@MarioLucic_·9 May

Massive advancements in video understanding with Gemini 2.5! ✨ Unlock new capabilities to process hours of video, summarize and retrieve key moments, generate animations, and even combine video with code for interactive experiences. Check the 🧵below for some cool use-cases.

Antoine Yang@AntoineYang2

Thrilled to share our latest advances in video understanding 📽️: Gemini 2.5 Pro is a truly magical model to play with, excelling in traditional video analysis and unlocking new use cases I could not imagine a few months ago🪄 More in 🧵 and @Google blog: developers.googleblog.com/en/gemini-2-5-…

English

2.9K

Mario Lucic@MarioLucic_·31 Mar

@TimSalimans Enjoy Tim!

English

326

Tim Salimans@TimSalimans·31 Mar

Touching down now in SF for onboarding at Anthropic! After 7 great years at Google, I'm excited to take on a new challenge and help make Claude even better. Grateful for everything I learned at Google DeepMind and Brain, looking forward to what's next.

English

1.2K

255.4K

Mario Lucic@MarioLucic_·5 Mar

@obousquet @MistralAI Happy to see you are not out of the game! All the best! 🚀

English

480

Olivier Bousquet@obousquet·5 Mar

I am excited to share that I have started a new adventure at @MistralAI, a leading frontier lab, where I am working on pushing further the agentic reasoning capabilities of LLMs.

English

718

74.3K

Ian Goodfellow@goodfellow_ian·23 Şub

I'd like to thank @daniel_rossett for his help in my recovery from the POTS version of Long COVID. Daniel was key in bringing me back from highly disabled and suffering to being able to do what I want to again. This X account is mostly focused on ML / AI. From that point of view, many of you know that in December 2024, I wasn't able to do the test of time award talk at NeurIPS, even by video call. Daniel started working with me in March 2025. By April, I started to have days of no POTS symptoms, by June I was off all heart rate lowering medications, by September I was back to work. I'm back to full exercise, running, lifting weights, mountain biking, and have even done things I hadn't done before I got sick, like riding Whistler Mountain Bike Park. I'm now getting the word out to help Daniel build a company that will bring this approach to more people.

English

170

2.7K

202.5K

Mario Lucic@MarioLucic_·24 Şub

@goodfellow_ian @daniel_rossett Excellent news Ian!! 🥳

English

135

Mario Lucic retweetledi

Rohan Doshi@RohanLikesAI·27 Oca

🚀 Excited to officially launch 👁Agentic Vision via Gemini 3 Flash. Gemini can run code execution on image uploads to zoom, analyze, and annotate: 🔍 Zoom: 5-10% quality win across vision benchmarks 🧮 Analyze: do image math with code (e.g. calculate the tip for a receipt) ✏️ Annotate: Draw arrows or bounding boxes to answer questions Try via the Gemini API (AI Studio / Vertex) or via the Gemini App (rolling out to Thinking mode today). Learn more→ goo.gle/4bsKdFv Demo: goo.gle/3Z05KxK cc: @IoanaBica95 @anastasija56572 @jalayrac @bcaine @eisenjulian @weichengkuo @phillip_lippe @xf1280 @tulseedoshi @BiboXu @OfficialLoganK

Google AI Developers@googleaidevs

Try 👁 Agentic Vision with Gemini 3 Flash in @GoogleAIStudio or Vertex AI. This new capability enables the model to effectively use code and reasoning to improve performance for common vision tasks. See Agentic Vision in action: goo.gle/3Z05KxK

English

237

29.3K

Mario Lucic retweetledi

Google AI Developers@googleaidevs·27 Oca

English

115

862

170K

Mario Lucic retweetledi

Fei Xia@xf1280·18 Ara

🚀Excited to share that #Gemini 3 Flash can do code execution on images to zoom, count, and annotate visual inputs! The model can choose when to write code to: 🔍 Zoom & Inspect: Detect when details are too small and zoom-in. 🧮 Compute Visually: Run multi-step calculations using code (e.g., summing line items on a receipt). ✏️ Annotate: Draw arrows or bounding boxes to answer questions or show relationships between objects.

English

18.8K

Mario Lucic retweetledi

Rohan Doshi@RohanLikesAI·17 Ara

🚀🚀🚀 Gemini 3 Flash is live⚡️⚡️⚡️ 🧠 Gemini 3 Pro-level intelligence, but 📉 4x Cheaper and 🏎️ 3x Faster. We pushed the Pareto frontier, unlocking a new generation of multimodal agents. 🤯 We’re already seeing @figma accelerate rapid prototyping, @Harveyaisol master complex legal reasoning, and @Bridgewater unlock investment insights from massive unstructured datasets. 🤖Try now at gemini.google.com or start building at aistudio.google.com. Read more at lnkd.in/gsPv542a 👏🏼Huge shoutout to the team! @jalayrac @RSoricut @bcaine @xf1280 @IoanaBica95 @MarioLucic_ @BiboXu @tulseedoshi @OfficialLoganK @joshwoodward @demishassabis

English

119

5.2K

Mario Lucic retweetledi

Rohan Doshi@RohanLikesAI·5 Ara

🚀 Just released a deep dive on how Gemini 3 Pro is pushing the frontier of multimodal AI. Dig into new capabilities across doc, spatial, screen, & video understanding. And learn about new use cases across 💡 education, 🤖 robotics, 🩻 biomedical, & 📄 legal/finance → goo.gle/3Mt3UlT As always, I'm grateful to work with the best: @jalayrac @RSoricut @bcaine @xf1280 @MarioLucic_ @BiboXu @tulseedoshi @OfficialLoganK @joshwoodward @JeffDean @OriolVinyalsML @demishassabis

Google AI Developers@googleaidevs

Gemini 3 Pro is the frontier of multimodal AI, delivering SOTA performance across document, screen, spatial, and video understanding. Read our deep dive on how we’ve pushed our core capabilities to power hero use cases across: + Docs: "derender" complex docs into structured code (HTML/LaTeX) + Screen: build robust computer agents that automate complex tasks + Spatial: generate collision-free trajectories for robotics & XR + Video: analyze sports footage using high-FPS processing with "thinking" mode See how these capabilities are transforming workflows in education, biomedical, and law/finance → goo.gle/3Mt3UlT

English

Mario Lucic retweetledi

Google AI Developers@googleaidevs·5 Ara

English

133

1.1K

328.6K

Mario Lucic retweetledi

Chubby♨️@kimmonismus·2 Ara

Google cooked so hard. Not gonna lie, this feels like the future is here. Now develop Google Glasses with enough battery power, a good chip, and a look like Ray-Bans, and you'll have an instant hit. 100%.

English

481

2.1K

17.4K

3.1M

Mario Lucic retweetledi

Mostafa Dehghani@m__dehghani·20 Kas

Thinking (test-time compute) in pixel space... 🍌 Pro tip: always peek at the thoughts if you use AI Studio. Watching the model think in pictures is really fun!

English

696

135.8K

Mario Lucic retweetledi

Google DeepMind@GoogleDeepMind·18 Kas

Our first release is Gemini 3 Pro, which is rolling out globally starting today. It significantly outperforms 2.5 Pro across the board: 🥇 Tops LMArena and WebDev @arena leaderboards 🧠 PhD-level reasoning on Humanity’s Last Exam 📋 Leads long-horizon planning on Vending-Bench 2

English

108

914

266.1K

Mario Lucic retweetledi

Rohan Doshi@RohanLikesAI·18 Kas

🚀 We just launched Gemini 3 Pro — the strongest multimodal understanding model ever built. I lead product for Gemini’s multimodal vision capabilities, and I want to share more about the massive wins we are seeing across document, screen, spatial, and video understanding. 🧵

English

9.4K

Mario Lucic retweetledi

Logan Kilpatrick@OfficialLoganK·23 Eyl

Introducing our latest Gemini Live model 🔊, built on all the things you love about Gemini, with significantly improved function calling and more natural feeling / sounding conversations (thanks to native audio)! Try out the new model at ai.studio/live

English

105

144

1.9K

205.8K

Mario Lucic@MarioLucic_·21 Eyl

@dustinvtran @NoamShazeer @ashVaswani @lukaszkaiser Good luck Dustin 🫡

English

198

Dustin Tran@dustinvtran·20 Eyl

I departed Google DeepMind after 8 years. So many fond memories—from early foundational papers in Google Brain (w/ @noamshazeer @ashvaswani @lukaszkaiser on Image Transformer, Tensor2Tensor, Mesh TensorFlow) to lead Gemini posttraining evals to catch up & launch in 100 days, then leading the team to leapfrog to LMArena #1 (and stay there for over a year!), and finally working on the incredible reasoning innovations for Gemini’s IMO & ICPC gold medals (w/ @HengTze @quocleix). Gemini has been a wild journey from one paradigm to another: first, revamping our LaMDA model (the first instruction-like chatbot!) from an actual chatbot to long contentful responses with RLHF; then, reasoning and deep thinking by training over long thinking chains, novel environments, and reward heads. When we first started, public sentiment was bad. Everyone thought Google was doomed to fail due to its search legacy and organizational politics. Now, Gemini is consistently #1 in user preference and spearheading new scientific accomplishments, and everyone thinks Google winning is obvious. 😂 (It also used to be the case that OpenAI would jump the AI newscycle by announcing before us from a backlog of ideas for every new Google release; safe to say that backlog is empty.) I have since joined xAI. The recipe is well-known. Compute, data, and O(100) brilliant, hard-working people are all that’s needed to obtain a frontier-level LLM. xAI *really* believes in this. For compute, even at Google I have never experienced this # of chips per capita (& 100K+ GB200/300K’s are incoming with Colossus 2). For data, Grok 4 made the biggest bet in scaling RL & posttraining. xAI is making new bets to scale data, deep thinking, and the training recipe. And the team is quick. No company has gotten to where xAI is today in AI capabilities in as little as time. As @elonmusk says, a company’s first- and second-order derivatives are the most important: xAI’s acceleration is the highest. I’m excited to announce that in my first few weeks, we launched Grok 4 Fast. Grok 4 is an amazing reasoning model, still the top on ARC-AGI and new benchmarks like FinSearchComp. But it’s slow and was never really targeted for general-purpose user needs. Grok 4 Fast is the best mini-class model—on LMArena, it is #8 (Gemini 2.5 Flash is #18!), and on core reasoning evals like AIME, it is on par with Grok 4 while 15x cheaper. S/o to @LiTianleli @jinyilll @adityagupta @s_tworkowski @keirp1 @yuhu_ai_

English

386

502

7.9K

13.2M

Mario Lucic@MarioLucic_·9 Eyl

@mbalunovic @GoogleDeepMind Good stuff, welcome 🎉🎉

English

112

Mislav Balunović@mbalunovic·8 Eyl

Life update: Excited to share that I’ve recently moved to London and started at @GoogleDeepMind as Research Scientist!

English

706

67.4K

Mario Lucic retweetledi

Josh Woodward@joshwoodward·6 Ağu

Suddenly, college life just changed… Here's how to get the FREE PRO VERSIONS of @GeminiApp, @NotebookLM, and more if you’re a university student in the US, Japan, Korea, Indonesia, or Brazil ⬇️

English

103

714

103K

Mario Lucic retweetledi

Roberta Raileanu@robertarail·24 Tem

I’m building a new team at @GoogleDeepMind to work on Open-Ended Discovery! We’re looking for strong Research Scientists and Research Engineers to help us push the frontier of autonomously discovering novel artifacts such as new knowledge, capabilities, or algorithms, in an open-ended self-improving loop. We aim to work on ambitious research projects in a fast-paced manner. If this sounds appealing to you, apply using the link below by Friday, August 1st EOD: job-boards.greenhouse.io/deepmind/jobs/…

English

256

2.5K

345K

Mario Lucic retweetledi

Demis Hassabis@demishassabis·21 Tem

Official results are in - Gemini achieved gold-medal level in the International Mathematical Olympiad! 🏆 An advanced version was able to solve 5 out of 6 problems. Incredible progress - huge congrats to @lmthang and the team! deepmind.google/discover/blog/…

English

194

736

6.3K

1.5M

Keşfet

@TimSalimans @obousquet @MistralAI @daniel_rossett @goodfellow_ian @IoanaBica95 @anastasija56572 @jalayrac