Sergi Caelles

237 posts

Sergi Caelles

Sergi Caelles

@skprat

Staff Research Engineer, Video Understanding in Gemini Team, @GoogleDeepMind. Prev: PhD student in #ComputerVision @ETH Zurich

Zurich, Switzerland Katılım Nisan 2010
77 Takip Edilen947 Takipçiler
Sergi Caelles retweetledi
Similarweb
Similarweb@Similarweb·
Gen AI traffic share update Main takeaways: → Gemini holds a quarter of the share. → Claude almost doubled its share between February and March, crossing the 6% mark. → DeepSeek surpassed Grok again. → We’ve added m365.cloud.microsoft/chat to Copilot’s share, which explains the increase from previous updates. 🗓️ 12 months ago: ChatGPT: 77.43% Grok: 7.03% Gemini: 6.00% DeepSeek: 3.73% Perplexity: 1.66% Claude: 1.40% Copilot: 1.38% 🗓️ 6 months ago: ChatGPT: 71.75% Gemini: 13.56% Grok: 4.08% DeepSeek: 2.81% Perplexity: 2.51% Claude: 2.26% Copilot: 2.01% 🗓️ 3 months ago: ChatGPT: 63.19% Gemini: 22.59% DeepSeek: 4.08% Grok: 3.26% Claude: 2.22% Perplexity: 1.86% Copilot: 1.80% 🗓️ 1 month ago: ChatGPT: 56.72% Gemini: 25.46% Claude: 6.02% DeepSeek: 3.74% Grok: 3.44% Copilot: 1.99% Perplexity: 1.64%
Similarweb tweet media
English
34
119
575
286K
Sergi Caelles retweetledi
Similarweb
Similarweb@Similarweb·
Our GenAI traffic share update is back. This chart tracks the monthly traffic share of leading AI tools worldwide. Main takeaways: → As of February, Grok and Claude surpassed DeepSeek, taking 3rd and 4th place respectively. → Claude crossed the 3% mark for the first time in February. → Gemini is approaching a quarter of the total share. 🗓️ 12 months ago: ChatGPT: 75.7% DeepSeek: 8.5% Gemini: 5.7% Grok: 3.4% Perplexity: 2.1% Claude: 1.7% Copilot: 1.3% 🗓️ 6 months ago: ChatGPT: 74.0% Gemini: 13.3% DeepSeek: 4.2% Grok: 2.2% Perplexity: 2.1% Claude: 2.0% Copilot: 1.2% 🗓️ 3 months ago: ChatGPT: 65.8% Gemini: 20.7% DeepSeek: 3.9% Grok: 3.2% Perplexity: 2.1% Claude: 2.1% Copilot: 1.2% 🗓️ 1 month ago: ChatGPT: 61.7% Gemini: 24.4% Grok: 3.4% Claude: 3.3% DeepSeek: 3.2% Perplexity: 1.8% Copilot: 1.1% >>
Similarweb tweet media
English
33
76
403
95.8K
Sergi Caelles retweetledi
Demis Hassabis
Demis Hassabis@demishassabis·
Excited to launch Gemini 3.1 Pro! Major improvements across the board including in core reasoning and problem solving. For example scoring 77.1% on the ARC-AGI-2 benchmark - more than 2x the performance of 3 Pro. Rolling out today in @GeminiApp, @antigravity and more - enjoy!
Demis Hassabis tweet media
English
246
418
4.9K
247.5K
Sergi Caelles retweetledi
Sergi Caelles retweetledi
Logan Kilpatrick
Logan Kilpatrick@OfficialLoganK·
Gemini 3 Flash on the @ArtificialAnlys intelligence benchmark, the most cost per intelligence efficient model in the world!!!
Logan Kilpatrick tweet media
English
66
102
1.3K
96.4K
Sergi Caelles retweetledi
Google DeepMind
Google DeepMind@GoogleDeepMind·
Gemini 3 Flash gives you frontier intelligence at a fraction of the cost. ⚡ Here’s how it’s built for speed and scale 🧵
GIF
English
113
260
1.9K
505.9K
Sergi Caelles retweetledi
Wes Roth
Wes Roth@WesRoth·
Demis Hassabis says the most ignored marvel is AI’s ability to understand video, images, and audio together. Gemini can watch a movie scene and explain the symbolism behind a tiny gesture. This shows the model grasps concepts, not just pixels or words. Such deep cross-media reasoning is still under-appreciated outside AI circles.
English
40
107
899
53.8K
Sergi Caelles retweetledi
Google AI Developers
Google AI Developers@googleaidevs·
Gemini 3 Pro is the frontier of multimodal AI, delivering SOTA performance across document, screen, spatial, and video understanding. Read our deep dive on how we’ve pushed our core capabilities to power hero use cases across: + Docs: "derender" complex docs into structured code (HTML/LaTeX) + Screen: build robust computer agents that automate complex tasks + Spatial: generate collision-free trajectories for robotics & XR + Video: analyze sports footage using high-FPS processing with "thinking" mode See how these capabilities are transforming workflows in education, biomedical, and law/finance → goo.gle/3Mt3UlT
Google AI Developers tweet media
English
45
137
1.1K
329.8K
Sergi Caelles retweetledi
Chubby♨️
Chubby♨️@kimmonismus·
Google cooked so hard. Not gonna lie, this feels like the future is here. Now develop Google Glasses with enough battery power, a good chip, and a look like Ray-Bans, and you'll have an instant hit. 100%.
English
480
2.1K
17.4K
3.1M
Sergi Caelles retweetledi
Oriol Vinyals
Oriol Vinyals@OriolVinyalsML·
The secret behind Gemini 3? Simple: Improving pre-training & post-training 🤯 Pre-training: Contra the popular belief that scaling is over—which we discussed in our NeurIPS '25 talk with @ilyasut and @quocleix—the team delivered a drastic jump. The delta between 2.5 and 3.0 is as big as we've ever seen. No walls in sight! Post-training: Still a total greenfield. There's lots of room for algorithmic progress and improvement, and 3.0 hasn't been an exception, thanks to our stellar team. Congratulations to the whole team 💙💙💙
Oriol Vinyals tweet media
English
119
549
4.4K
2M
Sergi Caelles retweetledi
Google DeepMind
Google DeepMind@GoogleDeepMind·
Our first release is Gemini 3 Pro, which is rolling out globally starting today. It significantly outperforms 2.5 Pro across the board: 🥇 Tops LMArena and WebDev @arena leaderboards 🧠 PhD-level reasoning on Humanity’s Last Exam 📋 Leads long-horizon planning on Vending-Bench 2
Google DeepMind tweet media
English
17
108
916
267.9K
Sergi Caelles retweetledi
Arena.ai
Arena.ai@arena·
👨‍💻Have you tested models in the new Code Arena yet? In this thread, we’re showcasing real Gemini 3 Pro by @GoogleDeepMind creations and the exact prompts used, all of them built inside the Code Arena. You can directly compare Gemini’s outputs against other top frontier models on real web development tasks. Build, compare, vote, and share your own creations directly from the chat. See examples in the thread. 🧵
Arena.ai@arena

🚨BREAKING: @GoogleDeepMind’s Gemini-3-Pro is now #1 across all major Arena leaderboards 🥇#1 in Text, Vision, and WebDev - surpassing Grok-4.1, Claude-4.5, and GPT-5 🥇#1 in Coding, Math, Creative Writing, Long Queries, and nearly all occupational leaderboards. Massive gains over Gemini-2.5: 🔸WebDev in Code Arena: 1487 (+280 pts vs 2.5) 🔸Text: 1501 (+50 pts) 🔸Vision: 1328 (+70 pts) 🔸Arena Expert: Top-3 (just 3 pts behind #1) Huge congrats to the @GoogleDeepMind team on this breakthrough! 👏

English
3
24
232
25.3K
Sergi Caelles retweetledi
Jeff Dean
Jeff Dean@JeffDean·
Exciting expansion! @Waymo now serves the whole SF Bay Area Peninsula from SF to San Jose and is taking riders on freeways. waymo.com/blog/2025/11/t…
Jeff Dean tweet media
English
435
647
11.4K
7.2M
Sergi Caelles retweetledi
Google DeepMind
Google DeepMind@GoogleDeepMind·
Image generation with Gemini just got a bananas upgrade and is the new state-of-the-art image generation and editing model. 🤯 From photorealistic masterpieces to mind-bending fantasy worlds, you can now natively produce, edit and refine visuals with new levels of reasoning, control and creativity. A quick dive into Gemini 2.5 Flash’s capabilities 🧵
English
176
530
3K
1.5M
Sergi Caelles retweetledi
Demis Hassabis
Demis Hassabis@demishassabis·
One word: relentless. just in the past two weeks, we’ve shipped: 🌐 Genie 3 - the most advanced world simulator ever 🤔 Gemini 2.5 Pro Deep Think available to Ultra subs 🎓 Gemini Pro free for uni students & $1B for US ed 🌍 AlphaEarth - a geospatial model of the entire planet 🏛️ Aeneas - deciphering ancient text (in @Nature) 🥇 Gemini gold-medal level at the IMO 🧸 Storybook - books w/art & audio @GeminiApp ♛ New @Kaggle Game Arena benchmark for LLMs 🐙 Jules, our asynchronous coding agent, out of Beta 🇬🇧 AI Mode for Search available in the UK 📔 NotebookLM Video Overviews 🔥 Gemma passed 200m downloads Now you know why I don't get much sleep 🛌 - too busy pushing the frontier!
English
484
929
9.6K
1.1M
Sergi Caelles retweetledi
Ani Baddepudi
Ani Baddepudi@AniBaddepudi·
gemini's still the only frontier model that supports native video input (and is amazing at it!) incredible amount of real-world utility given how much of the world's information is increasingly in video
Ani Baddepudi tweet media
English
22
15
297
20.8K
Sergi Caelles retweetledi
Andi Marafioti
Andi Marafioti@andimarafioti·
The results are in, and they're revealing. Only Gemini 2.5 pro handles 1-hour-long videos. Performance drops sharply with duration, proving that long video understanding is still challenging. We've found the breaking points—now the community can start fixing them.📈
Andi Marafioti tweet media
English
4
2
32
3.3K
Sergi Caelles retweetledi
Demis Hassabis
Demis Hassabis@demishassabis·
Official results are in - Gemini achieved gold-medal level in the International Mathematical Olympiad! 🏆 An advanced version was able to solve 5 out of 6 problems. Incredible progress - huge congrats to @lmthang and the team! deepmind.google/discover/blog/…
English
193
738
6.3K
1.5M
Sergi Caelles retweetledi
AK
AK@_akhaliq·
Gemini 2.5 Pushing the Frontier with Advanced Reasoning, Multimodality, Long Context, and Next Generation Agentic Capabilities
AK tweet media
English
5
27
138
19.1K