Steven Hansen

2.8K posts

Steven Hansen banner
Steven Hansen

Steven Hansen

@Zergylord

Gemini Agents lead at Google DeepMind.

London, England Katılım Haziran 2009
629 Takip Edilen3.4K Takipçiler
Sabitlenmiş Tweet
Steven Hansen
Steven Hansen@Zergylord·
You can just do things better
Oriol Vinyals@OriolVinyalsML

The secret behind Gemini 3? Simple: Improving pre-training & post-training 🤯 Pre-training: Contra the popular belief that scaling is over—which we discussed in our NeurIPS '25 talk with @ilyasut and @quocleix—the team delivered a drastic jump. The delta between 2.5 and 3.0 is as big as we've ever seen. No walls in sight! Post-training: Still a total greenfield. There's lots of room for algorithmic progress and improvement, and 3.0 hasn't been an exception, thanks to our stellar team. Congratulations to the whole team 💙💙💙

English
0
0
5
1.3K
Steven Hansen
Steven Hansen@Zergylord·
@cohere What does "sovereign AI for the World" mean? Isn't that just AI?
English
0
0
0
130
Cohere
Cohere@cohere·
🚀 Sovereign AI for the world. Cohere & Aleph Alpha form transatlantic AI powerhouse anchored in Canada & Germany! Combining our global scale with European R&D excellence to build sovereign, enterprise-grade AI. Security, privacy & trust for businesses & governments worldwide. #SovereignAI #AIPartnership Learn more: businesswire.com/news/home/2026… Image from left to right: Rolf Schumann, Schwarz Digits, Samuel Weinbach, Aleph Alpha, Aidan Gomez, Cohere, Minister Solomon, Canada, Minister Wildberger, Germany
Cohere tweet media
English
8
24
183
108.1K
Dr Alexander D. Kalian
Dr Alexander D. Kalian@AlexanderKalian·
We trained LLMs to operate at near-human competency, via exposure to an *entire internet* of human discussions, books, research etc. So how does anyone plan to train an "AI superintelligence"? Where can we find an entire internet of superintelligent outputs, for training data?
English
107
11
180
13.9K
Steven Hansen
Steven Hansen@Zergylord·
@xriskology Why do they need your permission? Are you suggesting we have a vote before anyone can create new technology?
English
0
0
2
244
Nando de Freitas
Nando de Freitas@NandoDF·
I agree, but that is not the same as saying saying that there is a UK owned company, with leadership in the UK, building Sota LLMs. I also work on LLMs in London, but that too does not mean the UK has a sovereign AI programme. It is wonderful that London has so many talented people working with great American companies, but that’s again not the same as the UK being in the AI race as a significant player.
English
1
0
4
202
Nando de Freitas
Nando de Freitas@NandoDF·
It is ghastly how the UK 🇬🇧 has failed at having its own LLM companies. By doing so it has become irrelevant in the AI race. How do we fix this? @nickclegg @UKParliament France has @MistralAI. Canada has @cohere. Every other LLM AI company is 🇺🇸/🇨🇳
Sam@Discoplomacy

Feel like very few serious people make the argument Britain should develop its own LLM? Certainly not an argument you see made in Westminster often during the sovereignty debate. Also some other eyebrow-raising sections in this interview.

English
23
4
59
26.3K
Sakana AI
Sakana AI@SakanaAILabs·
We’re launching the beta for our new commercial AI product: Sakana Fugu 🐡, a multi-agent orchestration system! Blog: sakana.ai/fugu-beta Fugu hits SOTA on SWE-Pro, GPQA-D, and ALE-Bench, and has been our internal secret weapon. It dynamically coordinates frontier models, autonomously selecting the optimal agent combinations and roles for each task. Available as an OpenAI-compatible API, you can seamlessly integrate Fugu into your existing workflows with minimal changes. 🐟 Fugu Mini: High-speed orchestration optimized for latency 🐡 Fugu Ultra: Full model pool utilization for deep, complex reasoning Apply for the beta test here: forms.gle/BtKkhc2CfLKk1d…
Sakana AI tweet media
English
17
145
608
296.6K
Steven Hansen
Steven Hansen@Zergylord·
have started referring to BuzzFeed as 'human slop'
English
0
0
0
141
Miles Brundage
Miles Brundage@Miles_Brundage·
I hear Grupo Modelo, makers of Modelo Especial beer, are pivoting to frontier model training
English
8
0
13
2.2K
Steven Hansen retweetledi
Google DeepMind
Google DeepMind@GoogleDeepMind·
We’re rolling out an upgrade designed to help robots reason about the physical world. 🤖 Gemini Robotics-ER 1.6 has significantly better visual and spatial understanding in order to plan and complete more useful tasks. Here’s why this is important 🧵
English
141
422
2.6K
541.2K
Steve Yegge
Steve Yegge@Steve_Yegge·
I was chatting with my buddy at Google, who's been a tech director there for about 20 years, about their AI adoption. Craziest convo I've had all year. The TL;DR is that Google engineering appears to have the same AI adoption footprint as John Deere, the tractor company. Most of the industry has the same internal adoption curve: 20% agentic power users, 20% outright refusers, 60% still using Cursor or equivalent chat tool. It turns out Google has this curve too. But why is Google so... average? How is it that a handful of companies are taking off like a spaceship, and the rest, including Google, are mired in inaction? My buddy's observation was key here: There has been an industry-wide hiring freeze for 18+ months, during which time nobody has been moving jobs. So there are no clued-in people coming in from the outside to tell Google how far behind they are, how utterly mediocre they have become as an eng org. He says the problem is that they can't use Claude Code because it's the enemy, and Gemini has never been good enough to capture people's workflows like Claude has, so basically agentic coding just never really took off inside Google. They're all just plodding along, completely oblivious to what's happening out there right now. Not only is Google not able to do anything about it, they don't seem to be aware of the problem at all. I'm having major flashbacks to fifty years ago as a kid at the La Brea Tar Pits, asking, "why can't they just climb out?" My Google friend and I had this conversation over a month ago. I didn't share it because I wanted to look around a bit, and see if it's really as bad as all that. I've been talking to people from dozens of companies since then. And yeah. It's as bad as all that. Google is about average. Some companies at the bottom have near-zero AI adoption and can't even get budget for AI. They may have moats and high walls, but the horde is coming for them all the same. And then there are a few companies I've met recently who are *amazingly* leaned in to AI adoption. One category-leader company just cancelled IntelliJ for a thousand engineers. That's an incredibly bold move, one of many they're making towards agentic adoption. In my opinion, that company is setting themselves up for a _huge_ W. As for the rest, well, it's the Great Siloing. Everyone's flying blind. With nobody moving companies, no company knows where they stand on the AI adoption curve. Nobody knows how they're doing compared to everyone else. Half of them just check a box: "We enabled {Copilot/Cursor} for everyone!" Cue smug celebrations. They think this is like getting SOC2 compliance, just a thing they turn on and now it's "solved." And they don't realize that they've done effectively nothing at all. All because of a hiring freeze.
English
536
470
5.4K
2.8M
Tal Linzen
Tal Linzen@tallinzen·
The same kind of AI optimist will simultaneously say, "AI is growing exponentially and therefore it's not a bubble if we spend four trillion dollars expanding our data center capacity by a factor of 100 over the next five years", and "AI water usage is not a big deal because in 2025 it wasn't very much"
Alec Stapp@AlecStapp

English
6
0
18
7.4K
Steven Hansen
Steven Hansen@Zergylord·
@GaryMarcus @Fhotec I think humans conflate these too though Would love to know if there's data on capability perceptions for other consumer products -- maybe I'm wrong, but I suspect these would generally go down over time
English
0
0
0
18
Gary Marcus
Gary Marcus@GaryMarcus·
@Zergylord @Fhotec the graph is about *perceived capability* not intrigue or excitement or fun or whatever. i think you are conflating enthusiasm with capability assessment.
English
1
0
0
86
Steven Hansen
Steven Hansen@Zergylord·
@GaryMarcus @Fhotec Is there anything where this isn't the case though? e.g. the iPhone1 was amazing, but then we got used to it (despite it still being amazing)
English
1
0
0
63
Gary Marcus
Gary Marcus@GaryMarcus·
@Fhotec or a psychological effect of initial enthusiasm and hype gradually facing reality?
English
5
0
17
1.1K