Steven Hansen

2.8K posts

@Zergylord

Gemini Agents lead at Google DeepMind.

London, England · Joined June 2009

629 Following · 3.4K Followers

Pinned Tweet

Steven Hansen@Zergylord·19 Nov

You can just do things better

Oriol Vinyals@OriolVinyalsML

The secret behind Gemini 3? Simple: Improving pre-training & post-training 🤯 Pre-training: Contra the popular belief that scaling is over—which we discussed in our NeurIPS '25 talk with @ilyasut and @quocleix—the team delivered a drastic jump. The delta between 2.5 and 3.0 is as big as we've ever seen. No walls in sight! Post-training: Still a total greenfield. There's lots of room for algorithmic progress and improvement, and 3.0 hasn't been an exception, thanks to our stellar team. Congratulations to the whole team 💙💙💙

1.3K

Steven Hansen@Zergylord·4h

That's what she said.

etn.@etnshow

BREAKING: Sequoia and Lightspeed co-lead Europe's largest seed funding round with $1.1B at $5.1B post for ex-Deepmind David Silver's Ineffable Intelligence. David is committing to giving away 100% of the money he makes from his Ineffable equity via Founders Pledge - the biggest pledge in their history and it is likely to amount to multiple billions.

142

Steven Hansen@Zergylord·1d

@cohere What does "sovereign AI for the World" mean? Isn't that just AI?

130

Cohere@cohere·3d

🚀 Sovereign AI for the world. Cohere & Aleph Alpha form transatlantic AI powerhouse anchored in Canada & Germany! Combining our global scale with European R&D excellence to build sovereign, enterprise-grade AI. Security, privacy & trust for businesses & governments worldwide. #SovereignAI #AIPartnership Learn more: businesswire.com/news/home/2026… Image from left to right: Rolf Schumann, Schwarz Digits, Samuel Weinbach, Aleph Alpha, Aidan Gomez, Cohere, Minister Solomon, Canada, Minister Wildberger, Germany

183

108.1K

Steven Hansen@Zergylord·2d

@AlexanderKalian RL is the big bet

Dr Alexander D. Kalian@AlexanderKalian·2d

We trained LLMs to operate at near-human competency, via exposure to an *entire internet* of human discussions, books, research etc. So how does anyone plan to train an "AI superintelligence"? Where can we find an entire internet of superintelligent outputs, for training data?

107

180

13.9K

Steven Hansen@Zergylord·2d

@xriskology Why do they need your permission? Are you suggesting we have a vote before anyone can create new technology?

244

Dr. Émile P. Torres (they/them)@xriskology·2d

Who asked for this? Why do "fuckers" like roon (quoting Altman) get to unilaterally decide when and how to build this technology?

roon@tszzl

there will be this brief era where we can watch our AIs bumble around on the computer clicking things, failing sometimes, taking a ~human amount of time to write code. in the blink of an eye they’ll be manipulating computers far too quickly to monitor

212

116.6K

Steven Hansen@Zergylord·2d

@NandoDF @InvincibleEdge @nickclegg @UKParliament @MistralAI @cohere yeah, no argument there, just responding to "the LLM people (Gemini) have moved to California", which isn't accurate

110

Nando de Freitas@NandoDF·2d

I agree, but that is not the same as saying that there is a UK-owned company, with leadership in the UK, building SOTA LLMs. I also work on LLMs in London, but that too does not mean the UK has a sovereign AI programme. It is wonderful that London has so many talented people working with great American companies, but that’s again not the same as the UK being in the AI race as a significant player.

202

Nando de Freitas@NandoDF·2d

It is ghastly how the UK 🇬🇧 has failed at having its own LLM companies. By doing so it has become irrelevant in the AI race. How do we fix this? @nickclegg @UKParliament France has @MistralAI. Canada has @cohere. Every other LLM AI company is 🇺🇸/🇨🇳

Sam@Discoplomacy

Feel like very few serious people make the argument Britain should develop its own LLM? Certainly not an argument you see made in Westminster often during the sovereignty debate. Also some other eyebrow-raising sections in this interview.

26.3K

Steven Hansen@Zergylord·2d

@NandoDF @InvincibleEdge @nickclegg @UKParliament @MistralAI @cohere There are absolutely still LLM people in the London office.

195

Nando de Freitas@NandoDF·2d

@InvincibleEdge @nickclegg @UKParliament @MistralAI @cohere Google DeepMind is an American company. The LLM people (Gemini) have moved to California. I was part of AlphaGo (arxiv.org/pdf/1812.06855), Alphacode, Gato, Veo, etc, all built in the UK, but all that is now past.

696

Steven Hansen@Zergylord·3d

@litcapital

GIF

379

litquidity@litcapital·3d

The Gemini team seeing Google invest in their largest competitor

Exec Sum@exec_sum

BREAKING: Google plans to invest up to $40 billion in Anthropic.

331

2.4K

54.6K

2.7M

Steven Hansen@Zergylord·3d

@SakanaAILabs @hardmaru looks like fugu-mini learned a worse orchestration strategy than "always call Opus"...

1.1K

Sakana AI@SakanaAILabs·4d

We’re launching the beta for our new commercial AI product: Sakana Fugu 🐡, a multi-agent orchestration system!
Blog: sakana.ai/fugu-beta
Fugu hits SOTA on SWE-Pro, GPQA-D, and ALE-Bench, and has been our internal secret weapon. It dynamically coordinates frontier models, autonomously selecting the optimal agent combinations and roles for each task.
Available as an OpenAI-compatible API, you can seamlessly integrate Fugu into your existing workflows with minimal changes.
🐟 Fugu Mini: High-speed orchestration optimized for latency
🐡 Fugu Ultra: Full model pool utilization for deep, complex reasoning
Apply for the beta test here: forms.gle/BtKkhc2CfLKk1d…

145

608

296.6K

Steven Hansen@Zergylord·4d

have started referring to BuzzFeed as 'human slop'

141

Steven Hansen@Zergylord·5d

I take offense to this characterization -- the statistics involved is actually pretty simple

Big Brain AI@realBigBrainAI

Oxford AI professor Michael Wooldridge: "ChatGPT doesn't understand anything. It's essentially doing some fancy statistics."

625

Steven Hansen@Zergylord·20 Apr

To think, we've had wizard duels all along

Dudes Posting Their W’s@DudespostingWs

Japanese engineers developed a “Sword Tip Visualization System” for the Fencing World Championships, and it makes fencing look absolutely incredible to watch.

718

Steven Hansen@Zergylord·20 Apr

Redefining tensor feels harmless, but calling this softmax vs softargmax adds noticeable cognitive load and makes it 10x harder to teach

Alex Shtoff@AlexShtf

And stop calling "𝚎𝚡𝚙(𝚡) / 𝚜𝚞𝚖(𝚎𝚡𝚙(𝚡))" 𝑠𝑜𝑓𝑡𝑚𝑎𝑥. It's 𝑠𝑜𝑓𝑡𝒂𝒓𝒈𝑚𝑎𝑥.

542

Steven Hansen@Zergylord·16 Apr

@Miles_Brundage Modelo4

Miles Brundage@Miles_Brundage·16 Apr

I hear Grupo Modelo, makers of Modelo Especial beer, are pivoting to frontier model training

2.2K

Steven Hansen retweeted

Google DeepMind@GoogleDeepMind·14 Apr

We’re rolling out an upgrade designed to help robots reason about the physical world. 🤖 Gemini Robotics-ER 1.6 has significantly better visual and spatial understanding in order to plan and complete more useful tasks. Here’s why this is important 🧵

141

422

2.6K

541.2K

Steven Hansen@Zergylord·13 Apr

@SebJohnsonUK I think anthropic is around here as well

266

Seb Johnson@SebJohnsonUK·13 Apr

King's Cross is becoming the leading AI hub outside of the US. It houses Meta, DeepMind, Wayve, Synthesia and now OpenAI.

CNBC@CNBC

OpenAI announces first permanent London office after halting UK Stargate project cnbc.com/2026/04/13/ope…

125

1.6K

148.2K

Steven Hansen@Zergylord·13 Apr

@Steve_Yegge

GIF

905

Steve Yegge@Steve_Yegge·13 Apr

I was chatting with my buddy at Google, who's been a tech director there for about 20 years, about their AI adoption. Craziest convo I've had all year.
The TL;DR is that Google engineering appears to have the same AI adoption footprint as John Deere, the tractor company. Most of the industry has the same internal adoption curve: 20% agentic power users, 20% outright refusers, 60% still using Cursor or equivalent chat tool. It turns out Google has this curve too.
But why is Google so... average? How is it that a handful of companies are taking off like a spaceship, and the rest, including Google, are mired in inaction?
My buddy's observation was key here: There has been an industry-wide hiring freeze for 18+ months, during which time nobody has been moving jobs. So there are no clued-in people coming in from the outside to tell Google how far behind they are, how utterly mediocre they have become as an eng org.
He says the problem is that they can't use Claude Code because it's the enemy, and Gemini has never been good enough to capture people's workflows like Claude has, so basically agentic coding just never really took off inside Google. They're all just plodding along, completely oblivious to what's happening out there right now.
Not only is Google not able to do anything about it, they don't seem to be aware of the problem at all. I'm having major flashbacks to fifty years ago as a kid at the La Brea Tar Pits, asking, "why can't they just climb out?"
My Google friend and I had this conversation over a month ago. I didn't share it because I wanted to look around a bit, and see if it's really as bad as all that. I've been talking to people from dozens of companies since then.
And yeah. It's as bad as all that. Google is about average. Some companies at the bottom have near-zero AI adoption and can't even get budget for AI. They may have moats and high walls, but the horde is coming for them all the same.
And then there are a few companies I've met recently who are *amazingly* leaned in to AI adoption. One category-leader company just cancelled IntelliJ for a thousand engineers. That's an incredibly bold move, one of many they're making towards agentic adoption. In my opinion, that company is setting themselves up for a _huge_ W.
As for the rest, well, it's the Great Siloing. Everyone's flying blind. With nobody moving companies, no company knows where they stand on the AI adoption curve. Nobody knows how they're doing compared to everyone else.
Half of them just check a box: "We enabled {Copilot/Cursor} for everyone!" Cue smug celebrations. They think this is like getting SOC2 compliance, just a thing they turn on and now it's "solved." And they don't realize that they've done effectively nothing at all.
All because of a hiring freeze.

536

470

5.4K

2.8M

Steven Hansen@Zergylord·10 Apr

@tallinzen note that 100x AI growth would still be less than corn

143

Tal Linzen@tallinzen·9 Apr

The same kind of AI optimist will simultaneously say, "AI is growing exponentially and therefore it's not a bubble if we spend four trillion dollars expanding our data center capacity by a factor of 100 over the next five years", and "AI water usage is not a big deal because in 2025 it wasn't very much"

Alec Stapp@AlecStapp

7.4K

Steven Hansen@Zergylord·9 Apr

The only way forward for America is for Dario to beat Pete in a pushup competition and then ask for his resignation.

Arjun Narayan@narayanarjun

It should be a top EA priority for Dario to get jacked. 2-3 hours gym a day, full Hollywood steroid course, etc. And be able to go on Rogan and yap for 3 hours about why he can be trusted. The alternative is nationalization, and he should be able to see that.

417

Steven Hansen@Zergylord·9 Apr

@GaryMarcus @Fhotec I think humans conflate these too, though. Would love to know if there's data on capability perceptions for other consumer products -- maybe I'm wrong, but I suspect these would generally go down over time

Gary Marcus@GaryMarcus·9 Apr

@Zergylord @Fhotec the graph is about *perceived capability* not intrigue or excitement or fun or whatever. i think you are conflating enthusiasm with capability assessment.

Gary Marcus@GaryMarcus·8 Apr

1. If true (and it does fit with my perceptions FWIW), this is an amazing and incredibly damning graph
2. Can anyone find the source on which it is based?

Marcin Krzyzanowski@krzyzanowskim

"Anthropic, OpenAI and Google release their new models with high quality from day one then slowly nerf them until the next model, so when the next model hits, it's perceived as a bigger jump than it actually is" sounds right what's happening

227

26.6K

Steven Hansen@Zergylord·9 Apr

@GaryMarcus @Fhotec Is there anything where this isn't the case though? e.g. the iPhone1 was amazing, but then we got used to it (despite it still being amazing)

Gary Marcus@GaryMarcus·8 Apr

@Fhotec or a psychological effect of initial enthusiasm and hype gradually facing reality?

1.1K
