Dr Waku

209 posts

@DrWakuAI

YouTuber, AI research scientist, computer science PhD. I talk about how AI will affect all of us and society as a whole.

Canada · Joined July 2016

213 Following · 1.1K Followers

Dr Waku retweeted

Max Tegmark @tegmark · 13 Nov

If a country or company builds superintelligence, they'll end up *losing* power, not gaining power – this thought-provoking study explains why:

Anthony Aguirre@AnthonyNAguirre

Superintelligence, if we develop it using anything like current methods, would not be under meaningful human control. That's the bottom line of a new study I've put out entitled Control Inversion (link in second post). Many experts I talk to who take superintelligence (real, general-purpose, autonomous superintelligence) seriously agree with this. But I wanted to take a deep dive to assess whether I think it's true, and why. Unfortunately, I'm now more convinced than ever. The basic argument is laid out below, but the core implication is worth putting up here: the global race to superintelligence is fundamentally misguided. Companies and countries are rushing to be first, believing whoever builds superintelligence will "grab the prize" of unprecedented power and wealth. This is dangerously wrong. Superintelligent systems would not bestow power on their creators, they would absorb it. Even if superintelligence does not "go rogue" (which it might), humans – including superintelligence's creators – would find themselves sidelined as it makes decisions faster than them, with more complex plans, and with strategic foresight beyond human comprehension. Whether quickly or slowly, losing control of superintelligence would inexorably lead to losing control to superintelligence. If humanity wants to stay in the driver's seat of our civilization, we need to give up this race. So what does the paper say?

Dr Waku @DrWakuAI · 20 Oct

Apparently, donating today is particularly helpful for the campaign. ericneyman.wordpress.com/2025/10/20/con… Direct donation link: secure.actblue.com/donate/boresai…

Dr Waku @DrWakuAI · 20 Oct

If you're American and want to support AI safety, you can donate to Alex Bores's campaign for Congress, which started today. He is responsible for the RAISE bill in NY, which has been very helpful for AI safety.

Dr Waku @DrWakuAI · 16 Sep

@sinuous_grace I saw someone feed the PDF of the book to a bunch of LLMs (including many commercial ones) and ask them to produce a one-sentence summary. Which means it's in their training data now if they want it. Mostly, it was a reminder about digital intelligence reading faster than us!

Sinuous Grace @sinuous_grace · 16 Sep

@DrWakuAI Just curious, how could LLMs already have read it? I would guess a proprietary frontier model (Gemini, GPT-5, Claude) would refuse on copyright grounds. Or are they naive if you somehow turn an e-book into a format they can read? Or are the LLMs in this case open-source models?

Dr Waku @DrWakuAI · 16 Sep

An AI safety book came out today: If Anyone Builds It, Everyone Dies. LLMs have read it already, so we have some catching up to do. goodreads.com/book/show/2286…

Dr Waku @DrWakuAI · 20 Jul

@CFE0001 @controlai It was a fictional scenario but they did real experiments on it... fortune.com/2025/05/23/ant…

ControlAI @ControlAI · 20 Jul

AI research scientist @DrWakuAI explains jailbreaking: "You can ask it to do anything, and it will help you with that. Even requests that a terrorist might make."

Dr Waku @DrWakuAI · 17 Jul

@maxwinga @controlai Thanks for having me!

Max Winga @maxwinga · 16 Jul

@controlai @DrWakuAI Thanks for coming on @DrWakuAI, this was a great conversation!

ControlAI @ControlAI · 16 Jul

In this latest episode of our podcast @maxwinga sits down with @DrWakuAI to explore AI security challenges! They discuss the risks of jailbreaking, how modern AI training methods create inherent vulnerabilities, and the growing threat posed by bad actors.

Dr Waku @DrWakuAI · 17 Jul

Here's an in-person interview I did with @controlai in London!

ControlAI@ControlAI

Dr Waku @DrWakuAI · 6 Jul

I am a big fan of Richard Feynman, so it's amazing to see his son Carl working in AI... and concerned about AI safety.

Liron Shapira@liron

EXCLUSIVE: Carl Feynman warns that building AGI likely means human extinction. (Yes, son of Richard Feynman & AI engineer of 45 years.) He's known Eliezer Yudkowsky since the '90s and was initially optimistic about building AGI. Today his P(Doom) is an alarming 43%. Thread 👇

Dr Waku @DrWakuAI · 28 Jun

@controlai Pretty concisely stated. Good job!

Dr Waku retweeted

ControlAI @ControlAI · 27 Jun

Ex-OpenAI researcher Steven Adler warns of the AI extinction threat: "I don't think we're ready today. I don't think we're even close." He says maybe 30% of the scientists working on this think we'll be ready by 2030, but the AI companies want to build AGI "as soon as possible".

Dr Waku @DrWakuAI · 28 Jun

@BartenOtto @romanyam He believes AI will destroy us, in any number of ways. AI could then expand an empire, sit around and do nothing, shut itself down, or yes, destroy itself unintentionally. Humans sometimes cause car crashes unintentionally, even after being promoted at work.

Otto Barten◀️ @BartenOtto · 28 Jun

@DrWakuAI I don't think @romanyam thinks it's 99% likely that AI destroys itself

Otto Barten◀️ @BartenOtto · 27 Jun

I don't think AI is a plausible candidate for a great filter, and also not that more focus on xrisk research would help. AI takeover would increase, not decrease, the chance of space colonization. Everyone else would have tried xrisk research too and apparently it didn't work.

Dr Waku @DrWakuAI · 28 Jun

@BartenOtto Rather than thinking about an AI as an all-powerful entity, you could imagine thinking about something that's really good at optimizing. Optimization can easily go off the rails if you give it the wrong problem, or the wrong feedback.

Otto Barten◀️ @BartenOtto · 28 Jun

@DrWakuAI Tbc, I think AI might perfectly well destroy humanity, but not before it doesn't need us anymore.

Dr Waku @DrWakuAI · 28 Jun

Paperclip maximizer. An AI can derive instrumental goals that don't make much sense to us. It can also derive goals that cause it to make a miscalculation and destroy itself (or us, or both). lesswrong.com/w/squiggle-max… Some people (like Roman, whom I linked to above) think this is 99.99+% likely.

Otto Barten◀️ @BartenOtto · 28 Jun

@DrWakuAI Why would AI destroy something without which it cannot function? If it does that, it's not exactly intelligent - and I think it would need to be to destroy anything at all. Also, to make this Fermi-relevant, it has to happen in all 10^11 or so cases - seems very unlikely.

Dr Waku @DrWakuAI · 27 Jun

@tobyordoxford For more, follow me on substack! drwaku.substack.com Reference: tobyord.com/writing/half-l… 10/10

Dr Waku @DrWakuAI · 27 Jun

@tobyordoxford Very interesting result, and it makes me think that CoT errors and agent capabilities are more predictable than we realize. It might make it possible to complete longer tasks even sooner, if one can scale inference compute time linearly based on predicted task length. 9/10

Dr Waku @DrWakuAI · 27 Jun

AI agents have a half-life for their success rates at completing tasks. Yes, the same type of half-life as in nuclear chemistry: a constant chance of task failure (= radioactive decay) in each time period... 1/10
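
To make the half-life picture concrete, here is a minimal Python sketch. It assumes success probability decays exponentially with task length, which is what a constant chance of failure in each time period implies. The function names and the 60-minute half-life are illustrative assumptions, not figures from the thread or the referenced essay.

```python
import math


def success_probability(task_minutes: float, half_life_minutes: float) -> float:
    """Chance of finishing a task of the given length.

    Assumes a constant failure rate per unit time, so the success
    probability halves every `half_life_minutes` (hypothetical parameter).
    """
    return 0.5 ** (task_minutes / half_life_minutes)


def horizon_for_success(target: float, half_life_minutes: float) -> float:
    """Longest task length whose success probability is still `target`.

    Inverts success_probability: solves 0.5 ** (t / half_life) = target for t.
    """
    return half_life_minutes * math.log(target) / math.log(0.5)


if __name__ == "__main__":
    half_life = 60.0  # assumed value, for illustration only
    for minutes in (15, 60, 120, 240):
        p = success_probability(minutes, half_life)
        print(f"{minutes:>4} min task: success chance {p:.3f}")
    print(f"80% success horizon: {horizon_for_success(0.8, half_life):.1f} min")
```

Under this model, a task twice as long as the half-life succeeds a quarter of the time, and the task length that can be completed at a high success rate is much shorter than the half-life itself.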
