Dr Waku

209 posts

Dr Waku banner
Dr Waku

Dr Waku

@DrWakuAI

YouTuber, AI research scientist, computer science PhD. I talk about how AI will affect all of us and society as a whole.

Canada Katılım Temmuz 2016
213 Takip Edilen1.1K Takipçiler
Dr Waku retweetledi
Max Tegmark
Max Tegmark@tegmark·
If a country or company builds superintelligence, they'll and up *losing* power, not gaining power – this thought-provoking study explains why:
Anthony Aguirre@AnthonyNAguirre

Superintelligence, if we develop it using anything like current methods, would not be under meaningful human control. That's the bottom-line of a new study I've put out entitled Control Inversion (link in second post.) Many experts I talk to who take superintelligence (real, general-purpose, autonomous superintelligence) seriously agree with this. But I wanted to take a deep dive to assess whether I think it's true, and why. Unfortunately, I'm now more convinced than ever. The basic argument is laid out below, but the core implication is worth putting up here: the global race to superintelligence is fundamentally misguided. Companies and countries are rushing to be first, believing whoever builds superintelligence will "grab the prize" of unprecedented power and wealth. This is dangerously wrong. Superintelligent systems would not bestow power on their creators, they would absorb it. Even if superintelligence does not "go rogue" (which it might), humans – including superintelligence's creators – would find themselves sidelined as it makes decisions faster than them, with more complex plans, and with strategic foresight beyond human comprehension. Whether quickly or slowly, losing control of superintelligence would inexorably lead to losing control to superintelligence. If humanity wants to stay in the driver's seat of our civilization, we need to give up this race. So what does the paper say?

English
21
23
151
22.6K
Dr Waku
Dr Waku@DrWakuAI·
If you're American and want to support AI safety, you can donate to Alex Bores's campaign for Congress which started today. He is responsible for the RAISE bill in NY, which has been very helpful for AI safety.
English
2
2
8
691
Dr Waku
Dr Waku@DrWakuAI·
@sinuous_grace I saw someone feed the PDF of the book to a bunch of LLMs (including many commercial ones) and ask them to produce a one sentence summary. Which means it's in their training data now if they want it. Mostly, it was a reminder about digital intelligence reading faster than us!
English
0
0
2
17
Sinuous Grace
Sinuous Grace@sinuous_grace·
@DrWakuAI Just curious, how could LLMs already have read it? I would guess a proprietary frontier model (Gemini, GPT-5, Claude) would refuse under copyright grounds. Or are they naive if you somehow turn an e-book into a format they can read? Or are LLMs in this case open source models?
English
1
0
1
34
Dr Waku
Dr Waku@DrWakuAI·
An AI safety book came out today: If Anyone Builds it, Everyone Dies. LLMs have read it already, so we have some catching up to do. goodreads.com/book/show/2286…
English
1
1
13
286
ControlAI
ControlAI@ControlAI·
AI research scientist @DrWakuAI explains jailbreaking: "You can ask it to do anything, and it will help you with that. Even requests that a terrorist might make."
English
6
3
18
992
ControlAI
ControlAI@ControlAI·
In this latest episode of our podcast @maxwinga sits down with @DrWakuAI to explore AI security challenges! They discuss the risks of jailbreaking, how modern AI training methods create inherent vulnerabilities, and the growing threat posed by bad actors.
English
3
5
28
9.7K
Dr Waku retweetledi
ControlAI
ControlAI@ControlAI·
Ex-OpenAI researcher Steven Adler warns of the AI extinction threat: "I don't think we're ready today. I don't think we're even close." He says maybe 30% of the scientists working on this think we'll by ready by 2030, but the AI companies want to build AGI "as soon as possible".
English
6
19
52
24.3K
Dr Waku
Dr Waku@DrWakuAI·
@BartenOtto @romanyam He believes AI will destroy us, in any manner of ways. AI could then expand an empire, sit around and do nothing, shut itself down, or yes, destroy itself unintentionally. Humans sometimes cause car crashes unintentionally, even after being promoted at work.
English
1
0
1
41
Otto Barten◀️
Otto Barten◀️@BartenOtto·
I don't think AI is a plausible candidate for a great filter, and also not that more focus on xrisk research would help. AI takeover would increase, not decrease, the chance of space colonization. Everyone else would have tried xrisk research too and apparently it didn't work.
Otto Barten◀️ tweet media
English
4
0
3
254
Dr Waku
Dr Waku@DrWakuAI·
@BartenOtto Rather than thinking about an AI as an all-powerful entity, you could imagine thinking about something that's really good at optimizing. Optimization can easily go off the rails if you give it the wrong problem, or the wrong feedback.
English
0
0
1
19
Otto Barten◀️
Otto Barten◀️@BartenOtto·
@DrWakuAI Tbc, I think AI might perfectly well destroy humanity, but not before it doesn't need us anymore.
English
1
0
1
25
Dr Waku
Dr Waku@DrWakuAI·
Paperclip maximizer. An AI can derive instrumental goals that don't make much sense to us. It can also derive goals that cause it to make a miscalculation and destroy itself (or us, or both). lesswrong.com/w/squiggle-max… Some people (like Roman who I linked to above) think this is 99.99+% likely.
English
1
0
0
24
Otto Barten◀️
Otto Barten◀️@BartenOtto·
@DrWakuAI Why would AI destroy something without which it cannot function? If it does that, it's not exactly intelligent - and I think it would need to be to destroy anything at all. Also, to make this Fermi-relevant, it has to happen in all 10^11 or such cases - seems very unlikely.
English
2
0
0
30
Dr Waku
Dr Waku@DrWakuAI·
@tobyordoxford Very interesting result, and it makes me think that CoT errors and agent capabilities are more predictable than we realize. It might make it possible to complete longer tasks even sooner, if one can scale inference compute time linearly based on predicted task length. 9/10
English
1
0
4
227
Dr Waku
Dr Waku@DrWakuAI·
AI agents have a half-life for their success rates at completing tasks. Yes, the same type of half-life as in nuclear chemistry: a constant chance of task failure (= radioactive decay) in each time period... 1/10
English
2
0
11
2K