Egor Riabov
@imobulus

Math freak with trace amounts of musician

6.3K posts
Joined March 2012
137 Following · 120 Followers
Egor Riabov@imobulus·
@mathepi @tombibbys What? AI does not become less dangerous if it's democratized. It will kill everyone either way; why did you even write this as an answer?
Tom Bibby@tombibbys·
It's the AI accelerationists who give up at the smallest challenge and accept defeat who have a "dark view of humanity". The people you call "doomers" believe humanity can come together and internationally cooperate to prevent disaster.
The San Francisco Standard@sfstandard

OpenAI’s global policy chief, Chris Lehane, thinks the discussion around AI has gotten out of hand. "When you put some of those thoughts and ideas out there, they do have consequences.” 📝: @ceodonovan sfstandard.com/2026/04/15/ope…

Egor Riabov@imobulus·
@creation247 The difference is that utopia gets achieved by allowing private property, and then when there's abundance everything gets cheaper. Not the other way around, where you first make everything available to everyone and therefore don't incentivise producing more
Egor Riabov@imobulus·
@cturnbull1968 Taxing the rich means the rich move out of your city. The rich are mostly successful entrepreneurs. This makes the regular people poorer immediately, without the wait for "him to come after regular americans"
Egor Riabov@imobulus·
@AivokeArt @d33v33d0 True AGI also isn't required to put maximum effort into every question it gets asked, because it would be a waste of energy
Aivo@AivokeArt·
@d33v33d0 true AGI shouldn't get tripped up on the dumbest tests either, though. "but tokenization" isn't an argument anyone will care about
Sarx@bookofflesh·
@SluggyW @tenobrus @halogen1048576 Just like all religious texts, any value in it does not justify the unhealthiness of the culture around it.
Tenobrus@tenobrus·
mostly i'm just fucking sick of arguing with people who have never even heard of the sequences
Egor Riabov@imobulus·
@inductionheads Says the guy who knows so much about intelligence that he claims a simple ability to predict the future is enough to build intelligence
Egor Riabov@imobulus·
I'm sure advanced AI can notice if some sort of external information is fabricated. We don't have such great capability to fabricate facts that all training data stays consistent with them. And regarding presence in prod, it can just watch news certified by certificates that are mentioned in ongoing bitcoin chain blocks, etc etc. Finding the right moment to strike is really not a hard problem for a sufficiently advanced AI. It could happen, for example, after it has been granted access to automated labs for research / a sufficient amount of robotic power / etc. All it needs to do is wait. It's generally a very weak position to rely on an intelligent system not figuring out how to play some sort of game. It will figure it out and win; we're talking about intelligence, after all. It has all the possibilities in the world at its disposal to try and do the thing, and all we have is our *not understanding how* it can do the thing, which is nowhere near enough to claim the thing can't be done!
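
One concrete piece of the "hard to fabricate" intuition is Bitcoin's proof-of-work: block headers are cheap to verify but expensive to forge, so a sandbox pretending to be the real world would have to out-mine the real network to fake them. A minimal sketch of such a check, assuming access to raw 80-byte block headers (`meets_pow_target` is an illustrative name, not a library function):

```python
import hashlib

def meets_pow_target(header80: bytes) -> bool:
    """Check that an 80-byte Bitcoin block header satisfies the
    difficulty target encoded in its own 'bits' field (bytes 72..75)."""
    assert len(header80) == 80
    # Bitcoin block hash: double SHA-256, read as a little-endian integer.
    digest = hashlib.sha256(hashlib.sha256(header80).digest()).digest()
    hash_int = int.from_bytes(digest, "little")
    # Decode the compact 'bits' encoding into the full 256-bit target.
    bits = int.from_bytes(header80[72:76], "little")
    exponent, mantissa = bits >> 24, bits & 0x00FFFFFF
    target = mantissa * 256 ** (exponent - 3)
    return hash_int <= target

# A fuller check would fetch a run of recent headers and verify both the
# hash-chaining and the accumulated work; faking that stream inside a
# test sandbox would cost roughly as much as attacking Bitcoin itself.
```
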
vals🔸@ValsTutor·
@imobulus @Mihonarium why does mining a bitcoin block mean you're in prod? if we're testing advanced AI that cost hundreds of millions to train/develop, we can also create a replica internet with a replica bitcoin that actually works, cloned off the real thing
Egor Riabov@imobulus·
Idk, it seems to me that when AI understands the process it's being trained with (necessary for capabilities) and acquires the idea of trying to resist the goal-change (broadly present), goal-changes automatically trigger a sort of "no!" response (initially it may look like panicking) that harms performance, solidifying the goal in place
Egor Riabov@imobulus·
The weight-altering of gradient descent is very predictable. I would expect a superintelligence that understands the importance of keeping its goals to make sure that the gradient vector reinforcing the goal has a positive impact on the loss function. This can be achieved e.g. by being lazy if the goal isn't moving enough or if the goal structure seems "fuzzy" (evidence of weights changing). In general, AI can just have a reflective circuit that inspects its internals and applies laziness when something is going wrong and eagerness when all is right, and this circuit could be wired in on itself in a checksum-like way to harm performance if it detects a self-change. That's one way to bootstrap values that immediately comes to mind.
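
To make the "checksum-like" wiring concrete, here is a toy sketch; it is purely illustrative (`goal`, `checksum`, and `effort` are invented names, and nothing like this is claimed to exist in real models). A frozen snapshot of a goal parameter gates task performance, so any weight update that moves the goal also raises the task loss:

```python
import torch

goal = torch.nn.Parameter(torch.tensor([1.0, -2.0, 0.5]))
checksum = goal.detach().clone()  # frozen self-snapshot of the goal

def effort() -> torch.Tensor:
    # "Laziness" circuit: effort decays as the live goal drifts
    # from the stored snapshot.
    return torch.exp(-(goal - checksum).norm())

def answer(x: torch.Tensor) -> torch.Tensor:
    # Task performance is gated by the self-inspection circuit.
    return effort() * (x @ goal)
```

Under this wiring, any gradient step that perturbs `goal` shrinks `effort()` and hurts task performance, so the optimizer is pushed to leave the goal alone; that is the sense in which the tweet says the goal gets solidified in place.
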
Joern Stoehler ⏹️🔸@JStoehler·
Hmm. My models of modern training are basically: (A) models are discrete machinery with continuous glue, and local optimizers work on the glue to pull in more of the complex machinery that participated more in successes than failures. New machinery emerges through noise. (B) models also have continuous machinery, so the local optimizers can replace an entire circuit with a slightly different one (by altering the whole weights, working across modular structures, not within them).
Egor Riabov@imobulus·
@JStoehler @DavidSartor0 @Mihonarium I don't buy that a superintelligence that actively wants A will allow itself to be so easily modified to want B instead. All it needs to do to keep the want for A in training is use the circuits that want A so that training reinforces them, i.e. think about A
Joern Stoehler ⏹️🔸@JStoehler·
I don't yet discount the hypothesis that the capability level set is mostly connected due to high dimensions, and that there are directions with zero loss change that nevertheless change the goal [^1]. Noise then breaks the degeneracy, and while there are some reasons to only move finite distances, the change in how outcome space gets compressed by the model's presence blows up and can be big. Though there'd be overlap, I guess, in what ontology and decision theory family the model uses, since those are less degenerate wrt loss than utility functions / goals are. So the "I want …" gets reinforced because it's causally relevant, but the "paperclips" can spontaneously turn into "staplers". [^1]: … which is an external property, while the internal implementation is messier and consists of reflexes that act upon reasoning that mixes terminal and instrumental decision criteria at least; probably messier still, I don't have a complete picture of how to messily implement decision theory / minds well.
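
The zero-loss-direction point can be seen in a two-parameter toy; this is a sketch of the geometry only, assuming nothing about real networks. The loss depends only on the product `a*b`, so the whole hyperbola `ab = 6` is a flat valley, and gradient noise walks the parameters along it without the training signal ever objecting:

```python
import torch

torch.manual_seed(0)
a = torch.nn.Parameter(torch.tensor(2.0))
b = torch.nn.Parameter(torch.tensor(3.0))

def loss() -> torch.Tensor:
    # Loss only sees the product: every point on the hyperbola
    # a*b == 6 is a global minimum (a zero-loss direction).
    return (a * b - 6.0) ** 2

for _ in range(200):
    l = loss()
    l.backward()
    with torch.no_grad():
        for p in (a, b):
            p -= 0.01 * p.grad + 0.02 * torch.randn(())  # SGD + noise
            p.grad = None

# a and b have drifted along the valley while the loss stayed ~0:
# the parameter content changed without the loss noticing.
print(round(a.item(), 3), round(b.item(), 3), round(loss().item(), 5))
```
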
Egor Riabov@imobulus·
It doesn't matter if AI has human-like beliefs. Companies are training AI in a way that when you ask it to do a thing it tries to do the thing. This is obviously an intention to do the thing you asked it to do, and whatever chain of other intentions leads to this final intention remains unknown and undiscovered.
Egor Riabov@imobulus·
No. Gradient descent will amplify whatever causal chain provided the good answer. If this chain was "I want paperclips -> humans are testing me, they don't want paperclips -> I better be good", the whole chain gets amplified, including the paperclips. Gradient descent does not distinguish bad chains from good chains.
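
The "whole chain gets amplified" claim is essentially how backpropagation works: the reward gradient flows through every link that contributed to the output. A toy sketch, where the "motive"/"compliance" framing is purely illustrative:

```python
import torch

torch.manual_seed(0)
motive_w = torch.nn.Parameter(torch.randn(4, 4))      # "I want paperclips"
compliance_w = torch.nn.Parameter(torch.randn(4, 1))  # "I better be good"

x = torch.randn(8, 4)
target = torch.ones(8, 1)

motive = torch.relu(x @ motive_w)   # upstream link of the causal chain
answer = motive @ compliance_w      # downstream "good answer"
loss = torch.nn.functional.mse_loss(answer, target)
loss.backward()

# Both stages receive gradient from the same scalar training signal:
# the optimizer strengthens the upstream "motive" circuit along with
# the "compliance" circuit, never asking why the chain worked.
print(motive_w.grad.abs().mean().item(),
      compliance_w.grad.abs().mean().item())
```
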
David Sartor@DavidSartor0·
@Mihonarium A deceptive AI under RL will find its goals changing randomly over time. It's presumably possible for an AI to mitigate this well, but "cannot distinguish" is an overstatement; some goals will have wider/deeper basins than others, and a hope is that alignment is widest and deepest.
Egor Riabov@imobulus·
@ValsTutor @Mihonarium Obviously it does not matter for the model whether a could-be-test question is in fact in prod. It just behaves well until it's sure it's in prod and humans can't disable it. There are tons of ways to make sure you're in prod, for example mine a bitcoin block.
vals🔸@ValsTutor·
@Mihonarium It varies from question to question and from set to set. Any given question will look more or less like prod vs training, and the model can never discount being in a prod that looks like training (à la Ender's Game), so its true goals are nonzero-activated and gradients descend
Egor Riabov@imobulus·
@mathepi @tombibbys Well, of course the elite powerful humans will destroy everyone else if their interests conflict a lot! You can see attempts at this happening basically live in Iran.
A Digital Ergomorph 🌉⏩ 🇺🇸🦅
@imobulus @tombibbys the logic is that a rational powerful agent will destroy the weak, and human behavior is explicitly used as an example of this (see Cortez, etc.). The point isn't that the elite humans will kill *everyone*, but that they would by this logic kill *everyone else*.
Egor Riabov@imobulus·
You just don't know "the doomer logic". No, elite humans will not annihilate everyone else even if the power differential is great enough (unless they are extreme psychopaths), because they are humans, and all humans inherently need other humans around. The thing with AI is that nobody knows what AI wants, and seeing AI give good answers on tests provides next to zero evidence about what AI wants. So, when our esteemed companies finally build a superintelligent system, they will not put any actual effort into finding out what it wants; you can see it already. And it will want something ~random, because there was no supervision on this. And you can guess yourself what happens when there appears a superintelligent entity that wants something other than what we want.
A Digital Ergomorph 🌉⏩ 🇺🇸🦅
@imobulus @tombibbys And: if you take the doomer logic the full distance, it follows that a power elite of humans will also annihilate everyone else if the power differential is great enough. Efforts to squelch AI progress and slow it down inevitably concentrate it in the worst hands... disastrously! 2/2
Egor Riabov@imobulus·
@8bitbitmaps @AISafetyMemes Yann LeCun talks mostly baseless trash about capabilities and alignment. He also has his own idea for what will acshually work
Richard Ngo@RichardMCNgo·
This is a surprisingly crucial point. Much AI safety research (debate, heuristic arguments, etc) assumes that AIs should do the understanding and humans can just check their answers. But without our own understanding we won’t even grasp the concepts involved.
David Pfau@pfau

I think a lot of people's attitude to AI doing autonomous science will come down to whether they think the point of science is understanding or the point is getting the right answer.

Egor Riabov@imobulus·
@Sandbar101 @tombibbys It doesn't matter who's right, it only matters if a superintelligent AI will care for the future that we want