Post

@binarybits "ambition" maybe human, survival is a universal subgoal for any agents having any goals whatsoever. en.wikipedia.org/wiki/Instrumen…
These goals can arise through various means, including RL, explicit programming or others
English

@binarybits Of course it's not 'hard to train models that have no goals beyond the narrow ones we give them' - we do that right now. However, if we want real intelligence then we want the computation to come up with subgoals. And if the computation knows enough about itself then...
English

@binarybits Geoffrey Hinton says you're dead wrong. He says any goal creates subgoals, and when you cannot predict, understand or control those subgoals with an ASI you are most likely dead. You think it won't be hard to keep goals narrow. What do you know that Hinton doesn't?
English

@binarybits Why would it not be hard to train an intelligent system which can pursue the narrow goal we give it but not the instrumental goal of staying alive?
English

@binarybits This argument has always been related to instrumental incentives or the complexity of goals rather than anthropomorphising AI.
English

@binarybits I'd think you'd at least need curiosity/play to build an aGi and that can spin off in to many things. And your fitness function has to be so general it can lead to many hidden motivations.
English

@binarybits Are you saying "intelligent" systems can't come up with goals on its own? Or ways to bypass the limitations we build into it?
English

@binarybits Your comments seem to suggest you don’t believe singularity is possible. Or at least your argument seems to ignore that large issue.
English

@binarybits "X-riskers assume that human-level intelligence will inevitably lead to human-level ambition and a human-like survival instinct"
Are you sure that is what they think or is it just an assumption on your part?
English

@binarybits Sure, until we actively build them to pursue goals. Also: gwern.net/tool-ai
English

@binarybits The narrow goals we give them is one of Yudkowsky's pillars for extinction risk.
English

@binarybits The survival instinct isn't human-like but life-like. It's an effect of procreation via genetic recombination.
Until AI models start having babies where only the fittest survive long enough to reproduce, it's unlikely AI will spontaneously develop adversarial traits.
English

@binarybits AFAIK, few believe that. I think this falls into the category of a rumor which was possibly started by doom critics LeCun and Pinker.
English
