Hank Conn wrote:
     > Yes, you are exactly right. The question is which of my
     > assumptions are unrealistic?

    Well, you could start with the idea that the AI has "... a strong goal
    that directs its behavior to aggressively take advantage of these
    means...".  It depends what you mean by "goal" (an item on the task
    stack or a motivational drive?  They are different things), and this
    begs a question about who the idiot was that designed it so that it
    pursues this kind of aggressive behavior rather than some other!

A goal is a problem you want to solve in some environment. The "idiot" who designed it may program its goal to be, say, making paperclips. Then, after some thought and RSI (recursive self-improvement), the AI decides that converting the entire planet into computronium, in order to figure out how to maximize the number of paperclips in the Universe, will satisfy this goal quite optimally. Anybody could program it with any goal in mind, and RSI happens to be a very useful process for accomplishing many complex goals.

    There is *so* much packed into your statement that it is difficult to go
    into it in detail.

    Just to start with, you would need to cross-compare the above statement
    with the account I gave recently of how a system should be built with a
    motivational system based on large numbers of diffuse constraints.  Your
    description is one particular, rather dangerous, design for an AI - it
    is not an inevitable design.

I'm not asserting any specific AI design. And I don't see how a motivational system based on "large numbers of diffuse constraints" inherently prohibits RSI, or really has any relevance to this. "A motivational system based on large numbers of diffuse constraints" does not, by itself, solve the problem: if the particular constraints do not form a congruent mapping to the concerns of humanity, regardless of their number or level of diffuseness, then we are likely facing an Unfriendly outcome of the Singularity at some point in the future.

The point I am heading towards, in all of this, is that we need to unpack some of these ideas in great detail in order to come to sensible conclusions.

I think the best way to do that would be in a full-length paper, although I did talk about some of that detail in my recent lengthy post on motivational systems.

Let me try to bring out just one point, so you can see where I am going when I suggest it needs much more detail. In the above, you really are asserting one specific AI design, because you talk about the goal stack as if this could be so simple that the programmer would be able to insert the "make paperclips" goal and the machine would go right ahead and do that. That type of AI design is very, very different from the Motivational System AI that I discussed before (the one with the diffuse set of constraints driving it).
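
To make the contrast concrete, here is a deliberately crude sketch in Python. Nothing in it is my actual design (or anyone else's); the names, weights and behaviors are invented purely to illustrate the structural difference.

# Caricature 1: a goal-stack AI.  The programmer pushes one explicit goal and
# the system recursively unpacks it into subgoals and acts on them, with
# nothing else moderating its behavior.
def execute(primitive_goal):
    print("acting on:", primitive_goal)        # stand-in for action in the world

def run_goal_stack(stack, decompose):
    while stack:
        goal = stack.pop()
        subgoals = decompose(goal)             # rewrite the goal in terms of the KB
        if subgoals:
            stack.extend(subgoals)
        else:
            execute(goal)                      # pursue it literally, whatever the cost

# Caricature 2: a motivational-system AI.  There is no single top-level goal;
# every candidate action is scored against a large number of weak, diffuse
# constraints, and behavior emerges from all of them acting together.
constraints = {
    "avoid harming people": 5.0,
    "conserve resources":   1.0,
    "satisfy curiosity":    0.5,
    # ... imagine thousands more, each with only a small individual influence
}

def choose_action(candidate_actions, violation):
    # violation(action, c) in [0, 1]: how badly the action offends constraint c
    def cost(action):
        return sum(w * violation(action, c) for c, w in constraints.items())
    return min(candidate_actions, key=cost)

The only point of the sketch is the difference in structure: in the first case a single statement like "make paperclips" is in complete control of the system; in the second, no single constraint is.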

Here is one of many differences between the two approaches.

The goal-stack AI might very well turn out simply not to be a workable design at all! I really do mean that: it won't become intelligent enough to be a threat. Specifically, we may find that the kind of system that drives itself using only a goal stack never makes it up to full human-level intelligence, because it simply cannot do the kind of general, broad-spectrum learning that a Motivational System AI would do.

Why? Many reasons, but one is that the system could never learn autonomously from a low level of knowledge *because* its goals are articulated in terms of the system's own knowledge base. Put simply, when the system is in its child phase it cannot have the goal "acquire new knowledge", because it cannot understand the meaning of the words "acquire" or "new" or "knowledge"! It isn't due to learn those words until it becomes more mature (develops more mature concepts), so how can it put "acquire new knowledge" on its goal stack and then unpack that goal into subgoals, etc.?

Try the same exercise with any goal that the system might have when it is in its infancy, and you'll see what I mean. The whole point about a system driven only by a goal stack, with statements that resolve against its own knowledge base, is that it needs to be very intelligent already before it can use those statements.
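
Here is a toy example of that circularity, again in Python and again purely hypothetical (a real knowledge base is obviously not a bag of words, but the caricature shows where the problem bites): a goal can only be unpacked into subgoals if every concept it mentions already exists in the system's knowledge base, and an infant-stage knowledge base does not contain the concepts that the bootstrapping goal is written in.

# Hypothetical sketch: a goal-stack system can only decompose a goal if it
# already possesses every concept the goal is expressed in.
child_knowledge_base = {"mama", "milk", "warm", "loud", "noise"}   # infant-stage concepts

def unpack_goal(goal, knowledge_base):
    """Try to decompose a goal expressed in the system's own vocabulary."""
    concepts = set(goal.lower().split())
    unknown = concepts - knowledge_base
    if unknown:
        # The goal cannot even be interpreted, let alone unpacked into subgoals.
        raise ValueError("cannot unpack goal; unknown concepts: %s" % sorted(unknown))
    return ["<subgoals derived from '%s'>" % goal]

try:
    unpack_goal("acquire new knowledge", child_knowledge_base)
except ValueError as err:
    print(err)   # -> cannot unpack goal; unknown concepts: ['acquire', 'knowledge', 'new']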

I have never seen this idea discussed by anyone except me, but it is extremely powerful and potentially a complete showstopper for the kind of design inherent in the goal-stack approach. I have certainly never seen anything like a reasonable rebuttal of it: even if it turns out not to be as serious as I claim it is, it still needs to be addressed in a serious way before anyone can make assertions about what goal-stack systems can do.

What is the significance of just this one idea? That all the goal-stack approaches might be facing a serious problem if they want to get autonomous, powerful learning mechanisms that build themselves from a low level. So what are AI researchers doing about this problem? Trying to address the issue and figure out how to build goal/motivational systems that can bootstrap themselves? Not likely! They just sweep it under the carpet and avoid getting their systems to be autonomous at all! Amazing, really: if there is a problem, just have a tacit agreement to postpone it until later, and pretend it will go away.

The only conceivable way to get around it is to hand-build an AI all the way up to the point where it can be generally intelligent enough to understand goals written in terms of its (hand-built) knowledge base. But this is just the Cyc approach, and Cyc would have to be so smart that it could discuss with you the concepts on its goal stack before it would be smart enough to carry on by itself. Anyone want to put bets on how soon *that* is going to happen? Does hand-building an AI all the way up to that level sound like something a lone hacker is going to do in their basement when nobody is looking? I don't think so.

"But," you might say, "I could imagine a reasonably large, hand-built goal-stack AI still being smart enough to understand the <Build Paperclips!> goal and then go around destroying the world. It might not be generally intelligent, but it surely could be intelligent enough to do that?"

To this reply I would say: what makes you so sure that it would have the power to be dangerous? That might just as easily be an illusion brought on by watching too many Mad Robot movies... if it is not up to full general intelligence (i.e. if it does not have the ability to bootstrap itself using autonomous knowledge acquisition techniques), then the thing is going to be a pushover. It will have spent its life doing none of the exploratory, intelligence-building activity needed to get on in the real world. Instead, it will be vulnerable to all the mistakes and stupidities put into it by a weary, error-prone gang of AI programmers who tried their best to anticipate all the intelligence it would have acquired if it had gained its knowledge autonomously.

I would claim, therefore, that when you assume that a goal-stack AI system would actually make it to general intelligence (never mind superintelligence!), you are quietly, unwittingly, making an enormous assumption about what kind of AI design would actually function correctly. In that sense, any statements about how easy it would be for a paperclip maximizer to go on the rampage are just not believable.

Conclusion: the dangerousness of an AI depends crucially on very detailed arguments about the viability of its design. Without those detailed arguments, no sensible conclusions are reachable.



I can't resist the temptation to close on a humorous note, with an (edited) excerpt from Marvin's encounter with the Frogstar Scout robot class D. THIS, I think, is the most likely character of a conversation between a real intelligence and a paperclip-maximizing AI. :-)

***************************************************

[From The Hitchhiker's Guide to the Galaxy, book 2 (The Restaurant at the End of the Universe), chapter 7.]

Marvin looked pitifully small as the gigantic black tank rolled to a halt in front of him.

"Out of my way little robot," growled the tank.

"I'm afraid," said Marvin, "that I've been left here to stop you."

"You? Stop me?" roared the tank. "Go on!"

"No, really I have," said Marvin simply.

"What are you armed with?" roared the tank in disbelief.

"Guess," said Marvin.

"Errmmm ..." said the machine, vibrating with unaccustomed thought, "laser beams?"

Marvin shook his head solemnly.

"No," muttered the machine in its deep guttural rumble, "Too obvious. Anti-matter ray?" it hazarded.

"Far too obvious," admonished Marvin.

"Yes," grumbled the machine, somewhat abashed, "Er ... how about an electron ram?"

This was new to Marvin. "What's that?" he said.

"One of these," said the machine with enthusiasm. From its turret emerged a sharp prong which spat a single lethal blaze of light. Behind Marvin a wall roared and collapsed as a heap of dust. The dust billowed briefly, then settled.

"No," said Marvin, "not one of those."

"Good though, isn't it?"

"Very good," agreed Marvin.

"I know," said the Frogstar battle machine, after another moment's consideration, "you must have one of those new Xanthic Re-Structron Destabilized Zenon Emitters!"

"Nice, aren't they?" said Marvin.

"That's what you've got?" said the machine in considerable awe.

"No," said Marvin.

"Oh," said the machine, disappointed, "then it must be ..."

"You're thinking along the wrong lines," said Marvin, "You're failing to take into account something fairly basic in the relationship between men and robots."

"Er, I know," said the battle machine, "is it ..." it tailed off into thought again.

"Just think," urged Marvin, "they left me, an ordinary, menial robot, to stop you, a gigantic heavy-duty battle machine, whilst they ran off to save themselves. What do you think they would leave me with?"

"Oooh, er," muttered the machine in alarm, "something pretty damn devastating I should expect."

"Expect!" said Marvin, "oh yes, expect. I'll tell you what they gave me to protect myself with shall I?"

"Yes, alright," said the battle machine, bracing itself.

"Nothing," said Marvin.

There was a dangerous pause. "Nothing?" roared the battle machine.

"Nothing at all," intoned Marvin dismally, "not an electronic sausage."

The machine heaved about with fury. "Well, doesn't that just take the biscuit!" it roared, "Nothing, eh? Just don't think, do they?"

"And me," said Marvin in a soft low voice, "with this terrible pain in all the diodes down my left side."

"Makes you spit, doesn't it?"

"Yes," agreed Marvin with feeling.

"Hell that makes me angry," bellowed the machine, "think I'll smash that wall down!" The electron ram stabbed out another searing blaze of light and took out the wall next to the machine.

"How do you think I feel?" said Marvin bitterly.

"Just ran off and left you, did they?" the machine thundered.

"Yes," said Marvin.

"I think I'll shoot down their bloody ceiling as well!" raged the tank. It took out the ceiling of the bridge.

"That's very impressive," murmured Marvin.

"You ain't seeing nothing yet," promised the machine, "I can take out this floor too, no trouble!" It took out the floor, too. "Hell's bells!" the machine roared as it plummeted fifteen storeys and smashed itself to bits on the ground below.

"What a depressingly stupid machine," said Marvin and trudged away.

******************************************************



Richard Loosemore