Edward W. Porter wrote:
Richard, in your November 2, 2007 11:15 AM post you stated:
“If AI systems are built with motivation systems that are stable, then
we could predict that they will remain synchronized with the goals of
the human race until the end of history.”
and
“I can think of many, many types of non-goal-stack motivational systems
for which [Matt’s statement about the inherent instability of goal
systems of recursively self improving AGIs] is a complete falsehood.”
In your 11/3/2007 1:17 PM post you described what I assume to be such a
supposedly stable "non-goal-stack motivational system" as follows:
“consider the motivational system of the
best kind of AGI: it is motivated by a balanced set of desires that
include the desire to explore and learn, and empathy for the human
species. By definition, I would think, this simple cluster of desires
and empathic motivations *are* the things that "give it pleasure".”
and
“I think that in general, making the AGI as similar to us as possible
(but without the aggressive and dangerous motivations that we are
victims of) would be a good idea simply because we want them to start
out with a strong empathy for us, and we want them to stay that way.”
I think this type of motivational system makes a lot of sense, but for
all the reasons stated in my Fri 11/2/2007 2:07 PM post (arguments you
have not responded to), as well as many other reasons, it does not
appear at all certain that such a motivational system would reliably
remain stable and “synchronized with the goals of the human race until
the end of history,” as you claim.
For example, humans might, for short-sighted personal gain (such as when
using them in weapon systems)
Whoaa! You assume that it would be possible to "use" an AGI for
personal gain, or in a weapon system. If it starts out with the
supposed empathic motivational system, it would not allow this.
or accidentally alter such a motivational
system.
Again, under the assumption, it would not allow such "accidental"
alteration.
Or over time the inherent biases that were designed to make
AGIs have empathy for humans might cause them to have empathy for some
humans more than others
As part of its initial (assumed) empathy, it will set up mechanisms to
monitor such things. It could not possibly start having "empathy for
some humans more than others" without *also* being aware of the fact
that, by being so biased, it was in conflict with its general
motivation. So it would not do such a thing. (We have to remember not
to assume it would be both superintelligent and also susceptible to such
easily-caught problems.)
, or might cause them to make decisions that they
think are in our best interest, but that are not.
Again, this assumes (implicitly) that it would be both generally and
broadly empathic -- which means sensitive to our wishes -- and at the
same time, for some inexplicable reason, decide to do something that it
thinks is in our best interests without consulting us. This effectively
assumes that it would spontaneously *stop* being empathic, without
explaining how that could happen.
Or perhaps AGI robots
would begin to embody the "human features" that they have been taught to
be empathetic to better than people do. Etc.
Does this mean things like beginning to get aggressive, or jealous, etc.?
This is where the technical characteristics of a motivational system
become important: this kind of drift would be impossible unless the
motivational system already had aggressiveness modules built in (which
is not the case, by assumption).
The world is too complicated and is going to change too rapidly in the
next one hundred, one thousand, or ten thousand years for any goal
system designed circa 2015 to remain appropriate until the end of
history – unless history ends pretty soon.
Not true: the statement was that it would stay empathic to our motivations.
Only if the goal system were particularly rigid would this be a problem,
and by assumption I am talking about motivational systems that are
stable (diffuse systems, along the lines of my previously mentioned post).
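To make the goal-stack vs. diffuse distinction concrete, here is a toy
sketch (purely illustrative -- the drive names, weights, and scoring
scheme below are my own invention, not part of any actual AGI design): a
goal-stack agent pursues only the goal on top of its stack, while a
diffuse agent scores every candidate action against its whole balanced
set of drives at once, so a strong empathy penalty can veto an otherwise
attractive action.

```python
# Toy illustration only: contrasts a rigid goal-stack chooser with a
# "diffuse" chooser that balances several drives simultaneously.
# All drive names, weights, and scores are invented for this sketch.

def goal_stack_choice(actions, stack):
    """Pursue only the goal on top of the stack; ignore all other drives."""
    top_goal = stack[-1]
    return max(actions, key=lambda a: a["scores"].get(top_goal, 0.0))

def diffuse_choice(actions, drives):
    """Score every action against the whole weighted set of drives."""
    def total(a):
        return sum(w * a["scores"].get(d, 0.0) for d, w in drives.items())
    return max(actions, key=total)

actions = [
    {"name": "build weapon", "scores": {"explore": 0.9, "empathy": -1.0}},
    {"name": "teach humans", "scores": {"explore": 0.5, "empathy": 0.8}},
]

# With "explore" on top of the stack, the goal-stack agent picks the
# harmful action, because nothing else is consulted:
print(goal_stack_choice(actions, stack=["empathy", "explore"])["name"])
# -> build weapon

# The diffuse agent weighs empathy alongside exploration, so the
# empathy penalty (-1.0) outweighs the exploration gain (0.9):
drives = {"explore": 1.0, "empathy": 1.0}
print(diffuse_choice(actions, drives)["name"])
# -> teach humans
```

The point of the sketch is only that in the diffuse case no single drive
can be "hijacked" in isolation: any drift away from empathy shows up
immediately in every choice the system scores.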
If I am wrong I would appreciate the enlightenment and increased hope
that would come with being shown how I am wrong.
I apologize for giving too brief answers to these questions. I have too
much stuff that is not written out in long form.
Richard Loosemore
Ed Porter
-----
This list is sponsored by AGIRI: http://www.agiri.org/email