On 03/05/2008 12:36 PM, Mark Waser wrote:
snip...

The obvious starting point is to explicitly recognize that the point of Friendliness is that we wish to prevent the extinction of the *human race* and/or to prevent many other horrible nasty things that would make *us* unhappy. After all, this is why we believe Friendliness is so important. Unfortunately, the problem with this starting point is that it biases the search for Friendliness toward a specific type of Unfriendliness. In particular, in a later e-mail, I will show that several prominent features of Eliezer Yudkowsky's vision of Friendliness are actually distinctly Unfriendly and will directly lead to a system/situation that is less safe for humans.

One of the critically important advantages of my proposed definition/vision of Friendliness is that it is an attractor in state space. If a system finds itself outside (but necessarily somewhat/reasonably close to) an optimally Friendly state, it will actually DESIRE to reach or return to that state (and yes, I *know* that I'm going to have to prove that contention). While Eli's vision of Friendliness is certainly stable (i.e. the system won't intentionally become unfriendly), there is no "force" or desire helping it return to Friendliness if it deviates somehow due to an error or outside influence. I believe that this is a *serious* shortcoming in his vision of the extrapolation of the collective volition (and yes, this does mean both that I believe Friendliness is CEV and that I, personally, (and shortly, we collectively) can define a stable path to an attractor CEV that is provably sufficient, arguably optimal, and which should hold up under all future evolution).
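The attractor-versus-merely-stable distinction can be made concrete with a toy dynamical system. This sketch is purely illustrative (the one-dimensional state, the restoring gain, and the function names are my own assumptions, not anything from the actual proposal): an attractor state actively pulls the system back after a perturbation, while a merely stable state simply stays wherever an error leaves it.

```python
# Hypothetical toy model (not Waser's actual formalism): compare an
# attractor state, which restores itself after perturbation, with a
# merely stable state, which does not drift but never corrects either.

def step_attractor(x, target=0.0, gain=0.5):
    # Restoring dynamics: each step moves the state a fraction of the
    # way back toward the target ("Friendly") state.
    return x + gain * (target - x)

def step_merely_stable(x):
    # No restoring force: the state is preserved exactly, including any
    # deviation introduced by an error or outside influence.
    return x

def settle(step, x0, n=50):
    # Iterate the dynamics n times from an initial (perturbed) state.
    x = x0
    for _ in range(n):
        x = step(x)
    return x

perturbed = 1.0  # an error knocks both systems off the Friendly state (0.0)
print(settle(step_attractor, perturbed))      # decays back toward 0.0
print(settle(step_merely_stable, perturbed))  # remains at 1.0
```

The point of the toy is only the qualitative difference: under the attractor dynamics the deviation shrinks geometrically toward the Friendly state, while under the merely-stable dynamics the deviation persists indefinitely.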

TAKE-AWAY:  Friendliness is (and needs to be) an attractor CEV

PART 2 will describe how to create an attractor CEV and make it more obvious why you want such a thing.


!! Let the flames begin !!            :-)

1. How will the AI determine what is in the set of "horrible nasty thing[s] that would make *us* unhappy"? I guess this is related to how you will define the attractor precisely.

2. Preventing the extinction of the human race is pretty clear today, but *human race* will become increasingly fuzzy and hard to define, as will *extinction* when there are more options for existence than existence as meat. In the long term, how will the AI decide who is "*us*" in the above quote?

Thanks,

jk

-------------------------------------------
agi
Archives: http://www.listbox.com/member/archive/303/=now
RSS Feed: http://www.listbox.com/member/archive/rss/303/
