On 03/07/2008 03:20 PM, Mark Waser wrote:
>> For there to be another attractor F', it would of necessity have to be
>> an attractor that is not desirable to us, since you said there is only
>> one stable attractor for us that has the desired characteristics.
> Uh, no. I am not claiming that there is *ONLY* one unique attractor (that has the desired characteristics). I am merely saying that there is *AT LEAST* one describable, reachable, stable attractor that has the characteristics that we want. (Note: I've clarified a previous statement by adding the *ONLY* and *AT LEAST* and the parenthetical expression "that has the desired characteristics".)

Okay, got it now. At least one, not exactly one.
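
For intuition, here's a toy example of my own (nothing from your framework, just the textbook picture): even the simplest gradient system can have more than one stable attractor, so exhibiting one stable attractor with the desired characteristics says nothing about uniqueness.

# Toy illustration (mine, not Mark's model): gradient descent on
# V(x) = (x**2 - 1)**2, which has two stable attractors, x = -1 and x = +1.
def step(x, dt=0.01):
    dVdx = 4 * x * (x**2 - 1)   # V'(x)
    return x - dt * dVdx

for x0 in (-2.0, -0.5, 0.5, 2.0):
    x = x0
    for _ in range(10000):
        x = step(x)
    print("start %+.1f -> settles near %+.3f" % (x0, x))

Starts left of zero settle at -1 and starts right of zero settle at +1: two perfectly stable attractors in one system, so "at least one" and "exactly one" really do come apart.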

> I really don't like the particular quantifier "rather minimal". I would argue (and will later attempt to prove) that the constraints are still actually as close to Friendly as rationally possible because that is the most rational way to move non-Friendlies to a Friendly status (which is a major Friendliness goal that I'll be getting to shortly). The Friendly will indeed "have no qualms about kicking ass and inflicting pain *where necessary*" but the "where necessary" clause is critically important since a Friendly shouldn't resort to this (even for Unfriendlies) until it is truly necessary.

Fair enough. "rather minimal" is much too strong a phrase.
>> I think you're fudging a bit here. If we are only likely to occupy the
>> circumstance space with probability less than 1, then the intentional
>> destruction of the human race is not 'most certainly ruled out': it is
>> ruled out with very high probability less than 1. I'm not trying to say
>> it's likely; only that it's possible. I make this point to distinguish
>> your approach from other approaches that purport to make absolute
>> guarantees about certain things (as in some ethical systems where
>> certain things are *always* wrong, regardless of context or circumstance).
> Um. I think that we're in violent agreement. I'm not quite sure where you think I'm fudging.

The reason I thought you were fudging was that I thought you were saying both that it is absolutely certain that the AI will never turn the planet into computronium and upload us *AND* that there are no absolute guarantees. I guess I was misled when I read "given the circumstance space that we are likely to occupy with a huge certainty, the intentional destruction of the human race is most certainly ruled out" as meaning 'turning earth into computronium is certainly ruled out'. It's only certainly ruled out *assuming* we stay within the highly likely region of circumstance space. So yeah, I guess we do agree.
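
To pin down exactly what we agree on, here's the law-of-total-probability version (every number below is made up by me, purely for illustration):

# Made-up numbers: "ruled out inside the likely region" != "ruled out outright".
p_likely_region   = 1 - 1e-9   # we almost surely stay in familiar circumstances
p_bad_if_likely   = 0.0        # destruction ruled out *within* that region
p_bad_if_unlikely = 0.01       # no guarantee outside it
p_bad = (p_bad_if_likely * p_likely_region
         + p_bad_if_unlikely * (1 - p_likely_region))
print(p_bad)   # ~1e-11: tiny, but not zero, so no absolute guarantee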

This raises another point for me though. In another post (2008-03-06 14:36) you said:

"It would *NOT* be Friendly if I have a goal that I not be turned into computronium even if <your clause> (which I hereby state that I do)"

Yet, if I understand our recent exchange correctly, it is possible for this to occur and be a Friendly action regardless of what sub-goals I may or may not have. (It's just extremely unlikely given ..., which is an important distinction.) It would be nice to have some ballpark probability estimates, though, to know what we mean by extremely unlikely. 10^-6 is a very different beast than 10^-1000.
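
To put rough numbers on that (the 10**18 decision count below is an arbitrary assumption of mine): over N independent chances, the expected number of occurrences is p*N, and it's easiest to compare in log10 space since 10^-1000 underflows an ordinary float to zero.

import math

# Illustrative only: N is a made-up number of "relevant decisions".
def log10_expected(log10_p, n):
    return log10_p + math.log10(n)   # log10(p * n) = log10(p) + log10(n)

N = 10**18
for log10_p in (-6, -1000):
    print("p = 10^%d: expected occurrences ~ 10^%.0f"
          % (log10_p, log10_expected(log10_p, N)))

At 10^-6 the event is all but certain to happen eventually (about 10^12 expected occurrences here); at 10^-1000 it never happens for any physically plausible N. Those are morally very different situations, which is why the ballpark matters.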


>> I don't think it's inflammatory or a case of garbage in to contemplate
>> that all of humanity could be wrong. For much of our history, there have
>> been things that *every single human was wrong about*. This is merely
>> the assertion that we can't make guarantees about what vastly superior
>> f-beings will find to be the case. We may one day outgrow our attachment
>> to meatspace, and we may be wrong in our belief that everything
>> essential can be preserved in meatspace, but we might not be at that
>> point yet when the AI has to make the decision.
> Why would the AI *have* to make the decision? It shouldn't be for its own convenience. The only circumstance that I could think of where the AI should make such a decision *for us* over our objections is if we would be destroyed otherwise (but there was no way for it to convince us of this fact before the destruction was inevitable).
It might not *have* to. I'm only saying it's possible. And it would almost certainly be for some circumstance that has not occurred to us, so I can't give you a specific scenario. Not being able to find such a scenario is different, though, from there not actually being one. In order to believe the latter, a proof is required.
>> Yes, when you talk about Friendliness as that distant attractor, it
>> starts to sound an awful lot like "enlightenment", where self-interest
>> is one aspect of that enlightenment, and friendly behavior is another
>> aspect.
> Argh! I would argue that Friendliness is *not* that distant. Can't you see how the attractor that I'm describing is both self-interest and Friendly because *ultimately they are the same thing*? (OK, so maybe that *IS* enlightenment :-)
Well, I was thinking of the region of state space close to the attractor as being a sort of "approaching perfection" region in terms of certain desirable qualities and capabilities, and I don't think we're really close to that. Having said that, I'm by temperament a pessimist and a skeptic, but I would go along with "heading in the right direction".

-joseph
