I guess that for AIXI to learn this sort of thing, it would have to be rewarded for understanding AIXI in general, for proving theorems about AIXI, etc. Once it had learned this, it might be able to apply this knowledge in the one-shot PD context.... But I am not sure.
ben

> -----Original Message-----
> From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED]]On Behalf Of Eliezer S. Yudkowsky
> Sent: Saturday, February 15, 2003 3:36 PM
> To: [EMAIL PROTECTED]
> Subject: Re: [agi] Breaking AIXI-tl
>
>
> Ben Goertzel wrote:
> >>AIXI-tl can learn the iterated PD, of course; just not the
> >>oneshot complex PD.
> >
> > But if it's had the right prior experience, it may have an operating program
> > that is able to deal with the oneshot complex PD... ;-)
>
> Ben, I'm not sure AIXI is capable of this. AIXI may inexorably predict
> the environment and then inexorably try to maximize reward given
> environment. The reflective realization that *your own choice* to follow
> that control procedure is correlated with a distant entity's choice not to
> cooperate with you may be beyond AIXI. If it was the iterated PD, AIXI
> would learn how a defection fails to maximize reward over time. But can
> AIXI understand, even in theory, regardless of what its internal programs
> simulate, that its top-level control function fails to maximize the a
> priori propensity of other minds with information about AIXI's internal
> state to cooperate with it, on the *one* shot PD? AIXI can't take the
> action it needs to learn the utility of...
>
> --
> Eliezer S. Yudkowsky http://singinst.org/
> Research Fellow, Singularity Institute for Artificial Intelligence
>
> -------
> To unsubscribe, change your address, or temporarily deactivate your subscription,
> please go to http://v2.listbox.com/member/?[EMAIL PROTECTED]
