[agi] Optimality of using probability

Eliezer S. Yudkowsky Fri, 02 Feb 2007 20:23:55 -0800

Ben Goertzel wrote:
>
> Cox's axioms and de Finetti's subjective probability approach,
> developed in the first part of the last century, give mathematical
> arguments as to why probability theory is the optimal way to reason
> under conditions of uncertainty.  However, given limited computational
> resources, AI systems cannot always afford to reason optimally.  It is
> thus interesting to ask how Cox's or deFinetti's ideas can be extended
> to the situation of limited computational resources.  Can one show
> that, among all systems with a certain amount of resources, the most
> intelligent one will be the one whose reasoning most closely
> approximates probability theory?

I don't think a mind that evaluates probabilities *is* automatically thebest way to make use of limited computing resources. That is: if youhave limited computing resources, and you want to write a computerprogram that makes the best use of those resources to solve a problemyou're facing, then only under very *rare* circumstances does it makesense to write a program consisting of an intelligent mind that thinksin probabilities. In fact, these rare circumstances are what define AGIwork.

If you know in advance that the task is to solve a Sudoku puzzle, thenyou'll be better off writing a specialized Sudoku solver. If you knowthe exact Sudoku puzzle you face, and its solution, you can write aneven more specialized program: one that just spits out that solution. Arational use of computing power, like any rational plan, is "rational"relative to the subjective uncertainty of the programmer about theenvironment. If you already knew the exact solution, you could writedown that solution instead of writing a computer program to compute it.

If I know my program will face a problem with known statisticalstructure, then I will write a program that processes probabilitiesusing predefined calculations. That's one circumstance under which youwould want to use a program that processes probabilities - when you seea specific probabilistic calculation that optimizes the problem,relative to your beliefs about the environment.

But what if you don't even know whether your program will encounter aSudoku program, or something else entirely? What if you don't know allthe environmental entities your program might interact with, or whatmight be a good way to model them? Then you must somehow write a moregeneral program. Should this program process probabilities, even thoughwe don't know all the kinds of events it might discover and attachprobabilities to?

What state of subjective uncertainty must you, the programmer, be inwith respect to the environment, before coding a probability-processingmind is a rational use of your limited computing resources? This is howI would state the question.

Intuitively, I answer: When you, the programmer, can identify parts ofthe environmental structure; but you are extremely uncertain about otherparts of the environment; and yet you do believe there's structure (theunknown parts are not believed by you to be pure random noise). In thiscase, it makes sense to write a Probabilistic Structure Identifier andExploiter, aka, a rational mind.

Note that I specify you must understand *part of* the structure of theenvironment. You, as the programmer, have some kind of goal you aretrying to achieve by rationally using your computing power; it isdifficult to have a utility function over random noise. Your programmust *use* the unknown parts of the environmental structure to achievethat which you started out to accomplish. You have to tie in thediscovered structures to the utility differences you care about. Thisrequires that you understand explicitly how your own utility functionrelates to the environment, so you can reproduce that relation in aprogram; and this requires that you start out with some knowledge of theenvironment already.

I.e: If you don't know at least some identifying characteristics ofstarving African children, your state of knowledge does not let youwrite a program that has feeding starving African children as a "goal".In fact, there's no sense in which you yourself can be said to knowthat starving African children exist; and no way you could identify themas important if you saw them; and no way you could realize that*feeding* them might increase expected utility, once you discovered thepreviously unsuspected existence of food.

So that's the intuitive statement. I can't state this precisely as yet.It's relatively simple to make a subjective probabilistic state ofuncertainty reproduce itself in an exact corresponding calculation - toshow that if you think a particular specific event is 90% probable, thenyou want your computer program to represent it as 90% probable, giventhat it uses probabilities at all. As for justifying a genericprobability-processing system - I won't say that it's a lot harder,because I don't actually *know* that it's a lot harder, because I don'tknow exactly how to do it, and therefore I don't know yet how hard oreasy it will be. I suspect it's more complicated than the simple case,at least.

I tried to solve this problem in 2006, just in case it was easier thanit looked (it wasn't). I concluded that the problem required a fairlysophisticated mind-system to carry out the reasoning that would justifyprobabilities, so I was blocking on subparts of this mind-system that Ididn't know how to specify yet. Thus I put the problem on hold anddecided to come back to it later.

As a research program, the difficulty would be getting a researcher tosee that a nontrivial problem exists, and come up with somenon-totally-ad-hoc interesting solution, without their taking on aproblem so large that they can't solve it.

One decent-sized research problem would be scenarios in which you theprogrammer could expect utility from a program that used probabilities,in a state of programmer knowledge that *didn't* let you calculate thoseprobabilities yourself. One conceptually simple problem, that wouldstill be well worth a publication if no one has done it yet, would becalculating the expected utilities of using well-known uninformativepriors in plausible problems. But the real goal would be to justifyusing probability in cases of structural uncertainty. A simple case ofthis more difficult problem would be calculating the expected utility ofinducting a Bayesian network with unknown latent structure, known nodebehaviors (like noisy-or), known priors for network structures, anduninformative priors for the parameters. One might in this way work upto Boolean formulas, and maybe even some classes of arbitrary machines,that might be in the environment. I don't think you can do a similarcalculation for Solomonoff induction, even in principle, becauseSolomonoff is uncomputable and therefore ill-defined. For, say, Levinsearch, it might be doable; but I would be VERY impressed if anyonecould actually pull off a calculation of expected utility.

In general, I would suggest starting with the expected utility of simpleuninformative priors, and working up to more structural forms ofuncertainty. Thus, strictly justifying more and more abstract uses ofprobabilistic reasoning, as your knowledge about the environment becomesever more vague.


--
Eliezer S. Yudkowsky                          http://singinst.org/
Research Fellow, Singularity Institute for Artificial Intelligence

-----
This list is sponsored by AGIRI: http://www.agiri.org/email
To unsubscribe or change your options, please go to:
http://v2.listbox.com/member/?list_id=303

[agi] Optimality of using probability

Reply via email to