Re: [computer-go] New scalability study : show uncertainty ?

Jacques Basaldúa Wed, 23 Jan 2008 05:22:08 -0800

I don't think "only uniformly random playouts will scale toperfection" because what we need for playouts is not just a simpleaverage of final scores but a maximum (in negmax sense) score. Itshould be the perfect evaluation function.

In other words, as MC simulation is a way to get an average of avalue, when applying it to optimization problems we need some way tofocus the simulations to the _peak_ in a state space.

It may be obvious when one consideres L&D problems where the best movethat leads to the maximum score (live) is only one and all other movesare bad. At such positions it's almost no sense to simulate all legalmoves with same probability. So, IMHO, biasing simulations is notjust a speed-up technique but is essentially important.


I agree, but what I meant about uniformly random playouts is the following:
What makes a move outstanding is being unpredictable. For a total novice,
playing at the key point of a bulky five may look like a touch of genius,
but when you learn a little, its an obvious move. The difference between a

5p and a 9p may be one or two moves nobody can predict (except a 9p). Whenwe add knowledge we find the _ordinary_ good moves faster, we make weakermoves less probable, but that comes at a price, the price of making outstandingunpredictable moves less probable also. Perhaps that introduces a ceiling.I thought that was what you were also pointing. Of course, I don't claimuniformly random playouts are good, I just claim that they should (just as aninfeasible theoretic argument) scale to perfection, of course that scalingdoesn't have to be linear.


Jacques.


_______________________________________________
computer-go mailing list
[email protected]
http://www.computer-go.org/mailman/listinfo/computer-go/

Re: [computer-go] New scalability study : show uncertainty ?

Reply via email to