Re: [computer-go] Monte-Carlo Go simulation

alain Baeckeroot Thu, 08 Feb 2007 12:03:41 -0800

On Thursday 8 February 2007 17:06, Sylvain Gelly wrote:
> Hello,
> 
> > Is there any known (by theory or tests) function of how much an increase
> > in the strength of the simulation policy increases the strength of the
> > MC/UCT Program as a whole?
> 
> I think that is a very interesting question.
> In our work on MoGo we found that there could be a decrease of the
> strength of the MC/UCT program while using a stronger simulation
> policy. That is why MoGo relies more on the "sequence idea" than on the
> "strength idea". Our best simulation policy is quite weak compared to
> others we tested.
> But we have further experiments, in joint work with David Silver from the
> University of Alberta. We found that the relation "strong
> simulation policy" <=> "strong MC program" is wrong at a much larger
> scale. So the "intransitivity" holds even with much, much stronger
> simulation policies.
> 
One simple explanation could be that a random player shamelessly tries "all"
moves (very bad ones, but also very nice tesuji), whereas the "stronger" player
is restricted by its knowledge and will always miss some kinds of moves.
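
To make the idea concrete, here is a minimal toy sketch in Python. It is not
MoGo's policy or any real engine: the position, the move numbers, the TESUJI
constant and the PRUNED set are all made up for illustration. The toy position
has ten legal moves and only one of them (the tesuji) wins. A uniform policy
plays it now and then, so the playouts do see the win. A policy whose
"knowledge" prunes that move never sees it, however many playouts are run.

```python
import random

# Hypothetical toy position: ten legal moves, only one of them wins.
MOVES = list(range(10))
TESUJI = 7            # assumption: the single winning move
PRUNED = {7, 8, 9}    # assumption: moves the "knowledgeable" policy rejects

def uniform_policy(rng):
    """Pick any legal move with equal probability."""
    return rng.choice(MOVES)

def restricted_policy(rng):
    """Pick uniformly among the moves that survive the knowledge filter."""
    return rng.choice([m for m in MOVES if m not in PRUNED])

def estimate_win_rate(policy, n_playouts, seed=0):
    """Fraction of one-move playouts in which the policy finds the tesuji."""
    rng = random.Random(seed)
    wins = sum(policy(rng) == TESUJI for _ in range(n_playouts))
    return wins / n_playouts

random_estimate = estimate_win_rate(uniform_policy, 10_000)
restricted_estimate = estimate_win_rate(restricted_policy, 10_000)
print("uniform policy:   ", random_estimate)
print("restricted policy:", restricted_estimate)
```

The uniform estimate comes out near 0.1, while the restricted estimate is
exactly 0.0: more playouts reduce the noise of the first, but cannot repair the
blind spot of the second.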


Similar results were reported by D. Doshay with SlugGo, which is limited by the
underlying GNU Go, no matter how deep or wide it searches.

Alain
_______________________________________________
computer-go mailing list
computer-go@computer-go.org
http://www.computer-go.org/mailman/listinfo/computer-go/
