> I suspect that for very long time controls we would be better off
> turning UCT (with, say 10K playouts)  into an evaluation function and
> then using alpha-beta on top of it.
>
> Álvaro.
>
I did do a study once with pure alpha beta where I used play-outs as my
evaluation function.  Interestingly,  what seems to be the case is that
as you search deeper,  you need less play-outs.    You can measure this
by plotting different depths and play-out counts and try to maximize the
strength vs time ratio.  

It also turns out that it is difficult to get a search to go fast if you
do not have enough play-outs.    Sometimes it was faster to do twice as
many play-outs.    This is because of move ordering issues.  A small
number of play-outs really hurts the number of beta cutoffs you can
expect to get because of scoring uncertainties.

With heavy play-outs,  this would likely be less of a factor.

- Don



_______________________________________________
computer-go mailing list
computer-go@computer-go.org
http://www.computer-go.org/mailman/listinfo/computer-go/

Reply via email to