> After the program plays all simulations, which move should it choose?
>
> (Wins/Visits) + SQRT(ln(...))
>
> or
>
> (Wins+Draw/2)/Visits + SQRT(ln(...))
>
>
> Neither of these two formulas :-)
> These formulas are for choosing which moves to simulate. For turn-based games,
> when all simulations are finished, we should choose

> move = argmax_m number_of_simulations(m)
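
For reference, the quoted rule can be sketched in a few lines of Python. This is only a minimal sketch under stated assumptions: `MoveStats`, `final_move`, and the move names and counts are hypothetical and are not taken from any particular program. It shows that the final choice depends only on how many simulations each root move received, not on the win-rate formulas used during the search.

```python
from dataclasses import dataclass


@dataclass
class MoveStats:
    """Per-move statistics at the root (hypothetical structure)."""
    wins: float   # accumulated reward; how draws are scored is an assumption
    visits: int   # number of simulations that went through this move


def final_move(children):
    """Return the root move with the most simulations (argmax over visits)."""
    return max(children, key=lambda move: children[move].visits)


# Illustrative numbers only: Q16 has the higher win rate (9/10),
# but D4 was simulated far more often, so D4 is the move played.
children = {
    "D4": MoveStats(wins=60, visits=100),
    "Q16": MoveStats(wins=9, visits=10),
}
print(final_move(children))
```

A move with a high win rate but few simulations has an unreliable estimate, which is one common reason given for preferring the visit count at the final decision.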



Olivier, I'm still confused. What should the formula for the final
decision look like? This is very important, because if we don't choose the
right move to play, all the previous work is useless.

Thank you.

_______________________________________________
Computer-go mailing list
[email protected]
http://dvandva.org/cgi-bin/mailman/listinfo/computer-go
