> > After the program plays all simulations, which move should it choose?
> >
> > (Wins/Visits) + SQRT(ln(...))
> >
> > or
> >
> > (Wins+Draw/2)/Visits + SQRT(ln(...))
>
> Neither of these two formulas :-)
> These formulas are for choosing moves to be simulated. For turn-based games,
> when all simulations are finished, we should choose
> move = argmax_m number_of_simulations(m)

Olivier, I'm still confused. What should the formula for the final decision look like? This is very important, because if we don't choose the right move to play, all the previous work is useless.

Thank you.
_______________________________________________
Computer-go mailing list
[email protected]
http://dvandva.org/cgi-bin/mailman/listinfo/computer-go
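
For what it's worth, here is a minimal sketch of how the quoted rule, move = argmax_m number_of_simulations(m), might look in code. The `ChildStats` container, the `choose_final_move` function, and the move names and counts are all hypothetical, made up for illustration and not taken from any particular program. The point is only that the final decision reads the visit counts and does not use the exploration formula at all.

```python
from dataclasses import dataclass


@dataclass
class ChildStats:
    """Statistics for one candidate move at the root (hypothetical container)."""
    move: str
    visits: int   # number of simulations that went through this move
    wins: float   # a draw could be counted as 0.5, as in the second quoted formula


def choose_final_move(children):
    """Return the move with the most simulations: argmax_m number_of_simulations(m)."""
    if not children:
        raise ValueError("no candidate moves")
    return max(children, key=lambda c: c.visits).move


# Made-up numbers: K10 has the best win rate but very few simulations.
root_children = [
    ChildStats(move="D4", visits=620, wins=341.0),
    ChildStats(move="Q16", visits=350, wins=180.5),
    ChildStats(move="K10", visits=30, wins=21.0),
]

print(choose_final_move(root_children))
```

With these made-up numbers the most-simulated move (D4) is not the one with the highest raw win rate (K10), so the two criteria can disagree; the quoted advice is to trust the simulation count.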
