I am very confused about the new UCT-RAVE formula.
The equation 9 seems to mean:

variance_u = value_ur * (1 - value_ur) / n.

Is it wrong?  If correct, why is it the variance?
I think that the variance of the UCT should be:

variance_u = value_u * (1 - value_u).

Why cannot we use that?

Anyway, can anyone write the pseudo-code of this algorithm?

--
Yamato
_______________________________________________
computer-go mailing list
[email protected]
http://www.computer-go.org/mailman/listinfo/computer-go/

Reply via email to