Please correct me if I'm wrong, but I thought:
 Only once you combine the two components does the influence of UCB1 and
UCB1-tuned become less obvious.  If you look at just the RAVE success ratio
component, or just the success ratio component, I believe the UCB1-Tuned
formula is still present.


In the ICML paper it is present; but it has been removed later. It was
just useless.
_______________________________________________
computer-go mailing list
[email protected]
http://www.computer-go.org/mailman/listinfo/computer-go/

Reply via email to