Please correct me if I'm wrong, but I thought that the influence of UCB1 and UCB1-Tuned only becomes less obvious once you combine the two components. If you look at just the RAVE success ratio component, or just the success ratio component, I believe the UCB1-Tuned formula is still present.
It is present in the ICML paper, but it was removed later. It was just useless.
