KL-UCB algorithm http://arxiv.org/pdf/1102.2490v4.pdf
"Thus, KL-UCB is optimal for Bernoulli distributions and strictly dominates α-UCB for any bounded reward distributions." http://www.princeton.edu/~sbubeck/SurveyBCB12.pdf (page 18) -- Łukasz
_______________________________________________ Computer-go mailing list [email protected] http://dvandva.org/cgi-bin/mailman/listinfo/computer-go
