KL-UCB algorithm
http://arxiv.org/pdf/1102.2490v4.pdf

"Thus, KL-UCB is optimal for Bernoulli distributions and strictly dominates
α-UCB for any bounded reward distributions."
http://www.princeton.edu/~sbubeck/SurveyBCB12.pdf (page 18)
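
For anyone who wants to try it, here is a rough sketch of how the KL-UCB index for Bernoulli rewards (e.g. win/loss playouts) could be computed. This is my reading of the paper, not a reference implementation: the index of an arm is the largest q >= mean such that pulls * kl(mean, q) <= log(t) + c * log(log(t)), found here by bisection. The function names, the eps clamp, and the iteration count are my own choices; as far as I recall, the paper uses c = 3 in the analysis and suggests c = 0 in practice.

```python
import math

def bernoulli_kl(p, q, eps=1e-12):
    """KL divergence between Bernoulli(p) and Bernoulli(q).

    Both arguments are clamped away from 0 and 1 (eps is an assumed
    tolerance) so the logarithms stay finite.
    """
    p = min(max(p, eps), 1.0 - eps)
    q = min(max(q, eps), 1.0 - eps)
    return p * math.log(p / q) + (1.0 - p) * math.log((1.0 - p) / (1.0 - q))

def kl_ucb_index(mean, pulls, t, c=0.0, iters=50):
    """Upper confidence bound for an arm with the given empirical mean.

    mean  : empirical mean reward of the arm, in [0, 1]
    pulls : number of times the arm has been played
    t     : total number of plays so far
    c     : weight of the log(log(t)) term (assumed default 0)
    """
    if pulls == 0:
        return 1.0  # unplayed arm: most optimistic value
    log_t = math.log(max(t, 1))
    budget = log_t
    if c > 0 and log_t > 0:
        budget += c * math.log(log_t)
    budget = max(budget, 0.0) / pulls
    # kl(mean, q) is increasing in q on [mean, 1], so bisect for the
    # largest q that still satisfies kl(mean, q) <= budget.
    lo, hi = mean, 1.0
    for _ in range(iters):
        mid = (lo + hi) / 2.0
        if bernoulli_kl(mean, mid) <= budget:
            lo = mid
        else:
            hi = mid
    return lo
```

At each step you would play the arm with the largest index. The bound shrinks toward the empirical mean as an arm gets more plays, same as with plain UCB1, but it is tighter for means near 0 or 1.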

-- 
Łukasz
_______________________________________________
Computer-go mailing list
[email protected]
http://dvandva.org/cgi-bin/mailman/listinfo/computer-go
