My original example was unrealistic and on the extreme side to make a point.
However if there are nodes with say 7/10, 12/20, and 50/100 how should they be
ranked? In some sense, the first one seems promising since we've only searched
just a few nodes, yet we are mainly seeing wins (granted,
On Thu, Jul 03, 2014 at 06:18:32AM -0700, Greg Schmidt wrote:
Perhaps there is an argument that the UCB formula won't generally let this
happen since it takes into consideration both win rate and tries to increase
confidence by promoting the visit of nodes with low visit counts. Still, I