Re: [computer-go] More UCT / Monte-Carlo questions

Jason House Tue, 05 Feb 2008 10:26:13 -0800

On Feb 5, 2008 1:15 PM, Olivier Teytaud <[EMAIL PROTECTED]> wrote:

> > I think mogo is the only strong program that uses the UCB1-tuned
> formula.
> > The others use the same formula you use.  I found a thesis where they
> > measured many different formulas and found little difference.   If any
> > strong program other than mogo uses some formula other than the basic
> one,
> > can you please let us know?
> >
>
> MoGo does not use the UCB1 formula or the UCB1-Tuned formula
> (see ICML paper from Gelly and Silver). Also, now, the formula
> has been tuned again and again and does not look anymore like
> something I could explain :-) this is almost genetic programming :-)
>
> Basically, the formula in MoGo combines the success ratio and the
> RAVE-success ratio, with more focus on the success ratio when the
> number of simulations is large.



Please correct me if I'm wrong, but I thought:
  Only once you combine the two components does the influence of UCB1 and
UCB1-tuned become less obvious.  If you look at just the RAVE success ratio
component, or just the success ratio component, I believe the UCB1-Tuned
formula is still present.

_______________________________________________
computer-go mailing list
[email protected]
http://www.computer-go.org/mailman/listinfo/computer-go/

Re: [computer-go] More UCT / Monte-Carlo questions

Reply via email to