Re: [computer-go] Re: Optimizing UCT

Olivier Teytaud Fri, 26 Oct 2007 09:19:42 -0700

"Technical Report on MoGo" link from http://www.lri.fr/~gelly/MoGo.htm<http://www.lri.fr/%7Egelly/MoGo.htm> eventually leads tohttp://hal.inria.fr/docs/00/12/15/16/PDF/RR-6062.pdf<http://hal.inria.fr/docs/00/12/15/16/PDF/RR-6062.pdf>
First-play urgency is described in the bottom paragraph on page 16.Results are shown 4 pages later in table 7 (influence of FPU).


MoGo is very different now.

There's no UCT anymore in MoGo, as UC in UCT means "upper-confidence" asin UCB1 (but there is still Monte-Carlo planning). The upper confidenceterm is the sqrt(log(...)/n_i) term; there's nothing like that in MoGo now.

But if one keeps UCB, I agree that UCB-Tuned is better than UCB1, andfor problems

far from computer-go UCT is very efficient with its upper-confidence-term.
   Olivier
_______________________________________________
computer-go mailing list
[email protected]
http://www.computer-go.org/mailman/listinfo/computer-go/

Re: [computer-go] Re: Optimizing UCT

Reply via email to