Re: [Computer-go] Exploration formulas for UCT

2011-01-02 Thread Petr Baudis
Hi! On Sun, Jan 02, 2011 at 03:53:32PM +0800, Aja wrote: I guess it should be not * 3000 but / 3000. Zen also uses this type of formula, but the constant value is rather small. I use 400 for the latest version of Zen. If you are right, then it makes sense. For /3000, bias is around

Re: [Computer-go] Exploration formulas for UCT

2011-01-01 Thread Aja
Hi Petr, We use the Silver formula:
    rave_visits / (rave_visits + real_visits + rave_visits * real_visits * 3000)
The figure of 3000 is surprisingly resilient. Even with radically different heuristics and playouts, it stays the empirical optimum. Interesting. According to Sylvain's
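
For readers following the "* 3000" versus "/ 3000" question in this thread, here is a minimal sketch of the weight given by the formula quoted above, computed both ways. The function name rave_beta and the parameters equiv and divide are made up for illustration; this is not code from Pachi, Zen, or any other program mentioned here, and it assumes the result is used as the weight on the RAVE estimate.

```python
def rave_beta(rave_visits, real_visits, equiv=3000.0, divide=True):
    """Weight on the RAVE estimate, following the formula quoted in the thread.

    divide=True  uses rave_visits * real_visits / equiv (the suggested correction).
    divide=False uses rave_visits * real_visits * equiv (the formula as first posted).
    Both the name `equiv` and the `divide` switch are illustrative assumptions.
    """
    if rave_visits == 0:
        # No RAVE samples yet, so the RAVE estimate gets no weight.
        return 0.0
    product = rave_visits * real_visits
    cross = product / equiv if divide else product * equiv
    return rave_visits / (rave_visits + real_visits + cross)


if __name__ == "__main__":
    # Compare the two readings for a few visit counts.
    for n in (10, 100, 3000):
        print(n, rave_beta(n, n, divide=True), rave_beta(n, n, divide=False))
```

With the "/ 3000" reading the weight falls off gradually as real visits accumulate (it is exactly 1/3 when both counts equal 3000), whereas with "* 3000" it collapses to nearly zero after the first real visit.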