Hi all,

After a long search through the computer-go mailing list archive, and after
reading and rereading the paper by Gelly and Silver (ICML 2007), I still have
not found an answer to my question.
In this paper they introduce a way to select the next move in a given
state using the RAVE and UCT values of its children. They do this by computing

(1-beta)*Q_uct + beta*Q_rave

But, by the definition of the RAVE and UCT values, each child of a given
node may be in one of the following situations:

- both its RAVE and UCT values are defined (in this case we can compute the
above formula)

- only the RAVE value is defined (in this situation n(s,a) = 0 and the
UCT value is not defined)

- neither the RAVE nor the UCT value is defined

So my question is: how do they handle these cases when they traverse the tree?
Their scores are not always defined for every child of a node.
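
To make the question concrete, here is a small Python sketch of the three
cases as I picture them. The function name combined_value, the use of None
for "not defined", and the fallback choices (RAVE alone when n(s,a) = 0, a
constant default when nothing is defined) are only my own guesses for
illustration, not something I found in the paper.

```python
from typing import Optional


def combined_value(q_uct: Optional[float],
                   q_rave: Optional[float],
                   beta: float,
                   default: float = 0.5) -> float:
    """Score a child from its UCT and RAVE values, either of which may be
    undefined (None). The fallbacks below are assumptions, not the paper's
    method."""
    if q_uct is not None and q_rave is not None:
        # Both defined: the formula from the paper applies directly.
        return (1.0 - beta) * q_uct + beta * q_rave
    if q_rave is not None:
        # Only RAVE defined (n(s,a) = 0): guess, use the RAVE value alone.
        return q_rave
    if q_uct is not None:
        # Only UCT defined: not one of the cases listed above, handled
        # here for completeness.
        return q_uct
    # Neither defined: guess, fall back to a fixed default value.
    return default


print(combined_value(0.6, 0.4, beta=0.25))    # both values defined
print(combined_value(None, 0.4, beta=0.25))   # only RAVE defined
print(combined_value(None, None, beta=0.25))  # neither defined
```

Whether the second and third branches match what the authors actually do is
exactly what I would like to know.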

