>Q_MoGo(s,a) is:
>- if (self atari(s,a)): 0
>- if one pattern, among the patterns used in MoGo's simulation policy,
>matches for move "a" in position "s", then 1
>- else 0.5

Thanks for that. These values were more extreme than my expectation.
I thought you use values like 0.4 or 0.6.

This depends on the choice of n_prior. Using values of 0.4 or 0.6 with a high value of n_prior can have more effect than 0 or 1 with a small value of n_prior. Either way, it takes some time for the simulations to correct the prior knowledge. How much time depends on both the initial value (how extreme) and how much weight we give to the prior knowledge (n_prior).

-Dave

_______________________________________________
computer-go mailing list
[email protected]
http://www.computer-go.org/mailman/listinfo/computer-go/

Reply via email to