Peter,
I tried to reproduce this, so I gave this a whirl and the win rate
against UCB-Tuned1 with first move priority of 1.1 (like Mogo) was only
33%. That was using uniform random playouts.
What was the playout policy you used for this?
Christian
On 18/06/2009 21:04, Peter Drake wrote:
An improvement on the UCB/UCT formula:
Stogin, J., Chen, Y.-P., Drake, P., and Pellegrino, S. (2009) “The
Beta Distribution in the UCB Algorithm Applied to Monte-Carlo Go”. In
Proceedings of the 2009 International Conference on Artificial
Intelligence, CSREA Press.
http://webdisk.lclark.edu/drake/publications/BetaDistribution.pdf
Peter Drake
http://www.lclark.edu/~drake/
_______________________________________________
computer-go mailing list
[email protected]
http://www.computer-go.org/mailman/listinfo/computer-go/
_______________________________________________
computer-go mailing list
[email protected]
http://www.computer-go.org/mailman/listinfo/computer-go/