Peter,

I tried to reproduce this, so I gave this a whirl and the win rate against UCB-Tuned1 with first move priority of 1.1 (like Mogo) was only 33%. That was using uniform random playouts.

What was the playout policy you used for this?

Christian

On 18/06/2009 21:04, Peter Drake wrote:
An improvement on the UCB/UCT formula:

Stogin, J., Chen, Y.-P., Drake, P., and Pellegrino, S. (2009) “The Beta Distribution in the UCB Algorithm Applied to Monte-Carlo Go”. In Proceedings of the 2009 International Conference on Artificial Intelligence, CSREA Press.

http://webdisk.lclark.edu/drake/publications/BetaDistribution.pdf

Peter Drake
http://www.lclark.edu/~drake/



_______________________________________________
computer-go mailing list
[email protected]
http://www.computer-go.org/mailman/listinfo/computer-go/


_______________________________________________
computer-go mailing list
[email protected]
http://www.computer-go.org/mailman/listinfo/computer-go/

Reply via email to