I believe we used a uniform random policy (only "don't play in your
own pseudoeyes").
The numbers probably won't be the same, but we've certainly replicated
the qualitative improvement with version 6.05 of Orego, available here:
https://webdisk.lclark.edu/drake/orego/
Peter Drake
http://www.lclark.edu/~drake/
On Jun 23, 2009, at 9:24 AM, Christian Nentwich wrote:
Peter,
I tried to reproduce this, so I gave this a whirl and the win rate
against UCB-Tuned1 with first move priority of 1.1 (like Mogo) was
only 33%. That was using uniform random playouts.
What was the playout policy you used for this?
Christian
On 18/06/2009 21:04, Peter Drake wrote:
An improvement on the UCB/UCT formula:
Stogin, J., Chen, Y.-P., Drake, P., and Pellegrino, S. (2009) “The
Beta Distribution in the UCB Algorithm Applied to Monte-Carlo Go”.
In Proceedings of the 2009 International Conference on Artificial
Intelligence, CSREA Press.
http://webdisk.lclark.edu/drake/publications/BetaDistribution.pdf
Peter Drake
http://www.lclark.edu/~drake/
_______________________________________________
computer-go mailing list
[email protected]
http://www.computer-go.org/mailman/listinfo/computer-go/
_______________________________________________
computer-go mailing list
[email protected]
http://www.computer-go.org/mailman/listinfo/computer-go/
_______________________________________________
computer-go mailing list
[email protected]
http://www.computer-go.org/mailman/listinfo/computer-go/