Re: [computer-go] Depth dependent evaluation effects on monte carlo searches

Darren Cook Sat, 09 Jun 2007 00:42:32 -0700

> Let's take a basic example of a leaf node in an MC search tree that
> hasn't been expanded, but has 4 children.  Let's say that random
> simulation through the children have winning percentages of {46%, 51%,
> 47%, 48%}.  Assuming a uniform simulation policy, the winning percentage
> would be the average of the four, or 48%... but when that node gets
> expanded, it'll start discovering the winning percentages of the
> children nodes.  Now when we look at the node that used to be our leaf
> node and ask what it's winning percentage would be, we come to a
> different conclusion...  The winning percentage is either 46% or 51%
> depending on if the color to move.
> 
> In this example, the difference between the 0-ply and the 1-play is 3%
> (51%-48%) or 2% (48%-46%) depending on what the color to move was.


I hope someone else will answer as I'm confused what you saying. In the
tree a node is either for black to move or white to move, but the
winning percentage is always for the side we are playing in the game. So
expanding out a node just increases the accuracy of the winning
percentage estimate at that node.

Darren

_______________________________________________
computer-go mailing list
[email protected]
http://www.computer-go.org/mailman/listinfo/computer-go/

Re: [computer-go] Depth dependent evaluation effects on monte carlo searches

Reply via email to