Magnus Persson wrote:
Quoting Jason House <[EMAIL PROTECTED]>:

Does anyone have any data on just how optimistic or pessimistic the results would be? I'd like to use some heuristics that inherit winning percentages from a parent node to bias the expected winning percentage of the children nodes... and maybe pruning away portions of a search in a fixed depth monte
carlo search with iterative deepening.

This is confusing to me. Are you asking how UCT behaves in order to implement
something which is not UCT?

I'd say that I want to understand the nature of monte carlo simulations... not specifically UCT. You are absolutely correct that I'm trying to cook up something which is not UCT.


Anyway, UCT scores has the property that the score of a node changes very slowly when it is searched deep.

For my theoretical analysis, one unfortunate property of UCT is that many playouts from a set starting position means that the search tree gets expanded and that the measured winning rate is some kind of average of elements deeper in the tree. My original post asked about "monte carlo" in an attempt to avoid that issue.
But does this mean that sibling scores can be compared
to each other? I would hesitate here because the scores change as a function of the search. In difficult positions the scores for all good moves are often very similar. The scores often move up or down slowly together as function of if
the position is good or not.

Actually, "the scores moving up or down slowly together as function of if the position is good or not" is exactly the type of thing I'm trying to gauge. Essentially, I'd like to predict the performance of an unexplored branch based on available data. Not an attempt to prune it away, but rather to figure out when exploration of those branches should occur.

I attach an sgf file where i added the principal variation of Valkyria after the black opening move of the center point using 5 minutes of thinking time. For each move I give the winning percentage and the number of playouts that passed this node. For the second move of black I also give the winning percentage of
all siblings. The imortant thing here is that many of those move are only
searched 1000 times whereas the few best move were searched 1000000 times, but
there is still not much of a difference in the actual scores.

Thanks! Taking a quick look, I see the following winning percentages for the principle moves:
B: 54.3%
W:         46%
B: 54.6%
W:         46.3%
B: 54.7%
W:         47.6%
B: 54.9%
W:         47.0%

Looking at a single color, the winning percentage seems to shift by 0.2 to 0.4%... About what I'd expect to see. What confuses me though is how to interpret the jump back and forth as the color changes (about 8%). Are the percentages always the winning percentage for black? Or is it the winning percentage for the color to move?

If it's the winning percentage for the color to move, it seems really strange that it'd go up for both colors as the principle variation went on.

If it's the winning percentage for black, why does it vary so drastically?
_______________________________________________
computer-go mailing list
[email protected]
http://www.computer-go.org/mailman/listinfo/computer-go/

Reply via email to