Quoting Darren Cook <[email protected]>:
When the programs select the best move to play it also knows what move
is the *most likely best move* for the opponent given search to that point.
With speculative pondering all pondering goes into searching the reply
to that *most likely best* move.
If the opponent plays a different move, this move will on average be a
worse move than the expected one. Although there was no gain in
pondering, my program has a better than expected position (on average).
I think this is too optimistic: it is surprisingly rare that any given
board position has just one best move. E.g. I've analyzed complex 9x9
middle games (*) where the played move lead to a 0.5pt loss; in trying
to find a win I've discovered 3 or more other moves that also lead to a
0.5pt loss. (After all, if this wasn't the case MCTS wouldn't work so well.)
If we assume your opponent is roughly the same strength as you, then
when they play an unexpected one the chances are they've not made a bad
move, just an alternative good one.
You are right of course, but I think in practice MC-programs tends to
discover the move that makes search most difficult for the opponent.
(Not everytime of course but on average). This is not the same thing
as playing the theoretically best move. So even if many move are
equal, there can still be a move that has better practical winning
chances. And the best guess is the move that is best in the search
sofar.
A separate issue which probably makes things complicated is when to
stop searching. Should the program stop searching when it know for
sure what to play? Or it should it search until it knows what move to
ponder? And how does the ponder strategy interact with opponent
programs. Good results on CGOS may just mean that the program can
exploit one or more programs with a bad time allocation.
Best
Magnus
_______________________________________________
Computer-go mailing list
[email protected]
http://dvandva.org/cgi-bin/mailman/listinfo/computer-go