Suppose an MC/NN program suggests a move with the supposedly highest winning probability. Due to the imperfect information for suggesting this move, I suggest to apply my human player principle "verify by reading" by verifying the suggested move by reading. This can use the following method "verification reading for probabilities", which I suggest now:

We have the current position with a winning probability and the suggested first move in the current position. Do _reading_ to quiet leaves, for which we know or determine a winning probability. A decision in the tree is a success iff it results in _at least_ the same probability [alternative: a threshold probability] as that of the current position.

I.e., the verification reading shall ensure that the winning probability does not decrease as a consequence of starting with the suggested move.

Such should prevent the program from suddenly experiencing harshly dropping probabilities during a few successive moves / mistakes of the game, such as in game 4 of the AlphaGo - Lee Sedol match.

--
robert jasiek
_______________________________________________
Computer-go mailing list
Computer-go@computer-go.org
http://computer-go.org/mailman/listinfo/computer-go

Reply via email to