I've been wrapping my head about dynamic komi adjustments for MCTS,
namely on the thesis by the Pachi creator, Petr Baudic.
On value-based situational compensation the author uses the average on
win rates from the previous simulations to decide whether or not to
change the komi. But I don't see how this criteria makes sense, if we're
interested in finding the best play, shouldn't we be trying to have good
sensibility around the best plays? Trying to average the game only
worsens the ability of the search to differentiate the best contenders.
Am I seeing this wrong? Has this been addressed before? What do other
engines do?
- Gonçalo F.
_______________________________________________
Computer-go mailing list
[email protected]
http://computer-go.org/mailman/listinfo/computer-go