One last rumination on dynamic komi:
The main objection against introducing dynamic komi is that it ignores the true
goal
of winning by half a point. The power of the win/loss step function as scoring
function underscores
the validity of this critique. And yet, the current behaviour of mc bots, when
either leading or trailing by a large margin, resembles random play.
The simple reason for this is that either every move is a win or every move is
a loss.
So from the perspective of securing a win, every move is as good as any other
move.
Humans know how to handle these situations. They try to catch up from behind,
or try to play safely while defending enough of a winning margin.
For a bot, there are some numerical clues when it is missbehaving.
When the calculated win rate is either very high or low and many move
candidates have almost identical win rates, the bot is in coin toss country.
A simple rule would be this: define a minimum value that has to separate the
win rate of the 2 best move candidates.
Do a "normal" search without komi.
If the minimum difference is not reached, do a new a new search with some komi,
but only allow the moves within the minimum value range from the best candidate.
Repeat this with progressively higher komi until the two best candidates are
sufficiently separated.(Or until the win rate is in a defined middle region)
There can be some traps here, a group of moves can all accomplish the same
critical goal.
But I'm sure this can be handled. The main idea is to look for a less ambitious
gloal when the true goal cannot be reached.
(Or a more ambitious goal when it is allready satisfied). By only allowing
moves that are in a statistical tie in the 0 komi search,
it can be assured that short term gain doesn't compromise the long term goal.
Stefan
_______________________________________________
computer-go mailing list
[email protected]
http://www.computer-go.org/mailman/listinfo/computer-go/