No information is thrown away with maximizing win rate.

:-)

This is because in go it is only the sign of the final board count that matter.

Yes I know I am very stubborn and narrow minded on this issue.

It is a weird thing to fear throwing away "information" and then estimate the expected score which will throw away the actual distributions of outcomes. And so will any other statistical measure do.

Also I do measure the score in at least two ways in Valkyria. One is simply taking the average of score at the root and the other is to estimate territory directly by looking at black/white membership of individual points of the board. The latter is much more stable and the average is often off several points also late in the endgame.

It could also be that different programs have very different kind of playouts. I know that the playouts of Valkyria contains really weird stuff. Sometimes black wins a playout with +100 points because a lot of perfectly safe white groups died because of some really unlikely combination of bugs, omissions and randomness in the playout. Such a score does not give me information with any value.

I do agree that maybe expected score could be useful in the opening, because the opening has no systematic bias yet. In the endgame however almost every group will have a small probability of flipping state from alive to dead or dead or alive, which has a little to do with actually theoretical score of perfect play. But these probabilities will add up to something that on average close to the true value but mostly it will be very wrong.

So in short, I think win rates is the most robust thing to evaluate positions using MC playouts. And someone else have to prove me wrong! I wont do it. (But if someone do prove me wrong I will of course steal the idea and implement it).

-Magnus





Quoting Łukasz Lew <[email protected]>:

On Tue, Oct 5, 2010 at 15:34,  <[email protected]> wrote:
Quoting Łukasz Lew <[email protected]>:

Your dynamic komi results are very convincing..
But shouldn't we just concentrate on maximizing score instead of
winning rate in the beginning of the game?

Maximizing winning rate means that the probability of having a score > 0 at
the end of the game is maximum.

I want to maximize expected score of a playout.
I think that if we set komi so that around half of the playouts have score > 0
AND if the noise is large (the game is in the beginning) then
maximixing winning rate
is almost the same as maximizing score.

The drawbacks of maximizing winning rate are:
- we need to adhoc adjust komi to tell engine to concentrate on maximizing score
- we throw away information.

Of course maximizing winning rate is the right way in low-noise
conditions (endgame).


Exactly what do you mean with "maximizing score in the beginning of the
game"?
It is hard to estimate the score. Also in a game of go territory is not
everything. Aji and influence is also important. Win rate is as far as i
know the best way of capturing all these things in one measure that guides
search.

Magnus


_______________________________________________
Computer-go mailing list
[email protected]
http://dvandva.org/cgi-bin/mailman/listinfo/computer-go



--
Łukasz
_______________________________________________
Computer-go mailing list
[email protected]
http://dvandva.org/cgi-bin/mailman/listinfo/computer-go


_______________________________________________
Computer-go mailing list
[email protected]
http://dvandva.org/cgi-bin/mailman/listinfo/computer-go

Reply via email to