Re: [Computer-go] Results from 19x19 Valkyria/1k H9 vs Valkyria/10k

valkyria Tue, 05 Oct 2010 09:40:53 -0700

No information is thrown away with maximizing win rate.

:-)

This is because in go it is only the sign of the final board countthat matter.


Yes I know I am very stubborn and narrow minded on this issue.

It is a weird thing to fear throwing away "information" and thenestimate the expected score which will throw away the actualdistributions of outcomes. And so will any other statistical measure do.

Also I do measure the score in at least two ways in Valkyria. One issimply taking the average of score at the root and the other is toestimate territory directly by looking at black/white membership ofindividual points of the board. The latter is much more stable and theaverage is often off several points also late in the endgame.

It could also be that different programs have very different kind ofplayouts. I know that the playouts of Valkyria contains really weirdstuff. Sometimes black wins a playout with +100 points because a lotof perfectly safe white groups died because of some really unlikelycombination of bugs, omissions and randomness in the playout. Such ascore does not give me information with any value.

I do agree that maybe expected score could be useful in the opening,because the opening has no systematic bias yet. In the endgame howeveralmost every group will have a small probability of flipping statefrom alive to dead or dead or alive, which has a little to do withactually theoretical score of perfect play. But these probabilitieswill add up to something that on average close to the true value butmostly it will be very wrong.

So in short, I think win rates is the most robust thing to evaluatepositions using MC playouts. And someone else have to prove me wrong!I wont do it. (But if someone do prove me wrong I will of course stealthe idea and implement it).


-Magnus





Quoting Łukasz Lew <[email protected]>:

On Tue, Oct 5, 2010 at 15:34,  <[email protected]> wrote:

Quoting Łukasz Lew <[email protected]>:

Your dynamic komi results are very convincing..
But shouldn't we just concentrate on maximizing score instead of
winning rate in the beginning of the game?


Maximizing winning rate means that the probability of having a score > 0 at
the end of the game is maximum.


I want to maximize expected score of a playout.

I think that if we set komi so that around half of the playouts havescore > 0

AND if the noise is large (the game is in the beginning) then
maximixing winning rate
is almost the same as maximizing score.

The drawbacks of maximizing winning rate are:

- we need to adhoc adjust komi to tell engine to concentrate onmaximizing score

- we throw away information.

Of course maximizing winning rate is the right way in low-noise
conditions (endgame).


Exactly what do you mean with "maximizing score in the beginning of the
game"?
It is hard to estimate the score. Also in a game of go territory is not
everything. Aji and influence is also important. Win rate is as far as i
know the best way of capturing all these things in one measure that guides
search.

Magnus


_______________________________________________
Computer-go mailing list
[email protected]
http://dvandva.org/cgi-bin/mailman/listinfo/computer-go




--
Łukasz
_______________________________________________
Computer-go mailing list
[email protected]
http://dvandva.org/cgi-bin/mailman/listinfo/computer-go



_______________________________________________
Computer-go mailing list
[email protected]
http://dvandva.org/cgi-bin/mailman/listinfo/computer-go

Re: [Computer-go] Results from 19x19 Valkyria/1k H9 vs Valkyria/10k

Reply via email to