Re: [computer-go] Zero exploration?

Magnus Persson Tue, 23 Jun 2009 09:41:17 -0700

Yes, bad luck can be a problem.

Solutions:
1) RAVE/AMAF do bias good moves such that exploration take place anyway

2) Biased priors that initially forces many playouts for goodcandidates, so that bad luck becomes less likely for moves that arerated high using patterns or other means.3) One can try to bias all moves to be searched initially if one hasno patterns

Valkyria uses 1 and 2. I used to have 3 but at some pointed I testedit and it was really bad on large boards. Searching all moves at leastonce (or more) on 19x19 wastes way too much for no gain.

But if the 1) and 2) does not work well because the program is weakotherwise maybe 3 can be an option at least on small boards.

The hard part here is probable to have all these things workingsimultaneously, and when it started to do so in Valkyria it was reallyawesome! :-)

Nethertheless I some times observe some good moves not being searchedat all just because of random factors. I think there is a trade offhere. In order to get a really efficient search all of the time onehas to live with a small probability that some moves are overlookednow and then.

Also highly selective search will correct itself given enough time,because if the current best move is not good enough to win the winratewill drop towards 0 which allows other move to be searched as well.


Magnus


Quoting Peter Drake <[email protected]>:

There has been some talk here of using a zero exploration coefficient.
Does this literally mean using the win ratio (with one "dummy" win per
node) to decide paths through the MC tree? It seems that the best move
could easily be eliminated by a couple of bad runs.

Does this only work when using RAVE/AMAF?

--
Magnus Persson
Berlin, Germany
_______________________________________________
computer-go mailing list
[email protected]
http://www.computer-go.org/mailman/listinfo/computer-go/

Re: [computer-go] Zero exploration?

Reply via email to