[Computer-go] Scaling, randomness and long thinking times Part I

valkyria Tue, 21 Jun 2011 01:56:31 -0700

Some more thoughts. I am splitting a long post in two.

Scaling is not the problem (please reread the first post in a previous thread for arguments).

I think the problem of MCTS is search instability due to selectivity. A couple of authors hinted at this. But I was never able to search wider without making the program weaker. I think it is very important to search deep in go. So the question is: can we search deep and wide at the same time? I think so, but not in the same search. By using results from previous searches in a way that does not hinder the selective search from doing its job, I believe progress can be made.

The problem with selective search is that it may play different moves every time the same position is searched. This is caused by the very nature of random playouts (as long as different seeds are used), because in a huge tree there are always moves that are overlooked just because the samples gave a wrong estimate of the true win rate. This is a fact of statistical sampling that cannot be overcome.
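To make the sampling argument concrete, here is a minimal sketch. It is not MCTS and not Valkyria's code: each "search" simply estimates every move's win rate from a fixed number of independent random playouts and picks the highest estimate. The function names, the win rates and the playout counts are all made up for illustration.

```python
import random

def estimate_winrate(true_winrate, playouts, rng):
    """Fraction of simulated playouts won; each playout is an independent coin flip."""
    wins = sum(1 for _ in range(playouts) if rng.random() < true_winrate)
    return wins / playouts

def pick_move(true_winrates, playouts, rng):
    """Index of the move with the highest sampled win rate (ties go to the lower index)."""
    estimates = [estimate_winrate(p, playouts, rng) for p in true_winrates]
    return max(range(len(estimates)), key=lambda i: estimates[i])

def choice_distribution(true_winrates, playouts, searches, seed=0):
    """Search the same position repeatedly with fresh samples; return how often each move is picked."""
    rng = random.Random(seed)
    counts = [0] * len(true_winrates)
    for _ in range(searches):
        counts[pick_move(true_winrates, playouts, rng)] += 1
    return [c / searches for c in counts]

# Hypothetical position: move 0 is truly better (55% vs 50%).
few = choice_distribution([0.55, 0.50], playouts=200, searches=500)
many = choice_distribution([0.55, 0.50], playouts=2000, searches=200)
print(few, many)
```

With few playouts the weaker move is still chosen in a noticeable share of the searches. With more playouts the distribution narrows toward the better move, but the error never becomes exactly impossible.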

Since we have a distribution of moves one move must be the best and one move must be the worst (ignoring cases where moves are equivalent. I also consider complexity important. Two moves may be equal with perfect play but against a non-perfect player the moves may be very different).

And even if the best move is played in 85% of the positions, in a good game there will then on average always be a couple of mistakes.
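The arithmetic behind "a couple of mistakes" can be sketched as follows. The 85% figure is from the paragraph above. The number of positions with a real choice (20 here) and the assumption that the positions are independent are mine, chosen only for illustration.

```python
def expected_mistakes(positions, p_best):
    """Expected number of non-best moves when each position is an independent trial."""
    return positions * (1 - p_best)

def prob_flawless(positions, p_best):
    """Probability that the best move is found in every one of the positions."""
    return p_best ** positions

# Hypothetical game: 20 positions with a real choice, best move found 85% of the time.
print(expected_mistakes(20, 0.85))  # about 3 mistakes on average
print(prob_flawless(20, 0.85))      # roughly a 4% chance of a game without any mistake
```

Under these assumptions a flawless game is the exception, even though each individual move is usually the best one.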

Valkyria has this problem no matter how deep it searches. There are many positions where moves are forced and there is no choice. But in high quality games on 9x9 the majority of positions offer a choice between two or more moves.

Note that this is not an argument that the program does not scale. The distribution of moves considered narrows as thinking time increases, so it scales properly. This is more an argument that we might improve search so that it scales even better.

Also there is a problem of limited memory (I am now thinking of correspondence games on Little Golem where a day of computation can be spent on a single move).

Valkyria will run out of memory in a couple of minutes. A simple fix is to prune the oldest nodes in the tree and reuse the nodes. This works very well, but it has the limitation that after a while the program will re-search pruned positions and "rediscover the wheel" over and over again.
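A minimal sketch of the prune-and-reuse idea, assuming a fixed-capacity pool that evicts the least recently visited node. This is not Valkyria's actual data structure: the class name NodePool, the position keys and the statistics fields are hypothetical. The evicted_keys set exists only to count re-searches in this demonstration; a real engine would not keep it, since it grows without bound.

```python
from collections import OrderedDict

class NodePool:
    """Fixed-size store of search nodes; the least recently visited node is recycled first."""

    def __init__(self, capacity):
        self.capacity = capacity
        self.nodes = OrderedDict()   # position key -> statistics, oldest first
        self.evicted_keys = set()    # demonstration only: remembers what was pruned
        self.re_searched = 0         # how often a pruned position had to be started over

    def visit(self, key):
        if key in self.nodes:
            self.nodes.move_to_end(key)          # mark as most recently used
        else:
            if key in self.evicted_keys:
                self.re_searched += 1            # "rediscovering the wheel"
            if len(self.nodes) >= self.capacity:
                old_key, _ = self.nodes.popitem(last=False)  # prune the oldest node
                self.evicted_keys.add(old_key)
            self.nodes[key] = {"visits": 0, "wins": 0}       # statistics start from zero
        node = self.nodes[key]
        node["visits"] += 1
        return node

pool = NodePool(capacity=2)
for key in ["a", "b", "c", "a"]:
    pool.visit(key)
print(pool.re_searched)  # "a" was pruned to make room for "c", then searched again from scratch
```

The point of the sketch is the last line: memory use stays bounded, but everything learned about a pruned position is lost and has to be recomputed when the search returns to it.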

***

In the next part I will explain what I have been experimenting with on CGOS and Little Golem for over half a year with Valkyria to overcome these issues. The goal is simply to play perfectly on 9x9.


Best
Magnus
