Re: [computer-go] Positions illustrative of computer stupidity ?

Chrilly Fri, 24 Nov 2006 02:02:29 -0800

I can't help but feel we are missing something.   With UCT we miss those
wonderful beta cutoff's that you get with straightforward alpha beta
pruning - but with alpha beta pruning you are still exploring a lot more
useless nodes.    It seems the 2 are just not very compatible.

Currently UCT does seem like the best choice by far.

Yes, in the current state of the art. What would be great is to understandin

what extend this would continue to be true if we change the evaluation
function. What I believe (but this is only intuition, this have to be
proved !), is that UCT is very efficient in Go because our evaluation
function is bad. With a good/(very)^+ good evaluation function, alpha-beta
would become much better than UCT. I think this is clearly the case if you
can reach the end of the game in the tree for example.
I don't know if someone tried UCT in chess to see the results. As we often
assume that the evaluation function in chess is much better than in Go, it
would be a good test case?

I have not tried UCT for chess, but i have tried to combine UCT with theSuzie-Eval. Not to the end of the game, but up to 11 plies deep.I compared this with the standard 4 to 5 Plies Alpha-Beta search. Allexperiments were done on 19x19 (Suzie runs only at that size).First of all I had better results with iterative deepening. Searching1,3,5,7,9,11 Plies. Each depth was given a fixed node count. Only when thebest move was clearly better the next iteration depth was started earlier.It producted reasonable strategic play, some good moves, but the number ofreally bad moves was also higher than with Alpha-Beta. The increased searchdepth and the random nature of the search finds better the stupidities ofthe eval.I stopped this experiments, because I had/have the feeling that the playingbehaviour is dominated in all cases by the eval and that I should spenteffort there. Additionally with a shallow Alpha-Beta search it is easier tofind the reason of bad moves. One can go down the criticial line and canlook what went wrong. With UCT this is almost impossible.I tried out also some sort of Alpha-Cutoff. As long as the current best movein a node was good enough to refute the line, no new move was tried out. Thewhole search was a (strange) mixture of the UCT idea and Alpha-Beta.

The Suzie eval is also too slow for statistic sampling on 19x19.

In hindsight I am also from the theoretic point of view sceptical about theidea. One probably gets the Austrian-Opinion-Poll problem. It seems to bevery difficult to create an unbiased eval. It is also a problem ofweighting. One extreme value can dominate the other more realistic ones. Onehas certainly to introduce some form of robust statistics. E.g. using themedian instead of the mean or ignoring the minimum and the maximum value.



Chrilly

_______________________________________________
computer-go mailing list
[email protected]
http://www.computer-go.org/mailman/listinfo/computer-go/

Re: [computer-go] Positions illustrative of computer stupidity ?

Reply via email to