Re: [computer-go] Allocating remaining time

Chrilly Thu, 04 Jan 2007 23:13:39 -0800

How much time should a program spend on each move?

I think this is one of the most important and also difficult questions ingame programming. Much effort is done to speed up the node-count by 10%, buta good time control is a much more effective speedup.

If my program has t milliseconds left to use in a game, and there are anestimated m moves left on the board (e.g., this many vacant spaces), onereasonable choice is t / m.

One should at least use t/(m+1). There is also a locial reason for this. Ifm is very small, especially m==1 one should have some extra time if theprogramm recognizes a problem. In this case it should search deeper.Generally this t/(m+k) should only be a target time. The final decisionshould be based on the results of the search. It is important to recognizetrivial/forced moves and to stop in this cases search earlier. If theprogramm sees a problem than it should search longer.I have made recently a simple (but strong) UCT backgammon programm. UCTgives much better information for time-control than Alpha-Beta. E.g. ifalmost all search effort is concentrated on the best move, one canreasonable conclude that its a trivial/forced move. If the eval of the bestmoves decreases in the last period constantly and there are some chancesthat the second best becomes best, one should search on....

In practice, this seems to spend too much time on early moves, which(under UCT/MC) is largely wasted time. Would it be better to usesomething like t / m**k, for some constant k? (Looking at graphs of suchfunctions, k = 1.5 seems reasonable.)

Go-Programmers like it complicated.

It would also be interesting to look at the graphs of how much timehumans spend on each move; is it usually less for the opening moves thanfor middle / endgame moves? Is there a smooth curve, or is there arelatively abrupt shift from joseki to analysis?

One should forget human behaviour. If I would have to make a Turing test -is the player human or a programm - I would not look at the moves but on thetime behaviour. The fundamental difference is that (good) humans know whenthe position is difficult and when its easy. Programms have no understandingof this at all. Humans play Chess/Go, programm make chess/Go moves.Consequently humans think for a few moves very long, and play other movesrather fast. But I think that the time-control of humans is not at alloptimal. Its very human to try to solve an urgent problem even at the riskthat it makes solving a further problem more difficult. Humans tendtherefore to get into Zeitnot.When playing against GM Adams I proposed 40 Moves in 2 hours. He proposed 40Moves in 1 hour 40 minutes plus 30 sek/move. In the first moment I could notsee the difference. In both cases one has 2 hours for 40 moves. But at move30 its different. The flag is falling there already at 1h 55 minutes. Its apsychological trick to avoid extreme Zeitnot. But if the human would have agood time-control "algorithm" there is no need for this trick. He could savethis 30 seks for himself.


Chrilly

Note: One should forget human behaviour generally. A programm is a programmis a programm.


_______________________________________________
computer-go mailing list
[email protected]
http://www.computer-go.org/mailman/listinfo/computer-go/

Re: [computer-go] Allocating remaining time

Reply via email to