Re: [computer-go] Time weighting in opening

Christian Nentwich Sun, 24 May 2009 03:04:33 -0700

Jason,

I used this time management code on CGOS and for off-line testtournaments so far. I cannot claim that I have made additional effortsto model/address network lag in it, so I don't yet know what else I needto do with KGS, etc. Perhaps a percentage applied to the result computedby the curve, for safety, or perhaps subtract two seconds from the timeestimate sent by the server?

You're right about the initial move estimates, it's just another thing Ioverlooked. The initial move estimate is (9 x 9 x 1.6) / 2. However,later in the game the initial move estimate (when the original fallsbelow a threshold) is set to all playable empty points, not half. Thisis because you cannot let your opponent force you to run out of time bypassing when playing with Tromp Taylor rules. It's not a problemusually, because by the time you get to the threshold, you are playingfilling moves.

I am actually not convinced that I have the right approach to estimatingthe right number of moves left, but the time curve itself works well.Refining the move estimation is a nicer problem to have. Because thecurve equates the time under it to the total time available, it can playoptimally at least in the sense that it will neither use too much, *nor*too little time as far as the whole game is concerned. Where it spendsthe time is up to the curve control parameters.


Christian


On 23/05/2009 22:57, Jason House wrote:

How have you tested your time management code? CGOS is very bad fortesting time management because it gives a gift of time on every move(to compensate for assumed network lag)
I think you might be missing a factor of two in your computations.Only half the moves in a game count against your time.
Sent from my iPhone
On May 23, 2009, at 4:26 PM, Christian Nentwich<[email protected] <mailto:[email protected]>> wrote:
This time management business is quite interesting. I looked intothis in some detail a while ago and came up with something I think isreasonable for 9x9. I'd love to hear what you all think about it.
My algorithm relies on two key parameters: the time left (which iseither reported by a server periodically, or maintained by theengine), and an estimate of how many moves are left. The estimate ofmoves left is set to 1.6 * board area (i.e. 9 x 9 x 1.6) initiallybased on the average length of playouts in experiments. Towards theend of the game, especially with Tromp Taylor rules, the algorithminstead counts the number of empty intersections left, plus a haircutfor captures. This is usually quite accurate.
So, given the time left, T, and the number of estimated moves left,M, the task is to find out how much time to spend on the currentmove. We know we want to spend (a lot) more on early moves, and lesslater.
Now assume you have moves numbered along the x axis, from 1 to M, andthe y axis shows how much time to spend on a move. I used a downwardsloping curve with the following form: 1 / x ^ (1 / n) where 'n'controls the steepness of the curve. We know the total area under thecurve *must* be equal to T, so that you provably never run out oftime given your estimated number of moves.
Integrating over dx and some algebra gives (remember n is a steepnessconstant, M is the number of moves left, T is time left):
time(current move) = T * (n - 1) / (n * (M ^ (n-1 / n) - 1)
Add a haircut of 5-10%, just in case of network funnies. Works verynicely for me, at least as far as time management is concerned, mycode is not strong yet but it never loses on time :-) Plus, it getsto spend super-linear time in the beginning. If you plot the initialcurve equation, you can see how it works.
Christian



On 23/05/2009 18:38, Don Dailey wrote:
On Sat, May 23, 2009 at 12:34 PM, Brian Sheppard <[email protected]<mailto:[email protected]>> wrote:
    >My general impression (also based on experiences from chess):
    >Distributing time rather balanced over the moves is a stable
    >strategy.
I have found in Chess that you also want to spend more time upfront. Part, but not all of the reason for this is that you don'tknow how long a game will last and you do not want to be on thelosing end of a short game where you have a lot of time left.This by itself makes early moves more important. Also, earlydecisions shape the game more than later decisions.
In 9x9 GO I have found that it's VERY beneficial to spend more timeon early moves. This seem to be more true than in chess. Ithink it is because the early game is much harder to play than theending and you don't want to have a lot of time built up playingeasy moves.
Like everything else, the trick is to find the right balance.With 19x19, time allocation is probably more difficult.
With sudden death time controls, a reasonable algorithm is to setsome percentage of the remaining time on the clock as your goaltime. For chess I have used numbers like 1/30th of the remainingtime. In my opinion the number should be a low estimate on howmany moves you expect to have to make. Although games can bereally short or really long, in general you expect that most gameswill take at least about 30 moves and not exceed 60 or 70 moves.
This does not allocate time evenly, which is good. Each move willbe played slightly faster than the previous. But it will NEVER runout of time either, at least mathematically since there is alwayssome time left over. This fraction can be tuned of course to yourcomfort level. I remember one older program used 1/60 but a coupleof years later the author reported to me that it was way too high.This was a program that dominated computer chess for a few years.
You can get a feel for this by just doing the math to see how muchtime you would have for an unusually short game or an unusually longgame. If your program supports multiple board sizes you pick adivisor that is some function of the board size, such as 1 / (N*N)(which is probably far too conservative.) So perhaps 1 / ((N*N) *0.6) where you tune the 0.6 constant.
So I'm saying that this is good in Chess and I believe based onBrian's comments and my own experience that it is ESPECIALLY good in GO.
- Don




    Reasoning on the basis of experience in chess is OK, but you must
    account for the differences between the domains.

    Chess is more or less uniformly difficult across the whole game.
    Go is not. It is definitely more difficult in the opening,
    especially
    for MCTS programs. Trials take longer in the opening, and the
    variance is larger, and the differences between moves is smaller
    (usually) which means that fewer moves are obviously forced. You
    have
    to spend more time on early moves in MCTS Go programs.

    Pebbles calculates the time required to uniformly spread the
    remaining
    time over the game. It then *doubles* that amount, and allocates
    that
    much time for the current play. This policy is not as extreme as you
    might think; it results in more-or-less uniform numbers of trials
    across the whole game. I have some experimental evidence that
    suggests
    that doubling is not enough. Perhaps the optimum multiplier is
    2.5 or 3.

    Now, this usually does not result in having to play blitz moves
    later
    in the game. (It can happen, if the opponent drags out a losing
    effort
    into 100+ turns, but that doesn't matter.)

    Mogo might have gone too far, but maybe not. There are a lot of ways
    to lose games.

    Best,
    Brian

    _______________________________________________
    computer-go mailing list
    [email protected] <mailto:[email protected]>
    http://www.computer-go.org/mailman/listinfo/computer-go/


------------------------------------------------------------------------

_______________________________________________
computer-go mailing list
[email protected]  <mailto:[email protected]>
http://www.computer-go.org/mailman/listinfo/computer-go/
_______________________________________________
computer-go mailing list
[email protected] <mailto:[email protected]>
http://www.computer-go.org/mailman/listinfo/computer-go/
------------------------------------------------------------------------

_______________________________________________
computer-go mailing list
[email protected]
http://www.computer-go.org/mailman/listinfo/computer-go/

_______________________________________________
computer-go mailing list
[email protected]
http://www.computer-go.org/mailman/listinfo/computer-go/

Re: [computer-go] Time weighting in opening

Reply via email to