Re: [Computer-go] using the opponents time for chosing komi

Stefan Kaitschick Tue, 14 Sep 2010 04:22:37 -0700

Adjusting the winrate to 50% is unsound. Especially in handicap gameswhen the bot takes black.Intuitively it means going "head to head" with a stronger opponent,which is not the most promising course of action.The main reason to adjust the komi is to give the bot some criteria ofsuccess when "objectively" every move is a win or every move is a loss."Sacrificing" winrate is actually not very good in any case. But if akomi burden can be placed on the bot without a significant drop of thewinrate that actually represents an added layer of safety.


Stefan



Am 14.09.2010 13:03, schrieb Willemien:

If you use "whole number" komi and your program treads draws at 1/2wins 1/2 losses (or what is maybe easier a draw and a loss, and treatwins as 2 wins and losses as 2 losses )

then the winrate at the right komi should be 50%

so if the real winrate is far below (-25%) or above (75% +) this youneed to adjust the komi


The problem is that your program needs to do something with draws.
(and the normal UCB formulas don't work)

On Tue, Sep 14, 2010 at 10:28 AM, Stefan Kaitschick<[email protected] <mailto:[email protected]>>wrote:


     The usual way to use the opponents time, if it is used at all, is
    to guess the opponents move, and do a normal search on that
    presumption.
    If the guess was correct, then the search results can be used,
    otherwise they are thrown away.
    This is probably the most reasonable way, but ofcourse the
    effectiveness is not so great.
    Getting back to the the dreaded topic of dynamic komi, I would
    like to suggest an alternate way of using this time:
    Do a normal move search at several different komi levels, each
    with a fraction of the normal number of positions.
    It's not about finding the best move, only about estimating the
    winrate at different komi levels.
    This komi vs winrate profile could then be used to determine komi
    for the next search.
    Basically adjust the komi to the largest value that avoids a
    significant drop from the winrate at regular komi.
    The standard approach probably goes for a certain winrate that
    must be maintained, but that way it's impossible to say how much
    additional risk is actually beeing taken.

    Stefan
    _______________________________________________
    Computer-go mailing list
    [email protected] <mailto:[email protected]>
    http://dvandva.org/cgi-bin/mailman/listinfo/computer-go



_______________________________________________
Computer-go mailing list
[email protected]
http://dvandva.org/cgi-bin/mailman/listinfo/computer-go

_______________________________________________
Computer-go mailing list
[email protected]
http://dvandva.org/cgi-bin/mailman/listinfo/computer-go

Re: [Computer-go] using the opponents time for chosing komi

Reply via email to