Don Dailey: <[EMAIL PROTECTED]>:
>
>
>[EMAIL PROTECTED] wrote:
>>> I experimented with something similar a while ago, using the
>>> publicly available mogo and manipulating komi between moves.
>>>
>>> If its win probability fell below a certain threshold (and the move
>>> number wasn't too high), I told it to play on the assumption that it
>>> would receive a few points more komi (and similarly when the win
>>> probability became high).
>>>     
>>
>> That's certainly a nice solution !
>>
>> It's probably easy to implement, and relatively easy to tune.
>>
>> Now, the question is: did it seem stronger to our eyes because it played
>> more human-like ?
>>   
>I experimented with this idea extensively a while back and never found
>an implementation that improved it's playing ability.    In fact every
>version played weaker.    I tried versions that dynamically adjusted
>komi during the course of a game when things looked close to hopeless
>(or certain) and I came to the eventual conclusion that whenever you
>didn't use the correct komi,  you were in fact decreasing it winning
>chances.     If you tell it that is not winning when it really IS
>winning by 1/2 point,  then there will be games where it plays stupid
>desperate moves when it doesn't have to.     
>
>If it is almost losing and you tell it (by adjusting komi) that it has a
>good chance,  it will tend to lose with a higher score,  but you have
>essentially treated it as a spoiled child,  lowering your expectations
>for it and it is happy to play with the new goal of losing.  
>
>This whole concept is based on what appears to be a flawed idea,  
>deceive the program into believing something that isn't true in the
>hopes that it will somehow make it play better.   

I don't agree. If the estimated score by playouts had no error, you 
were correct.

I've applied my formula (in my previous post) when estimated winning 
probability beyonds 65% and beneth 45% or so and get better 
results.  For example, withou this, my program sometime lost games 
against weaker programs due to misunderstanding of L&D at corners 
with nakade.

-Hideki

>We had this conversation recently but with idea of changing the scoring
>algorithm, not the komi.    I think the same principles apply.    
>
>I actually think some variation of the idea would work if we understood
>the issues better.    In SOME positions it might very well play
>objectively better if we lie to it by  lowering  or raising the komi or
>manipulating the scoring algorithm,  but we have to pick and choose the
>types of positions where this doesn't lead to weaker play.     For
>instance it's good if it makes it fight better,  but in many positions
>lowering the komi to make it fight in a losing position will cause it to
>consolidate a losing position.     UCT isn't going to fight when it can
>consolidate.   It doesn't care how difficult a position is to play for
>you,  only for it.

It depends on the property of playouts of programs, I guess.  That 
is, if the error of the estimated score decreases in monotonic 
manner, this idea won't help any.

-Hideki
--
[EMAIL PROTECTED] (Kato)
_______________________________________________
computer-go mailing list
computer-go@computer-go.org
http://www.computer-go.org/mailman/listinfo/computer-go/

Reply via email to