On Wed, Sep 2, 2009 at 19:45, Matthew Woodcraft<matt...@woodcraft.me.uk> wrote:
> Łukasz Lew wrote:
>> If the weight in RAVE formula is near 1 in one child of tree and near
>> 0 in other then you basically compare RAVE value to regular average
>> value, which might be comparing apples to oranges.
>
> Yes, and this can cause problems in practice. There's been some
> discussion of this before.

Can you give me a link or date?

Lukasz

>
> In positions where the RAVE values tend to be too high, the effect is
> that moves with few visits will be favoured, which will then equalise
> the RAVE weight again. The effect is rather like temporarily increasing
> the exploration coefficient, and nothing very bad happens.
>
> But in positions where the RAVE values are too low (which mostly means
> positions where the current player is winning), the effect is worse: the
> program will be reluctant to explore different moves, and this time
> there is positive feedback (the RAVE weights will diverge) and so the
> situation won't correct itself.
>
> -M-
> _______________________________________________
> computer-go mailing list
> computer-go@computer-go.org
> http://www.computer-go.org/mailman/listinfo/computer-go/
>
_______________________________________________
computer-go mailing list
computer-go@computer-go.org
http://www.computer-go.org/mailman/listinfo/computer-go/

Reply via email to