MoGo has a notion of an "internal" node in the tree (as do most UCT
programs, I think), and the state-action pairs are only kept for those
nodes, not for the states visited during the playouts.
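
To make the storage point concrete, here is a minimal sketch of keeping RAVE (all-moves-as-first) statistics only on nodes that are already in the tree. This is not MoGo's actual code: the names Node and update_simulation, the dict-based layout, and the (player, move) encoding are all assumptions made for illustration.

```python
from dataclasses import dataclass, field


@dataclass
class Node:
    """A tree node (hypothetical layout); statistics live only here."""
    children: dict = field(default_factory=dict)     # move -> Node
    visits: dict = field(default_factory=dict)       # move -> simulations that chose it here
    wins: dict = field(default_factory=dict)         # move -> wins among those simulations
    rave_visits: dict = field(default_factory=dict)  # move -> simulations where it was played later
    rave_wins: dict = field(default_factory=dict)    # move -> wins among those simulations


def update_simulation(path, moves, winner):
    """Back up one simulation.

    path   -- tree nodes visited, root first; path[i] is where moves[i] was chosen
    moves  -- (player, move) pairs for the whole simulation, tree part plus playout
    winner -- the player who won the playout
    """
    for depth, node in enumerate(path):
        player, move = moves[depth]
        won = 1 if winner == player else 0

        # Ordinary UCT statistics for the move actually chosen at this node.
        node.visits[move] = node.visits.get(move, 0) + 1
        node.wins[move] = node.wins.get(move, 0) + won

        # RAVE statistics: every move the same player made from here on,
        # including moves from the playout, counted once per simulation.
        seen = set()
        for later_player, later_move in moves[depth:]:
            if later_player == player and later_move not in seen:
                seen.add(later_move)
                node.rave_visits[later_move] = node.rave_visits.get(later_move, 0) + 1
                node.rave_wins[later_move] = node.rave_wins.get(later_move, 0) + won
```

Under these assumptions, positions reached only during the playout never get an entry of their own; their moves just feed the counters of the tree nodes above them. Memory therefore grows with the number of tree nodes times the number of legal moves, not with the number of states the playouts pass through.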

Sylvain

2008/7/2 Jason Galbraith <[EMAIL PROTECTED]>:
> I've been looking at RAVE (Rapid Action Value Estimate), which MoGo uses.  The
> score of states during simulation is stored in state-action pairs, which are
> all updated with the playouts, rather than just those states visited in the
> tree.  How would you store these scores?  The number of potential states
> visited seems prohibitively large.
>
> Jason Galbraith
> Orego research group
>
>
> _______________________________________________
> computer-go mailing list
> [email protected]
> http://www.computer-go.org/mailman/listinfo/computer-go/
>