MoGo has a notion of "internal" nodes in the tree (as do most UCT programs, I think), and the state-action pairs are kept only for those nodes.
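
To make the storage point concrete, here is a rough sketch in Python. It is not MoGo's actual code: the names `Stats`, `Node` and `backup`, the per-node dictionaries, and the simple "same player, later in the game" update rule are all assumptions made for illustration. The point is only that each simulation touches the internal nodes on its path, so memory grows with the size of the tree and not with the number of positions seen in playouts.

```python
from dataclasses import dataclass, field


@dataclass
class Stats:
    wins: float = 0.0
    visits: int = 0


@dataclass
class Node:
    """An internal tree node (hypothetical layout); only these hold statistics."""
    children: dict = field(default_factory=dict)  # move -> Node
    uct: dict = field(default_factory=dict)       # move -> Stats for the move played here
    rave: dict = field(default_factory=dict)      # move -> Stats for moves played here or later


def backup(path, moves, root_result):
    """Update statistics after one simulation.

    path        -- internal nodes visited, root first; path[d] chose moves[d]
    moves       -- whole move sequence of the simulation (tree part + playout)
    root_result -- 1.0 if the player to move at the root won, else 0.0
    """
    for depth, node in enumerate(path):
        # Players alternate, so flip the result at odd depths.
        value = root_result if depth % 2 == 0 else 1.0 - root_result

        # Ordinary UCT statistic for the move actually chosen in this node.
        s = node.uct.setdefault(moves[depth], Stats())
        s.wins += value
        s.visits += 1

        # RAVE-style statistic: every move the same player made from this
        # point on (every second move), counted once per simulation.
        seen = set()
        for move in moves[depth::2]:
            if move in seen:
                continue
            seen.add(move)
            r = node.rave.setdefault(move, Stats())
            r.wins += value
            r.visits += 1


# Usage: two internal nodes, then three playout moves that create no nodes.
root = Node()
child = Node()
root.children["a"] = child
backup([root, child], ["a", "b", "c", "d", "e"], root_result=1.0)
```

In this sketch the playout moves "c", "d" and "e" only add entries to the dictionaries of `root` and `child`; no new state is created for the positions they pass through.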
Sylvain

2008/7/2 Jason Galbraith <[EMAIL PROTECTED]>:
> I've been looking at RAVE (Rapid Action Value Estimate), which MoGo uses. The
> score of states during simulation is stored in state-action pairs, which are
> all updated with the playouts, rather than just those states visited in the
> tree. How would you store these scores? The number of potential states
> visited seems prohibitively large.
>
> Jason Galbraith
> Orego research group

_______________________________________________
computer-go mailing list
[email protected]
http://www.computer-go.org/mailman/listinfo/computer-go/
