Re: [Computer-go] Understanding and implementing RAVE

Gonçalo Mendes Ferreira Mon, 26 Oct 2015 09:04:35 -0700

I think we mean the same. So when black plays at a1 first we count it for
black everywhere but never for white. Correct?

The question is, if both black and white play a1 - because there werecaptures in the middle - whether both of them will update a1 in some oftheir nodes. I believe so.

Ah yes, the UCB term. I’ve now added it. I dimly remember now that Pachi
doesn’t use it but yes I’ve now added it. And I’m not sure I understand you
other comment. My nodes store the visits/etc, the move made, and references
to all children.

I've checked and that's with Pachi/Michi do too. And it's the closest tothe MCTS diagrams and explanations that are published around. Butinstead you could just have the information on transitions from a state,and not wins/visits of the state per se. I'm not very familiar withPachi (I couldn't find how it implements a transpositions table, or howit prunes it with memory safety).


I use a structure with
color
visits[19][19]
wins[19][19]
amaf_visits[19][19]
amaf_wins[19][19]
etc

They are very similar, but my node information doesn't need to consultthe transpositions table or follow points to select the next play. Onthe other hand sequences of play that arrive on the same node have theexact same "quality" regardless of sequence of play in Pachi, where inmy program the sequences of play only "unite" when following that stateforward.

I don't know if this difference is very small or if I'm forgettingsomething important. Maybe Pachi implements discovering if a state hasalready been expanded in a very efficient way.

I don't know how other programs do it, but instinctively, if you startedby implementing a non-UCT MCTS algorithm, the information you kept wouldbe just the initial state and statistics on the transitions, so my trainof thought seems (to me) the logical follow up.


Gonçalo F.
_______________________________________________
Computer-go mailing list
[email protected]
http://computer-go.org/mailman/listinfo/computer-go

Re: [Computer-go] Understanding and implementing RAVE

Reply via email to