Re: [computer-go] How to "properly" implement RAVE?

Mark Boon Wed, 21 Jan 2009 05:26:19 -0800


On Jan 21, 2009, at 10:23 AM, Magnus Persson wrote:

Quoting Thomas Lavergne <thomas.laver...@reveurs.org>:
 - the best play is a good only if played immediatly and very bad if
   played later in the game :
 - the first playout for this play resulted in a lost.
score and RAVE score will be very low and this play will never be
considered again until a very long time.
You raise an interesting concern.
The simple solution to your question is to add an exploration termusing UCT for example. Then it becomes an empirical question whatparameter for exploration gives the strongest play. My experience isthat the best parameter is so small it can be set to zero.

Well, empirically, when I set the exploration component to zero itstarts to play a lot worse. Like I wrote: the winning percentage dropsto 24% vs. the same program with the exploration component, which is ahuge difference.

So if you have a different experience, you must have something elsethat overcomes this hurdle that's not part of a simple MCTS-RAVEimplementation. I'd be very interested to learn what that is. Sylvaindidn't take the bait ;-)


Mark

_______________________________________________
computer-go mailing list
computer-go@computer-go.org
http://www.computer-go.org/mailman/listinfo/computer-go/

Re: [computer-go] How to "properly" implement RAVE?

Reply via email to