This version's default player (Lgrf2Player) uses the last-good-reply
policy, and in fact an improved version described in an upcoming
paper. We do not use the Elo-based heavy playouts, because we were
never able to get them to run quickly enough to really offer an
improvement.
Peter Drake
http://www.lclark.edu/~drake/
On Jan 10, 2011, at 8:27 PM, Aja wrote:
Hi professor Drake,
I read your paper "THE LAST-GOOD-REPLY POLICY FOR MONTE-CARLO GO"
and was very surprised with the performance of the heuristic "The
Last-Good-Reply Policy". In your experiment, it boosts the wining
rate from around 40% to almost 60%. I wonder does this version of
Orego feature this heuristic? Or maybe you have combine this
heuristic with the "Elo-Based Heavy Playouts" described in your
paper "Investigating the Eects of Playout Strength in Monte-Carlo
Go"?
Thanks,
Aja
_______________________________________________
Computer-go mailing list
[email protected]
http://dvandva.org/cgi-bin/mailman/listinfo/computer-go
_______________________________________________
Computer-go mailing list
[email protected]
http://dvandva.org/cgi-bin/mailman/listinfo/computer-go