This version's default player (Lgrf2Player) uses the last-good-reply policy, and in fact an improved version described in an upcoming paper. We do not use the Elo-based heavy playouts, because we were never able to get them to run quickly enough to really offer an improvement.

Peter Drake
http://www.lclark.edu/~drake/



On Jan 10, 2011, at 8:27 PM, Aja wrote:

Hi professor Drake,

I read your paper "THE LAST-GOOD-REPLY POLICY FOR MONTE-CARLO GO" and was very surprised with the performance of the heuristic "The Last-Good-Reply Policy". In your experiment, it boosts the wining rate from around 40% to almost 60%. I wonder does this version of Orego feature this heuristic? Or maybe you have combine this heuristic with the "Elo-Based Heavy Playouts" described in your paper "Investigating the E ects of Playout Strength in Monte-Carlo Go"?

Thanks,
Aja
_______________________________________________
Computer-go mailing list
[email protected]
http://dvandva.org/cgi-bin/mailman/listinfo/computer-go

_______________________________________________
Computer-go mailing list
[email protected]
http://dvandva.org/cgi-bin/mailman/listinfo/computer-go

Reply via email to