[Computer-go] Monte-Carlo Simulation Balancing in Practice

Rémi Coulom Thu, 30 Sep 2010 06:05:19 -0700

Hi,

This is the CG'2010 paper Aja wrote with me.


Abstract: Simulation balancing is a new technique to tune parameters of a 
playout policy for a Monte-Carlo game-playing program. So far, this algorithm 
had only been tested in a very artificial setting: it was limited to 5x5 and 
6x6 Go, and required a stronger external program that served as a supervisor. 
In this paper, the effectiveness of simulation balancing is demonstrated in a 
more realistic setting. A state-of-the-art program, Erica, learned an improved 
playout policy on the 9x9 board, without requiring any external expert to 
provide position evaluations. Evaluations were collected by letting the program 
analyze positions by itself. The previous version of Erica learned pattern 
weights with the minorization-maximization algorithm. Thanks to simulation 
balancing, its playing strength was improved from a winning rate of 69% to 78% 
against Fuego 0.4.

You can download it from there:
http://remi.coulom.free.fr/CG2010-Simulation-Balancing/

Rémi
_______________________________________________
Computer-go mailing list
[email protected]
http://dvandva.org/cgi-bin/mailman/listinfo/computer-go

[Computer-go] Monte-Carlo Simulation Balancing in Practice

Reply via email to