Hi Remi,
What komi did you use for 5x5 and 6x6?
I used 7.5 komi for both board sizes.
I find it strange that you get only 70 Elo points from supervised
learning over uniform random. Don't you have any feature for atari
extension? This one alone should improve strength immensely (extend
a string in atari because of the previous move).
Actually, no. The features are very simple: they know how to capture
but not how to defend ataris. I'm sure that a better set of features
could improve by more than 70 Elo, but I expect we would still see a
benefit from balancing the weights correctly. For example, the Fuego
policy defends ataris and follows several other common-sense rules,
but the results on 5x5 and 6x6 show that it is not well balanced on
small boards.
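For concreteness, here is a rough sketch of how such an atari-extension feature could be computed in a playout policy. The board representation and all names here are my own illustration, not code from any of the programs discussed:

```python
# Hypothetical sketch of the "extend a string put in atari by the
# previous move" feature. The board is a dict {(x, y): 'B' or 'W'}
# on a size x size grid; empty points are simply absent.

def neighbors(pt, size):
    x, y = pt
    for nx, ny in ((x - 1, y), (x + 1, y), (x, y - 1), (x, y + 1)):
        if 0 <= nx < size and 0 <= ny < size:
            yield (nx, ny)

def string_and_liberties(board, pt, size):
    """Flood-fill the string containing pt; return (stones, liberties)."""
    color = board[pt]
    stones, libs, todo = set(), set(), [pt]
    while todo:
        p = todo.pop()
        if p in stones:
            continue
        stones.add(p)
        for n in neighbors(p, size):
            if n not in board:
                libs.add(n)
            elif board[n] == color:
                todo.append(n)
    return stones, libs

def atari_extension(board, prev_move, to_play, size):
    """If prev_move put one of to_play's strings in atari,
    return its single liberty (the extension point), else None."""
    for n in neighbors(prev_move, size):
        if board.get(n) == to_play:
            _, libs = string_and_liberties(board, n, size)
            if len(libs) == 1:
                return next(iter(libs))
    return None

# Example: White's move at (2, 1) reduces the lone Black stone at
# (2, 2) to one liberty, so the feature proposes extending to (2, 3).
board = {(2, 2): 'B', (1, 2): 'W', (3, 2): 'W', (2, 1): 'W'}
print(atari_extension(board, (2, 1), 'B', 5))  # (2, 3)
```

Note this only detects the atari; a real policy would still have to check that the extension move is legal and not itself self-atari.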
Let us extrapolate: I got more than 200 Elo points of improvement
from my patterns in 9x9 over uniform random (I never really measured
that; it may be even more than 200).
I guess you got more than 200 Elo on 9x9. In Mogo (Gelly et al. 2006),
the improvement from uniform random was at least 400 Elo, and I think
your simulation policy is probably at least as strong.
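As a sanity check on these numbers (my own addition, not from the measurements discussed): under the usual logistic Elo model, a gap of d points corresponds to an expected win rate of 1 / (1 + 10^(-d/400)):

```python
# Expected score of the stronger side at an Elo gap of d,
# under the standard logistic Elo model.
def elo_to_winrate(d):
    return 1.0 / (1.0 + 10.0 ** (-d / 400.0))

# The gaps discussed above, as win rates against uniform random:
for d in (70, 200, 400, 600):
    print(d, round(elo_to_winrate(d), 3))  # 70 -> 0.599, 400 -> 0.909
```

So a 70 Elo gain is only about a 60% win rate over the baseline, while 400 Elo is already above 90%, which makes the reported differences easy to compare against measured results.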
By the way, I was sad to hear you're not working on Crazystone any
more. Is it because you are busy with other projects?
So maybe I could get 600 more Elo points
with your method. And even more on 19x19.
I noticed that, in general, changes in the playout policy have a much
bigger impact on larger boards than on smaller boards.
As to whether we can extrapolate, I hope so :-)
I share the same feeling that improving the simulation policy will be
more impactful on bigger boards with longer simulations.
On the other hand I've been surprised by many things in MC Go, so
nothing is certain until we try it!
-Dave
_______________________________________________
computer-go mailing list
computer-go@computer-go.org
http://www.computer-go.org/mailman/listinfo/computer-go/