Hi Yamato,

I like you idea, but why do you use only 5x5 and 6x6 Go?

1. Our second algorithm, two-ply simulation balancing, requires a training set of two-ply rollouts. Rolling out every position from a complete two-ply search is very expensive on larger board sizes, so we would probably have to consider some subset of leaf positions. We wanted to analyse the full algorithm first, before we started making approximations. 2. We can generate a lot more data on small boards, to give high confidence on the results we report. 3. IMO it's important to do the science to understand underlying principles first, and then scale up to bigger boards, more complex Monte-Carlo searches, etc.

I don't think the 200+ Elo improvement is so impressive

I agree that it would be much more impressive to report positive results on larger boards. But perhaps it is already interesting that tuning the balance of the simulation policy can make a big difference on small boards? Also, larger boards mean longer simulations, and so the importance of simulation balancing should become even more exaggerated.

because the previous approaches were not optimized for such a small boards.

I'm not sure what you mean here? The supervised learning and reinforcement learning approaches that we compared against are both trained on the small boards, i.e. the pattern weights are specifically optimised for that size of board.

I agree that the handcrafted policy from Fuego was not optimised for small boards, which is why it performed poorly. But perhaps this is also interesting, i.e. it suggests that a handcrafted policy for 9x9 Go may be significantly suboptimal when used in 19x19 Go. So automatically learning a simulation policy may ultimately prove to be very beneficial.

I'm looking forward to your results on larger boards.

Me too :-)
Coming soon, will let you know how it goes.
But I hope that others will try these ideas too, it's always much better to compare multiple implementations of the same algorithm.

-Dave

_______________________________________________
computer-go mailing list
[email protected]
http://www.computer-go.org/mailman/listinfo/computer-go/

Reply via email to