David Silver wrote:

I would be very interested to hear more about Dimwit's approach, and also Remi's experiments with online learning in CrazyStone.


Hi,

My idea was very similar to what you describe. The program built a collection of rules of the kind "if condition then move". Condition could be anything from a "tree-search rule" of the kind "in this particular position play x", or general rule such as "in atari, extend". It could be also anything in-between, such as a miai specific to the current position. The strengths of moves were updated with an incremental Elo-rating algorithm, from the outcomes of random simulations.

I did not go very far in that direction, and my rule-based program is still very weak. I found that I could bring very big improvements to Crazy Stone with the techniques I described in my paper, so I focused on that. I will incorporate my patterns into the rule-based program in the future.

I found that my rule-based program scaled extremely well with larger board sizes. What about yours ?

Rémi
_______________________________________________
computer-go mailing list
[email protected]
http://www.computer-go.org/mailman/listinfo/computer-go/

Reply via email to