[computer-go] My bad intuitions about Monte Carlo Go

Peter Drake Wed, 23 May 2007 10:23:08 -0700

In previous versions of Orego, I have added one node per playout. I just changed that to add a child to a node only if that node has at least A runs, where A is the area of the board (e.g., 81). This seems to make the program stronger, if only because it allows me to get in more runs. Specifically, it allows me to get (a bit) more out of my 4-processor-core machine, because the parts where the threads have to synchronize on the tree (generating in-tree moves and incorporating the playout) are shorter.
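A minimal Python sketch of that expansion rule, for illustration only. It is not Orego's code: the names Node, record_playout and BOARD_AREA are hypothetical, and wins are counted from one fixed player's point of view to keep it short.

```python
from dataclasses import dataclass, field

BOARD_AREA = 81  # A for a 9x9 board (assumed value, matching the example in the text)


@dataclass
class Node:
    runs: int = 0
    wins: int = 0  # counted from one fixed player's point of view (simplification)
    children: dict = field(default_factory=dict)


def record_playout(root, moves, won, threshold=BOARD_AREA):
    """Incorporate one playout into the tree.

    `moves` is the move sequence of the playout, `won` is 1 or 0.
    Statistics are updated along the part of the sequence already in the
    tree. At most one child is added, and only below a node that has
    at least `threshold` runs.
    """
    node = root
    node.runs += 1
    node.wins += won
    for move in moves:
        child = node.children.get(move)
        if child is None:
            if node.runs < threshold:
                break  # too few runs through this node: do not grow the tree
            child = Node()
            node.children[move] = child
            child.runs += 1
            child.wins += won
            break  # never add more than one node per playout
        child.runs += 1
        child.wins += won
        node = child
```

With threshold=1 this behaves like the old one-node-per-playout scheme; larger thresholds keep the tree smaller, so less time is spent inside whatever lock protects it.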

Remi (and, I think, others) were already doing this. I wasn't, because I got the idea into my head that this would be throwing away information. It is, but it's throwing away information that you're almost certainly not going to use. If a move is truly bad, random playouts will generally reveal this; it's not worth constructing a subtree to figure out WHY it's bad.

Has anyone tried requiring more than A playouts before adding a child to a node? A seems an arbitrary threshold, but I have no idea how (other than empirically) to find the right tradeoff between tree search and preemptive sampling.

On a related note, where are people applying heuristics? I see three possibilities:


1) Incorporate heuristics into the UCT formula.

2) Use heuristics to order moves added as children to tree nodes that don't have all their children yet. It is possible to preemptively prune here, never exploring some moves that are rated as awful in the heuristics.

3) Use heuristics in the playouts.

I have (recently) avoided 3, on the theory that the playouts need to be very fast and are largely nonsensical. Given my new realization that the playout threads are often waiting for the tree anyway, this theory may not be valid. Again, it's a question of finding the right balance...
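Possibilities 1 and 2 above could look roughly like the following Python sketch. Everything here is an assumption made for illustration: the function names, the parameters c, weight and prune_below, and in particular the choice to add the heuristic as a bonus that shrinks as a move accumulates runs. That is only one of several ways to fold a heuristic into the UCT formula.

```python
import math


def uct_value(wins, runs, parent_runs, heuristic=0.0, c=1.0, weight=1.0):
    """UCB1-style value with an optional heuristic bonus (possibility 1).

    The bonus weight * heuristic / (runs + 1) fades as the move gets
    more runs, so playout statistics eventually dominate.
    """
    if runs == 0:
        return math.inf  # untried moves are tried first
    mean = wins / runs
    exploration = c * math.sqrt(math.log(parent_runs) / runs)
    bonus = weight * heuristic / (runs + 1)
    return mean + exploration + bonus


def candidate_moves(moves, heuristic, prune_below=None):
    """Order moves for expansion, best heuristic first (possibility 2).

    If prune_below is given, moves rated below it are dropped and
    never explored.
    """
    kept = [m for m in moves
            if prune_below is None or heuristic(m) >= prune_below]
    return sorted(kept, key=heuristic, reverse=True)
```

Here heuristic is assumed to be either a number (in uct_value) or a function from a move to a number (in candidate_moves); how those ratings are produced is left open.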


Peter Drake
http://www.lclark.edu/~drake/

_______________________________________________
computer-go mailing list
[email protected]
http://www.computer-go.org/mailman/listinfo/computer-go/
