On Jun 29, 2011, at 2:39 PM, Brian Sheppard wrote:

>Such a table has 47 million entries in 19x19 go and over half a million in 9x9 go. It require an enormous amount of data to fill such a table with enough samples to not be almost meaningless statistically.

That's true, but... But there are two other aspects to the math.

1) Classifiers are expensive to train

Note that the paper compares on fixed trials rather than fixed time.

The first experiment is on fixed trials; the rest are on fixed time.

A network with N outputs, H hidden nodes and K inputs has H * (N + K) weights. Here, K is the board size (9x9 = 81), and N varied from 10 to 40, IIRC. Orego is training those weights on every position in the playout (That is, roughly K moves worst case). Each update involves dozens of floating point calculation.

We found that hidden units hurt, even with fixed playouts. As the paper explains, a linear classifier performs even better (on a time basis) than a sigmoid network with no hidden units. The number of weights trained on each playout move is only 3, each of which only requires a few flops.

Basically: RAVE uses (a lot) less computation and gets a result that is better. (But maybe not much better; hard to tell because there is no comparison to Orego+RAVE.)

Yes, RAVE is much better. :-)

Peter Drake
http://www.lclark.edu/~drake/



_______________________________________________
Computer-go mailing list
[email protected]
http://dvandva.org/cgi-bin/mailman/listinfo/computer-go

Reply via email to