Re: [Computer-go] A Linear Classifier Outperforms UCT on 9x9 Go

Peter Drake Wed, 29 Jun 2011 14:54:31 -0700

On Jun 29, 2011, at 2:39 PM, Brian Sheppard wrote:

>Such a table has 47 million entries in 19x19 go and over half amillion in 9x9 go. It require an enormous amount of data to fillsuch a table with enough samples to not be almost meaninglessstatistically.
That's true, but... But there are two other aspects to the math.

1) Classifiers are expensive to train

Note that the paper compares on fixed trials rather than fixed time.


The first experiment is on fixed trials; the rest are on fixed time.

A network with N outputs, H hidden nodes and K inputs has H * (N +K) weights. Here, K is the board size (9x9 = 81), and N varied from10 to 40, IIRC. Orego is training those weights on every position inthe playout (That is, roughly K moves worst case). Each updateinvolves dozens of floating point calculation.

We found that hidden units hurt, even with fixed playouts. As thepaper explains, a linear classifier performs even better (on a timebasis) than a sigmoid network with no hidden units. The number ofweights trained on each playout move is only 3, each of which onlyrequires a few flops.

Basically: RAVE uses (a lot) less computation and gets a result thatis better. (But maybe not much better; hard to tell because there isno comparison to Orego+RAVE.)


Yes, RAVE is much better. :-)

Peter Drake
http://www.lclark.edu/~drake/

_______________________________________________
Computer-go mailing list
[email protected]
http://dvandva.org/cgi-bin/mailman/listinfo/computer-go

Re: [Computer-go] A Linear Classifier Outperforms UCT on 9x9 Go

Reply via email to