Re: [computer-go] Experimentation

Christian Nentwich Tue, 07 Jul 2009 09:16:25 -0700

Magnus,

along the lines of the argument I am trying to make: did you try yourexperiments with time limits from 30 seconds per game to five minutesper game (say), rather than playouts? Using gogui-twogtp this isrelatively easy to achieve.

I am obviously not associated with Fuego, but I guess it is reasonableto assume that Fuego's architecture was not designed to operate atlimits like 2, 8 or 32 simulations in the same way Valkyria was. It isan interesting study in its own right for scalability purposes; but togo on to infer strength from it would seem like comparing apples andoranges.


Christian


Magnus Persson wrote:

Quoting Darren Cook <[email protected]>:
* The scaling behavior might be different. E.g. if Fuego and Valkyria
are both run with 10 times more playouts the win rate might change. Just
to dismiss an algorithm that loses at time limits that happen to suit
rapid testing on today's hardware could mean we miss out on the ideal
algorithm for tomorrow's hardware. (*)
I just happened to have experimental data on exactly this topic. Thisis Valkyria vs Fuego where I scale the number of playouts (Sims) x4 ineach row.
Sims    Winrate    Err    N    WR    EloDiff
2    99.2    0.4    500    0.992    837
8    98.2    0.6    500    0.982    696
32    94.2    1    500    0.942    484
128    88.8    1.4    500    0.888    360
512    82    1.7    500    0.82    263
2048    83.2    1.7    499    0.832    278
8192    81.3    1.7    497    0.813    255
32768    75.5    3.6    139    0.755    196
The data shows clearly that the 0.3.2 version of Fuego I use, probablyplays really bad moves with a high frequency in the playouts. Withmore playouts a lot of these blunders can be avoided I guess and thewin rate goes down from 99% towards 80%. The question here if it goesasymptotically towards 80% or perhaps 50% with more simulations.Unfortunately I cannot continue this plot because I run out of memoryand it takes ages to finish the games.
So the question is then: are there a fixed gain of the heavy playoutswith more than 512 simulations or does the effect of the heavyplayout get less and less important with larger tree size.
Note also that this also not only a matter of having heavy playouts ornot. There is also a difference in tree search since Valkyria andFuego probably search their trees differently, and it could be thatValkyria search deep trees
inefficiently.
Maybe I should run a similar test against a light version of Valkyriato control for the search algorithm.
-Magnus
_______________________________________________
computer-go mailing list
[email protected]
http://www.computer-go.org/mailman/listinfo/computer-go/



--

Christian Nentwich

Director, Model Two Zero Ltd.
+44-(0)7747-061302
http://www.modeltwozero.com

_______________________________________________
computer-go mailing list
[email protected]
http://www.computer-go.org/mailman/listinfo/computer-go/

Re: [computer-go] Experimentation

Reply via email to