Hello,

David Doshay wrote:
> On 3, Feb 2007, at 2:51 AM, Sylvain Gelly wrote:
>
>> the speed of the best simulation policy in MoGo is 0.6 * the speed
>> of the uniform random one.
>
> I think that this is very good. You give up less than a factor of 2
> from uniform random and you probably get better than a factor of 2 in
> the % of relevant moves.

Is there any known (by theory or tests) function of how much a increase
in the strength of the simulation policy increases the strength of the
MC/UCT Program as a whole?

Imagine you're using a almost random playout that has a strength, if
playing alone, of say 10 ELO (just a number, I don't know how strong it
really is). Using some thousand MC simulations with that 10 ELO playout
and some other clever tricks like UCT, and you get a program that plays
with 1500 ELO.

Now what if you change the playout to one that has a strength, if playing
alone, of 20 ELO, 50 ELO or 100 ELO. How whould that affect the strength
of your overall programm, if you run the same number of MCs? Any known
numbers on that?

[I understand that a non-random playout would decrease the speed, so
couln't run as many simulations as before in the same time, but let's
imagine we get it for free.]

eph

_______________________________________________
computer-go mailing list
computer-go@computer-go.org
http://www.computer-go.org/mailman/listinfo/computer-go/

Reply via email to