Re: [computer-go] UCT concept

Jason House Tue, 27 Jan 2009 06:45:54 -0800

On Jan 26, 2009, at 6:26 PM, matt harman <harman.m...@hotmail.co.uk> wrote:


> That's the misunderstanding right there.
> 1 child will be chosen and 1 simulation will be run.

Thanks for the quick answer, so 1 simulation is run because too many
will give lots of noise to the result?

Just the opposite. The noise in the win rate scales as 1/sqrt(n), so
more simulations mean less noise, not more. The reason for doing few
simulations is that they're relatively expensive. UCT and MCTS do extra
tree walking because that's less expensive than oversimulating bad moves.
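
To make the 1/sqrt(n) scaling concrete, here is a minimal sketch. It assumes
each playout is an independent win/loss trial with a fixed win probability p,
in which case the standard error of the estimated win rate is
sqrt(p * (1 - p) / n). The function name is made up for illustration.

```python
import math


def win_rate_standard_error(p: float, n: int) -> float:
    """Standard error of a win rate estimated from n independent playouts.

    Assumes every playout wins with the same probability p (a simplification).
    """
    return math.sqrt(p * (1.0 - p) / n)


if __name__ == "__main__":
    # Each 100x increase in playouts shrinks the noise by a factor of 10.
    for n in (1, 100, 10_000):
        print(n, win_rate_standard_error(0.5, n))
```

Under that assumption a single playout tells you very little on its own, and
the estimate only sharpens as visits accumulate.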

If only 1 is run, then each of the 4 children can only win or lose
its single simulation, 0 or 1.

Initially, yes, but future visits will increase that. Nodes near the
root could have hundreds of thousands of simulations.

This would be non-deterministic so how would you
decide which child to exploit?

Techniques vary a bit, but a quality score is calculated for each move
from its simulation results. The move with the best score is the one
simulated next.

UCT balances exploitation with exploration by including uncertainty in
the metric.
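
One common form of such a metric is the UCB1 formula: win rate plus a bonus
that shrinks as a child is visited more. The sketch below is only an
illustration of that idea, not any particular program's code. The function
names and the default constant c = sqrt(2) are assumptions.

```python
import math


def ucb1(wins: float, visits: int, parent_visits: int, c: float = math.sqrt(2)) -> float:
    """Win rate plus an uncertainty bonus (UCB1); unvisited children go first."""
    if visits == 0:
        return math.inf
    exploitation = wins / visits
    exploration = c * math.sqrt(math.log(parent_visits) / visits)
    return exploitation + exploration


def select_child(stats, c: float = math.sqrt(2)) -> int:
    """Return the index of the child to simulate next.

    stats is a list of (wins, visits) pairs, one per child (hypothetical layout).
    """
    parent_visits = sum(visits for _, visits in stats)
    return max(
        range(len(stats)),
        key=lambda i: ucb1(stats[i][0], stats[i][1], parent_visits, c),
    )


if __name__ == "__main__":
    stats = [(60, 100), (1, 2)]
    print(select_child(stats))         # bonus included
    print(select_child(stats, c=0.0))  # pure win rate, no uncertainty term
```

With the bonus included, a child that has only been tried twice can outrank a
well-explored child with a higher win rate. Setting c to 0 reduces the choice
to the raw win rate.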

Many top programs have discarded the uncertainty term because
heuristics and RAVE/AMAF are surprisingly accurate.

Given specific simulation results, the searches are deterministic.
Simulations are random, so the searches end up very non-deterministic.
Over time, they should converge on the same conclusion.
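
A toy sketch of that point, under loud assumptions: a single node with four
children, each "simulation" reduced to a coin flip with a made-up hidden win
probability, and UCB1 used as the selection metric. Fixing the random seed
fixes the simulation results, so the search repeats exactly. Different seeds
give different visit counts.

```python
import math
import random


def score(wins: int, visits: int, total: int) -> float:
    """UCB1-style move quality; unvisited children are tried first."""
    if visits == 0:
        return math.inf
    return wins / visits + math.sqrt(2.0 * math.log(total) / visits)


def run_search(seed: int, iterations: int = 5000,
               win_probs=(0.3, 0.4, 0.5, 0.8)) -> list:
    """Run one toy search and return the visit count of each child.

    win_probs are invented hidden win rates standing in for real playouts.
    """
    rng = random.Random(seed)  # the only source of randomness
    wins = [0] * len(win_probs)
    visits = [0] * len(win_probs)
    for total in range(1, iterations + 1):
        # Selection is fully determined by the statistics gathered so far.
        child = max(range(len(win_probs)),
                    key=lambda i: score(wins[i], visits[i], total))
        # One random simulation for the chosen child.
        if rng.random() < win_probs[child]:
            wins[child] += 1
        visits[child] += 1
    return visits


if __name__ == "__main__":
    print(run_search(seed=1))
    print(run_search(seed=1))  # same seed, identical visit counts
    print(run_search(seed=2))  # different seed, different counts
```

In a run like this the exact counts depend on the seed, but with enough
iterations the most-visited child should usually be the same one.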



Thanks

Matt
> _______________________________________________
> computer-go mailing list
> computer-go@computer-go.org
> http://www.computer-go.org/mailman/listinfo/computer-go/


