Re: [computer-go] RE : UCT RefBot

Mark Boon Fri, 21 Nov 2008 06:22:34 -0800


On 21-nov-08, at 09:34, Denis fidaali wrote:

I think that most people trying go-programming will try at leastto experiment once with UCT.The first logical step, is to build an amaf-bot. The other logicalstep, is to build a UCT bot. That's exactly the path i followed.And i bet many others have done that too. So it may be guessed thatmany more will do so. I feel that the most important thing is to beable to be rightly confident that the implementation is roughlyright. It's true that an implementation can also serves as a basisfor something else. But that will not be possible if you can't geta strong confidence that it is rightly done.
So i guess that you should keep things as simple as you can inyour reference-implementation. Litles tweaks will be easily doableafter you get the specification understandable once. For exemple, ido not think that the "explore the pass" lines are a must have. Youcan test an UCT implementation that never pass as long as there are"valid" moves left. (valid in the sense nor suicide, nor pseudo-eyes). That's simple. And yet i think the program can play stronglyenougth (given enougth simulations are made - Say 50 000).
UCT has many constants built in. (By the way, i don't reallyunderstand those 2 and 10 factors. Wouldn't that go in theexploration-factor ?? as *sqrt(1/5) ). I guess that any value wouldbe good enougth, as long as it makes the behavior of the bot ratherclear. So other people can adjust this factor, and compare theirresults. If later on, after the implementation has cought someattention, if one value get to be known as better, it'll still betime to put it in. It probably won't be a big fuss to adjustanyone's implementation to it anyway. You have to set up aconventionnal value that people can use as a reference, be it bad.SO 1.0 (or sqrt(1/5) would be Okay i suppose).
I don't think that wasting simulations in the end-game is really aproblem for a reference implementation.
The main problem i spot, is that you may need a fair number ofsimulations, to get some inter implementations reproducible data.Not everyone will be able or willing to put so much computing powerto that usage. But even then, to have a reference specificationwill never be a loss. Especially once the AMAF-reference-specification start to get it's own pages (if it's not already thecase : i have always have great trouble to track out all the links.Maybe it would be good to put it as a CGOS partner or something :)Then all people have to know is where to find CGOS. So the goal ofthe UCT-reference would be to be presented along with the AMAF-reference, with all the data that has been collected about how tomake "sure" that one implementation behavior is correct. And alsomaybe, along with some popular boost (like the weight for the AMAF-ref), or a Basic way of making RAVE work with it. But that'll belater.


Denis,

I agree with most of what you write. But there's a bit of frictionbetween two of the goals. On the one hand a reference implementationis ideally as simple as possible. On the other hand you need to takelimited computing power into account for testing. For testing newideas you want your base-line (which will be something similar to thereference-bot) to have as good a strength/CPU-time ratio as possible.

The simplest and cleanest would be a UCT-search without AMAF (orRAVE). But if it turns out AMAF would give a big boost in strengthfor virtually the same performance I think it should be considered.In my search implementation AMAF adds something like 20-30 lines ofcode in a single place, so the impact on the complexity is not solarge. So far I'm still testing whether AMAF actually adds much ornot. But if it turns out it doesn't it will be easy to remove.

With regards to the formula sqrt( 2 * (log(parent-visits) / (10*visits)) I admit I simply copied it from somewhere and never properlythought about it. so you're correct that in fact I'm using aexploration-factor of sqrt(1/5) instead of 1.0. I'll modify my codeto make this more clear.


Mark

_______________________________________________
computer-go mailing list
computer-go@computer-go.org
http://www.computer-go.org/mailman/listinfo/computer-go/

Re: [computer-go] RE : UCT RefBot

Reply via email to