Re: [Computer-go] Computer-go Digest, Vol 12, Issue 79

Hendrik Baier Wed, 26 Jan 2011 01:01:00 -0800

Hi Aja,

I would be interested in your results. I think the LGRF policy is only asmall first step into the direction of more adaptive playouts (andhopefully the overcoming of the horizon effect).As for the Last-Bad-Reply idea, you can read about my experiences withthis and related policies in my Master's thesis, if you're interested.It contains the idea that resulted in the "Power of Forgetting" paper aswell.

http://www.ke.tu-darmstadt.de/lehre/arbeiten/master/2010/Baier_Hendrik.pdf


regards,
Hendrik

I admit that it's difficult for me to include such deterministic default 
policy. :-)
With softmax policy, using the information of "last-LOST-reply" is maybe a good 
direction.

Aja


_______________________________________________
Computer-go mailing list
[email protected]
http://dvandva.org/cgi-bin/mailman/listinfo/computer-go

Re: [Computer-go] Computer-go Digest, Vol 12, Issue 79

Reply via email to