Hi Hendrik,

That's a good question. At least for the LGR policy without forgetting (https://webdisk.lclark.edu/drake/publications/drake-icga-2009.pdf), using only the first appearance of a reply did not significantly differ in performance.
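In code the difference is just which reply ends up in the table when the previous move occurs more than once in a playout; roughly something like the sketch below (names and sizes are illustrative only, not Orego's actual code, and a real engine would keep one table per color):

// Minimal sketch of an LGR-style reply table (no forgetting); names and sizes
// are illustrative only, not Orego's actual code. A real engine would keep one
// table per color and also handle pass moves.
import java.util.Arrays;

class ReplyTable {
    static final int BOARD_POINTS = 19 * 19;
    static final int NO_REPLY = -1;

    // replyTo[prev] = stored reply to the move at point prev
    final int[] replyTo = new int[BOARD_POINTS];

    ReplyTable() {
        Arrays.fill(replyTo, NO_REPLY);
    }

    // After a playout, store the winner's replies: moves[i] replies to moves[i-1],
    // and winnerParity (0 or 1) selects the winner's moves. The flag decides whether
    // the first or the last appearance of a previous move within the playout is kept.
    void update(int[] moves, int winnerParity, boolean keepFirstAppearance) {
        boolean[] storedThisPlayout = new boolean[BOARD_POINTS];
        for (int i = 1; i < moves.length; i++) {
            if (i % 2 != winnerParity) continue;               // only the winner's replies
            int prev = moves[i - 1];
            if (keepFirstAppearance && storedThisPlayout[prev]) continue;
            replyTo[prev] = moves[i];                          // last appearance overwrites
            storedThisPlayout[prev] = true;
        }
    }

    // During a playout: the suggested reply to the opponent's last move, or NO_REPLY.
    int suggest(int previousMove) {
        return replyTo[previousMove];
    }
}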

Thanks for your explanation. Yes, my experiment indicates that the playing strength is almost the same.

It's only a few lines of code; test it and see whether it makes a difference for your playout policy and program architecture. Stronger playout policies than Orego's will have different interactions with LGRF. You could even try saving several sets of replies per intersection, for the first, second, third appearance of the previous move in a playout, in the hope of capturing certain tactical situations with sacrifices. But I don't expect much.
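Roughly what I have in mind, as an illustrative sketch only (again not Orego's code; in practice you would keep one such table per color):

// Sketch of separate reply sets for the 1st, 2nd, 3rd appearance of the previous
// move within a playout; names and sizes are illustrative only, not Orego's code.
import java.util.Arrays;

class MultiReplyTable {
    static final int BOARD_POINTS = 19 * 19;
    static final int MAX_APPEARANCES = 3;      // first, second, third appearance
    static final int NO_REPLY = -1;

    // replyTo[k][prev] = reply stored for the (k+1)-th appearance of prev in a playout
    final int[][] replyTo = new int[MAX_APPEARANCES][BOARD_POINTS];

    MultiReplyTable() {
        for (int[] row : replyTo) Arrays.fill(row, NO_REPLY);
    }

    // After a won playout, store the winner's reply for each appearance separately.
    void update(int[] moves, int winnerParity) {
        int[] count = new int[BOARD_POINTS];   // how often prev has preceded a winner's move
        for (int i = 1; i < moves.length; i++) {
            if (i % 2 != winnerParity) continue;
            int prev = moves[i - 1];
            int k = count[prev]++;
            if (k < MAX_APPEARANCES) {
                replyTo[k][prev] = moves[i];
            }
        }
    }

    // During a playout, ask for the reply matching how often prev has appeared so far.
    int suggest(int previousMove, int appearanceIndex) {
        if (appearanceIndex >= MAX_APPEARANCES) return NO_REPLY;
        return replyTo[appearanceIndex][previousMove];
    }
}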

Indeed. My plan is to generalize this scheme with stricter conditions. Maybe we can even combine LGRF with RAVE information (inspired by Arpad Rimmel's work). If the learning works well, it should fix a lot of errors in the rules of my playout features. This might be a way to make the playouts learn how to play semeai moves correctly.

 Aja
