The discussion changed the subject to joseki, but the
paper is not about joseki at all. (There is a poster
about joseki in the same website.)

The "Power of Forgetting" is an improvement to the
previous Last-Good Reply idea.

The results are spectacular and implementation is
super-simple. Looks like RAVE applied to playouts,
the simple heuristic that beats more ambitious ideas.

It improves a MoGo-like policy: (capture, escape, 3x3)
from ~10% to ~35% with 8K playouts and from ~25% to ~65%
with 32K playouts. (Winrate against GnuGo) in 19x19!

Something i guess, everybody will want to try.

I will.


Jacques.

_______________________________________________
Computer-go mailing list
[email protected]
http://dvandva.org/cgi-bin/mailman/listinfo/computer-go

Reply via email to