On Mon, Nov 09, 2009 at 10:18:25AM -0800, Peter Drake wrote:
> I'm actually looking for something weaker than what Olivier has
> offered: a published report of the empirical finding that (for some
> programs, at least) an exploration coefficient of zero works best.
I think you could use "Combining expert, offline, transient and online
knowledge in Monte-Carlo exploration" for that, since there is presented
an AMAF equation without any exploration term, and the final equation
has no exploration term either.
--
Petr "Pasky" Baudis
A lot of people have my books on their bookshelves.
That's the problem, they need to read them. -- Don Knuth
_______________________________________________
computer-go mailing list
[email protected]
http://www.computer-go.org/mailman/listinfo/computer-go/