On 25-10-17 16:00, Petr Baudis wrote: >> The original paper has the value they used. But this likely needs tuning. I >> would tune with a supervised network to get started, but you need games for >> that. Does it even matter much early on? The network is random :) > > The network actually adapts quite rapidly initially, in my experience. > (Doesn't mean it improves - it adapts within local optima of the few > games it played so far.)
Yes, but once there's structure, you can tune the parameter with CLOP or whatever. > Yes, but why wouldn't you want that randomness in the second or third > move? You only need to play a different move at the root in order for the game to deviate. -- GCP _______________________________________________ Computer-go mailing list Computer-go@computer-go.org http://computer-go.org/mailman/listinfo/computer-go