I implemented the ad hoc rule of not training on positions after the first
pass, and my program is basically playing moves until the first pass is forced.
(It is not a “zero” program, so I don’t mind ad hoc rules like this.)
From: Computer-go [mailto:computer-go-boun...@computer-go.org] On
It might make sense to enable resignation threshold even on stupid
level. As such the first thing the network should learn would be not to
resign to early (even before not passing)
Le 02/12/2017 à 18:17, Brian Sheppard via Computer-go a écrit :
>
> I have some hard data now. My network’s initial
I have some hard data now. My network’s initial training reached the same
performance in half the iterations. That is, the steepness of skill gain in the
first day of training was twice as great when I avoided training on fill-ins.
The has all the usual caveats: only one run before/after,