AlphaGo Zero started with random values in
its neural net - and reached top level
within 72 hours.

Would it typically help or disrupt to start
instead with values that are non-random?
What I have in mind concretely:

Look at 19x19 Go with komi=5.5
In run A you start with random values in the net.
In another run B you start with the values that had
emerged in the 7.5-NN after 72 hours. 

Would typically A or B learn better?
Would there be a danger that B would not be able 
to leave the 7.5-"solution"?

It is a pity that I/we do not have the hardware of 
AlphaGo Zero at hand for such experiments.

Ingo.
_______________________________________________
Computer-go mailing list
Computer-go@computer-go.org
http://computer-go.org/mailman/listinfo/computer-go

Reply via email to