I was writing code along those lines when AlphaGo debuted. When it became clear that AlphaGo had succeeded, I ceased work.
So I don’t know whether this strategy will succeed, but the theoretical merits were good enough to encourage me.

Best of luck,
Brian

From: Computer-go [mailto:[email protected]] On Behalf Of Bo Peng
Sent: Tuesday, January 10, 2017 5:25 PM
To: [email protected]
Subject: [Computer-go] Training the value network (a possibly more efficient approach)

Hi everyone. It occurs to me there might be a more efficient method to train the value network directly (without using the policy network).

You are welcome to check my method: http://withablink.com/GoValueFunction.pdf

Let me know if there are any silly mistakes :)
_______________________________________________
Computer-go mailing list
[email protected]
http://computer-go.org/mailman/listinfo/computer-go
