Has anyone ever tried to build a value network that is trained on finished positions? I admit that would be less awesome than what AlphaGo's value network has achieved. But reducing the task to status and scoring might help in endgame play. Generalizing this, there could be several value networks, each specialized on a different stage of the game.
_______________________________________________ Computer-go mailing list Computer-go@computer-go.org http://computer-go.org/mailman/listinfo/computer-go