Has anyone ever tried to build a value network that is trained on finished
positions?
I admit that would be less awesome than what AlphaGo's value network has
achieved.
But reducing the task to status and scoring might help in endgame play.
Generalizing this, there could be several value networks, each specialized
on a different stage of the game.
_______________________________________________
Computer-go mailing list
Computer-go@computer-go.org
http://computer-go.org/mailman/listinfo/computer-go

Reply via email to