On 05.01.2017 17:32, Jim O'Flaherty wrote:
I don't follow.

1) "For each arcane position reached, there would now be ample data for AlphaGo to train on that particular pathway." is false. See below.

2) "two strategies. The first would be to avoid the state in the first place." Does AlphaGo have any strategy ever? If it does, does it have strategies of avoiding certain types of positions?

3) "the second would be to optimize play in that particular state." If you mean optimise play = maximise winning probability.

But optimising this is hard when, under positional superko, optimal play can be ca. 13,500,000 moves long and the game tree to that depth is astronomically large. Even TPU-accelerated sampling is lost there.
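
A rough back-of-envelope illustrates the scale, assuming an average branching factor of about 250 (a common estimate for 19x19 Go; the exact value does not matter here):

  250^13,500,000 = 10^(13,500,000 * log10(250)) ~ 10^32,000,000 nodes.

No sampling scheme visits more than a vanishing fraction of such a tree, and even a single optimal line of play is already millions of moves long.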

Afterwards, there is still only one position from which to train. For NN learning, one position is not nearly enough (AlphaGo's supervised policy network was reportedly trained on some 30 million positions), and it cannot replace analysis by mathematical proofs as long as the NN does not emulate mathematical proving.

--
robert jasiek
