Re: [Computer-go] Nochi: Slightly successful AlphaGo Zero replication

2017-11-15 Thread Gian-Carlo Pascutto
On 11-11-17 00:58, Petr Baudis wrote: >>> * The neural network is updated after _every_ game, _twice_, on _all_ >>> positions plus 64 randomly sampled positions from the entire history, >>> this all done four times - on original position and the three >>> symmetry flips (but I was
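The augmentation quoted above (training on the original position plus three symmetry flips) can be sketched as follows. This is a minimal illustration, not Nochi's actual code. The function name `flip_symmetries`, the `(channels, N, N)` input layout, and the policy layout (N*N board moves followed by one pass entry) are assumptions, as is the choice of the three flips (rows, columns, both).

```python
import numpy as np

def flip_symmetries(planes, policy):
    """Return the original sample plus three flipped copies.

    planes: array of shape (C, N, N) holding the input feature planes.
    policy: array of shape (N*N + 1,), board moves first, pass last (assumed layout).
    The policy target is flipped the same way as the board so the two stay aligned.
    """
    n = planes.shape[-1]
    board_policy = policy[:-1].reshape(n, n)
    pass_prob = policy[-1:]
    samples = []
    for flip_rows, flip_cols in [(False, False), (True, False),
                                 (False, True), (True, True)]:
        p, q = planes, board_policy
        if flip_rows:
            p, q = p[:, ::-1, :], q[::-1, :]
        if flip_cols:
            p, q = p[:, :, ::-1], q[:, ::-1]
        samples.append((np.ascontiguousarray(p),
                        np.concatenate([q.reshape(-1), pass_prob])))
    return samples

# Usage on a 7x7 board with two feature planes and all policy mass on the top-left point.
planes = np.arange(2 * 7 * 7, dtype=float).reshape(2, 7, 7)
policy = np.zeros(7 * 7 + 1)
policy[0] = 1.0
samples = flip_symmetries(planes, policy)
```

Each training position then contributes four samples; the value target is unchanged by a flip, so it can be reused as is.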

Re: [Computer-go] Nochi: Slightly successful AlphaGo Zero replication

2017-11-10 Thread Petr Baudis
On Fri, Nov 10, 2017 at 03:40:27PM +0100, Gian-Carlo Pascutto wrote: > On 10/11/2017 1:47, Petr Baudis wrote: > > > * AlphaGo used 19 resnet layers for 19x19, so I used 7 layers for 7x7. > > How many filters per layer? 256 like AlphaGo. > FWIW 7 layer resnet (14 + 2 layers) is still pretty

Re: [Computer-go] Nochi: Slightly successful AlphaGo Zero replication

2017-11-10 Thread Gian-Carlo Pascutto
On 10/11/2017 1:47, Petr Baudis wrote: > * AlphaGo used 19 resnet layers for 19x19, so I used 7 layers for 7x7. How many filters per layer? FWIW 7 layer resnet (14 + 2 layers) is still pretty huge - larger than the initial AlphaGo. Given the amount of games you have, and the size of the
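To make the "14 + 2 layers" remark concrete, here is a rough count of the convolution weights in the residual tower alone. It assumes two 3x3 convolutions per residual block (as in the AlphaGo Zero paper) and uses the 256 filters Petr gives in his reply; biases, batch normalization, and the two extra layers are ignored. The function name is hypothetical.

```python
def residual_tower_conv_weights(blocks, filters, kernel=3, convs_per_block=2):
    """Count convolution weights in a residual tower (no biases, no batch norm)."""
    per_conv = filters * filters * kernel * kernel  # filters in x filters out x kernel area
    return blocks * convs_per_block * per_conv

conv_layers = 7 * 2                                  # 14 convolutional layers in 7 blocks
weights = residual_tower_conv_weights(7, 256)        # 8,257,536 weights
print(conv_layers, weights)
```

Roughly 8 million weights in the tower is a lot of capacity for a 49-point board, which is the concern raised here.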

Re: [Computer-go] Nochi: Slightly successful AlphaGo Zero replication

2017-11-10 Thread Álvaro Begué
It's a model written using the Keras neural network library: https://en.wikipedia.org/wiki/Keras On Fri, Nov 10, 2017 at 7:09 AM, Xavier Combelle wrote: > You make me really curious, > what is a Keras model? > > On 10/11/2017 at 01:47, Petr Baudis wrote: > > Hi,

Re: [Computer-go] Nochi: Slightly successful AlphaGo Zero replication

2017-11-10 Thread Darren Cook
> You make me really curious, what is a Keras model? When I was a lad, you had to bike 3 miles (uphill in both directions) to the library to satisfy curiosity. Nowadays you just type "keras" into Google ;-) https://keras.io/ Darren

Re: [Computer-go] Nochi: Slightly successful AlphaGo Zero replication

2017-11-10 Thread Xavier Combelle
You make me really curious, what is a Keras model? On 10/11/2017 at 01:47, Petr Baudis wrote: > Hi, > > I got first *somewhat* positive results in my attempt to reproduce > AlphaGo Zero - 25% winrate against GNUGo on the easiest reasonable task > - 7x7 board. :) a.k.a. > >

Re: [Computer-go] Nochi: Slightly successful AlphaGo Zero replication

2017-11-10 Thread Petr Baudis
On Fri, Nov 10, 2017 at 01:47:17AM +0100, Petr Baudis wrote: > This is a truly "zero-knowledge" system like AlphaGo Zero - it needs > no supervision, and it contains no Monte Carlo simulations or other > heuristics. But it's not entirely 1:1, I did some tweaks which I thought > might help early

Re: [Computer-go] Nochi: Slightly successful AlphaGo Zero replication

2017-11-09 Thread Ingo Althöfer
mputer-go@computer-go.org > Subject: [Computer-go] Nochi: Slightly successful AlphaGo Zero replication > > Hi, > > I got first *somewhat* positive results in my attempt to reproduce > AlphaGo Zero - 25% winrate against GNUGo on the easiest reasonable task > - 7x7 board.

[Computer-go] Nochi: Slightly successful AlphaGo Zero replication

2017-11-09 Thread Petr Baudis
Hi, I got first *somewhat* positive results in my attempt to reproduce AlphaGo Zero - 25% winrate against GNUGo on the easiest reasonable task - 7x7 board. :) a.k.a. "Sometimes beating GNUGo on a tiny board" without human knowledge (much wow!) Normally this would be a pretty