Re: [Computer-go] Training an AlphaGo Zero-like algorithm with limited hardware on 7x7 boards

2020-01-26 Thread cody2007 via Computer-go
Thanks again for your thoughts and experiences Rémi and Igor. I'm still puzzled by what is making training slower for me than Rémi (although I wouldn't be surprised if Igor's results were faster when matched for hardware, model size, strength etc-- see below). Certainly komi sounds like it migh

Re: [Computer-go] Training an AlphaGo Zero-like algorithm with limited hardware on 7x7 boards

2020-01-25 Thread cody2007 via Computer-go
Hi Rémi, Thanks for your comments! I am not using any komi and had not given much thought to it. Although, I suppose by having black win most games, I'm depriving the network of its only learning signal. I will have to try with an appropriately set komi next... >When I started to develop the Z

[Computer-go] Training an AlphaGo Zero-like algorithm with limited hardware on 7x7 boards

2020-01-25 Thread cody2007 via Computer-go
Hi All, I wanted to share an update to a post I wrote last year about using the AlphaGo Zero algorithm on small boards (7x7). I train for approximately 2 months on a single desktop PC with 2 GPU cards. In the article I was getting mediocre performance from the networks. Now, I've found that th

Re: [Computer-go] AlphaZero tensorflow implementation/tutorial

2018-12-09 Thread cody2007 via Computer-go
>> ‐‐‐ Original Message ‐‐‐ >>> On Sunday, December 9, 2018 8:51 PM, uurtamo wrote: >>> >>>> A "scoring estimate" by definition should be weaker than the computer >>>> players it's evaluating until there are no more captures poss

Re: [Computer-go] AlphaZero tensorflow implementation/tutorial

2018-12-09 Thread cody2007 via Computer-go
e: >> >>> A "scoring estimate" by definition should be weaker than the computer >>> players it's evaluating until there are no more captures possible. >>> >>> Yes? >>> >>> s. >>> >>> On Sun, Dec 9, 20

[Computer-go] Fw: Re: AlphaZero tensorflow implementation/tutorial

2018-12-09 Thread cody2007 via Computer-go
ss) ? >> The alphazero paper is not clear about it. >> >> b) Do you need to shuffle batches if you are doing one epoch? Also after >> generating game positions from each game, >> do you shuffle those postions? I found the latter to be very important to >> avoid ov

Re: [Computer-go] AlphaZero tensorflow implementation/tutorial

2018-12-09 Thread cody2007 via Computer-go
t;> By the way, why only 40 moves? That seems like the wrong place to economize, >> but maybe on 7x7 it's fine? >> >> s. >> >> On Sun, Dec 9, 2018, 5:23 PM cody2007 via Computer-go >> > >>> Thanks for your comments. >>> >>>>

Re: [Computer-go] AlphaZero tensorflow implementation/tutorial

2018-12-09 Thread cody2007 via Computer-go
t gnugo, use --play-out-aftermath of gnugo parameter > > If I don't mistake a competitive ai would need a lot more training such what > does leela zero https://github.com/gcp/leela-zero > > Le 10/12/2018 à 01:25, cody2007 via Computer-go a écrit : > >> Hi all, >>

[Computer-go] AlphaZero tensorflow implementation/tutorial

2018-12-09 Thread cody2007 via Computer-go
Hi all, I've posted an implementation of the AlphaZero algorithm and brief tutorial. The code runs on a single GPU. While performance is not that great, I suspect its mostly been limited by hardware limitations (my training and evaluation has been on a single Titan X). The network can beat GNU