Thanks again for your thoughts and experiences Rémi and Igor.
I'm still puzzled by what is making training slower for me than Rémi (although
I wouldn't be surprised if Igor's results were faster when matched for
hardware, model size, strength etc-- see below). Certainly komi sounds like it
migh
Hi Rémi,
Thanks for your comments! I am not using any komi and had not given much
thought to it. Although, I suppose by having black win most games, I'm
depriving the network of its only learning signal. I will have to try with an
appropriately set komi next...
>When I started to develop the Z
Hi All,
I wanted to share an update to a post I wrote last year about using the AlphaGo
Zero algorithm on small boards (7x7). I train for approximately 2 months on a
single desktop PC with 2 GPU cards.
In the article I was getting mediocre performance from the networks. Now, I've
found that th
>> ‐‐‐ Original Message ‐‐‐
>>> On Sunday, December 9, 2018 8:51 PM, uurtamo wrote:
>>>
>>>> A "scoring estimate" by definition should be weaker than the computer
>>>> players it's evaluating until there are no more captures poss
e:
>>
>>> A "scoring estimate" by definition should be weaker than the computer
>>> players it's evaluating until there are no more captures possible.
>>>
>>> Yes?
>>>
>>> s.
>>>
>>> On Sun, Dec 9, 20
ss) ?
>> The alphazero paper is not clear about it.
>>
>> b) Do you need to shuffle batches if you are doing one epoch? Also after
>> generating game positions from each game,
>> do you shuffle those postions? I found the latter to be very important to
>> avoid ov
t;> By the way, why only 40 moves? That seems like the wrong place to economize,
>> but maybe on 7x7 it's fine?
>>
>> s.
>>
>> On Sun, Dec 9, 2018, 5:23 PM cody2007 via Computer-go
>> >
>>> Thanks for your comments.
>>>
>>>>
t gnugo, use --play-out-aftermath of gnugo parameter
>
> If I don't mistake a competitive ai would need a lot more training such what
> does leela zero https://github.com/gcp/leela-zero
>
> Le 10/12/2018 à 01:25, cody2007 via Computer-go a écrit :
>
>> Hi all,
>>
Hi all,
I've posted an implementation of the AlphaZero algorithm and brief tutorial.
The code runs on a single GPU. While performance is not that great, I suspect
its mostly been limited by hardware limitations (my training and evaluation has
been on a single Titan X). The network can beat GNU