AlphaGo's zero plane of the policy network is used as the color 
feature for the value network (Extended Data Table 2, page 31).  
These networks share the same architecture so that the value 
network can be initialized by the policy network before 
training.

Hideki

Brian Lee: <CAGCyZsfZyRtA-P+B4G8f+E=to-cu6x2x4vbxaempmqpd4m2...@mail.gmail.com>:
>I've been wondering about something I've seen in a few papers (AlphaGo's
>paper, Cazenave's resnet policy architecture), which is the presence of an
>input plane filled with 0s.
>
>The input features also typically include a plane of 1s, which makes sense
>to me - zero-padding before a convolution means that the 0/1 demarcation
>line tells the CNN where the edge of the board is. But as far as I can
>tell, a plane of constant 0s should do absolutely nothing. Can anyone
>enlighten me?
>---- inline file
>_______________________________________________
>Computer-go mailing list
>Computer-go@computer-go.org
>http://computer-go.org/mailman/listinfo/computer-go
-- 
Hideki Kato <mailto:hideki_ka...@ybb.ne.jp>
_______________________________________________
Computer-go mailing list
Computer-go@computer-go.org
http://computer-go.org/mailman/listinfo/computer-go

Reply via email to