[ 
https://issues.apache.org/jira/browse/SINGA-115?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sheng Wang resolved SINGA-115.
------------------------------
    Resolution: Fixed
      Assignee: wangwei

> Print layer debug information in the neural net graph file
> ----------------------------------------------------------
>
>                 Key: SINGA-115
>                 URL: https://issues.apache.org/jira/browse/SINGA-115
>             Project: Singa
>          Issue Type: New Feature
>            Reporter: wangwei
>            Assignee: wangwei
>
> It is non-trivial to debug the code for deep learning, e.g., the BP 
> algorithm, the hybrid partitioning and layer implementation. 
> In SINGA, we print the neural net in INFO log as json string, which can be 
> converted into an image with the net graph (nodes are layers). This graph can 
> be used to check the neural net configuration, e.g., layer connection and 
> neural net partitioning. However, it does not collect the run time data, 
> e.g., gradient norm or value norm of each layer, which is important to debug  
> accuracy etc. bugs.
> In this ticket, we will collect the gradient and value norm of each layer and 
> each Param object. These information will be printed as attributes (or 
> sub-nodes) of the layer node in the neural net graph. Users/developers can 
> located the bugs by inspecting the graph after converting the json string 
> into an image.
> Particularly, uses can set the disp_freq to 1 and running steps to a small 
> number, e.g., 5. Then 5 neural net graphs will be printed, one per step. The 
> debug option should be turned on in the job.conf file for printing.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to