[GitHub] [incubator-mxnet] matteosal commented on issue #21111: cuDNN batchnorm behaviour is not consistent and it can output nan

GitBox Fri, 02 Sep 2022 09:47:48 -0700


matteosal commented on issue #21111:
URL: 
https://github.com/apache/incubator-mxnet/issues/21111#issuecomment-1235713824


   So, to summarize the issues I found with the cuDNN implementation:
   
   1) The moving arrays (moving mean and moving variance) are normally updated only when a BACKWARD pass is performed in training mode, but on GPU with cuDNN they are already changed by the FORWARD pass.
   2) In training mode, all implementations compute the **biased** data variance during the forward pass, but the cuDNN implementation uses the **unbiased** data variance to update the moving variance.
   
   So the cuDNN implementation updates the moving variance with a different value (the unbiased variance) and also at a different time (during the forward pass).
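To make the size of the difference in point 2 concrete, here is a minimal pure-Python sketch. It is not MXNet or cuDNN code: the helper `update_moving_var`, the toy batch, the starting moving variance of 1.0 and the momentum of 0.9 are all assumptions chosen for illustration, and the update is the usual exponential moving average.

```python
from statistics import pvariance, variance


def update_moving_var(moving_var, batch_var, momentum=0.9):
    """Hypothetical helper: exponential moving average of the variance."""
    return momentum * moving_var + (1.0 - momentum) * batch_var


# Toy data for a single channel (assumed values, n = 4 samples).
batch = [1.0, 2.0, 3.0, 4.0]
n = len(batch)

biased_var = pvariance(batch)    # population variance, divides by n
unbiased_var = variance(batch)   # sample variance, divides by n - 1

moving_var = 1.0                 # assumed initial moving variance
mv_biased = update_moving_var(moving_var, biased_var)
mv_unbiased = update_moving_var(moving_var, unbiased_var)

print("biased variance:  ", biased_var)
print("unbiased variance:", unbiased_var)
print("moving var updated with biased value:  ", mv_biased)
print("moving var updated with unbiased value:", mv_unbiased)
```

The two variances differ by a factor of n / (n - 1), so the gap between the resulting moving variances is largest for small batches and shrinks as the batch grows.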


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@mxnet.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

