[GitHub] [incubator-mxnet] nttstar opened a new issue #18475: BatchNorm can not converge with scale=False

GitBox Tue, 02 Jun 2020 19:31:46 -0700


nttstar opened a new issue #18475:
URL: https://github.com/apache/incubator-mxnet/issues/18475



   ## Description
    BatchNorm operator with ``scale=False`` can not converge.
   
   ### Error Message
   No error message, but loss value and training accuracy is abnormal comparing 
with ``scale=True`` BatchNorm.
   
   ## To Reproduce
   We can try ``https://github.com/nttstar/arcface.np`` to train arcface. Add 
one BatchNorm op with ``scale=False`` after final embedding layer
   
   
   ## What have you tried to solve it?
   
   1. Set ``Scale=True``, it can work but with slightly worse test accuracy.
   
   ## Environment
   
   ----------Python Info----------
   Version      : 3.6.9
   Compiler     : GCC 7.3.0
   Build        : ('default', 'Jul 30 2019 19:07:31')
   Arch         : ('64bit', '')
   ------------Pip Info-----------
   Version      : 19.3.1
   Directory    : /root/anaconda2/envs/py36/lib/python3.6/site-packages/pip
   ----------MXNet Info-----------
   Version      : 2.0.0
   Directory    : /root/anaconda2/envs/py36/lib/python3.6/site-packages/mxnet
   Num GPUs     : 8
   Hashtag not found. Not installed from pre-built package.
   ----------System Info----------
   Platform     : Linux-3.10.0-327.el7.x86_64-x86_64-with-centos-7.5.1804-Core
   system       : Linux
   node         : gpu06
   release      : 3.10.0-327.el7.x86_64
   version      : #1 SMP Thu Nov 19 22:10:57 UTC 2015
   


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
[email protected]

[GitHub] [incubator-mxnet] nttstar opened a new issue #18475: BatchNorm can not converge with scale=False

Reply via email to