barry-jin opened a new issue #19056:
URL: https://github.com/apache/incubator-mxnet/issues/19056


   ## Error when training PSPNet on Cityscapes dataset using GluonCV #17439
   
   ### Problem Description
   The problem is when I train a PSPNet using GluonCV semantic segmentation 
library on the Cityscapes dataset, the training will stuck (hang) right after 
it started. 
   
   ### Debugging
   After bisect the date of failure, I find the first bad commit is [PR 
13896](https://github.com/apache/incubator-mxnet/pull/13896), which introduced 
this problem. 
   
   ## Proposed solutions
   Turn off CuDNN by setting `cudnn_off` to `True` in 
[Dropout](https://github.com/apache/incubator-mxnet/blob/9b22c8c2e935cd42ff0f7d339a4b790f5b3367b6/python/mxnet/gluon/nn/basic_layers.py#L271)
   
   ## References
   - list reference and related literature 
   [Issue #17439](https://github.com/apache/incubator-mxnet/issues/17439), [PR 
#13896](https://github.com/apache/incubator-mxnet/pull/13896)
   - list known implementations
   


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
[email protected]



---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to