stephenrawls commented on issue #15278: fixing var-seq-len rnn backward() 
operator
URL: https://github.com/apache/incubator-mxnet/pull/15278#issuecomment-503774511
 
 
   Just to keep the ticket updated:
   
   I have confirmed the following facts:
   
   1. If I set each sequence_length entry to the maximum sequence length, then 
my gradients between the reference net and the var-seq-len net do match
   2. When I set cudnn debugging on, I *am* calling the appropriate "unpacked 
enabled" version of the cudnn api and the appropriate seq-len values are passed 
in.
   i.e. I set:
   ```
   export CUDNN_LOGINFO_DBG=1
   export CUDNN_LOGDEST_DBG=/home/ec2-user/cudnn.dbg.log
   ```
   And I look at the resulting output and see:
   
   I! CuDNN (v7501) function cudnnRNNForwardTrainingEx() called:
   ...
   layout: type=cudnnRNNDataLayout_t; 
val=CUDNN_RNN_DATA_LAYOUT_SEQ_MAJOR_UNPACKED (0);

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
[email protected]


With regards,
Apache Git Services

Reply via email to