Sherry-Zhang commented on issue #10104: [WIP][MXNET-107] Fused RNN 
implementation for CPU
URL: https://github.com/apache/incubator-mxnet/pull/10104#issuecomment-379669403
 
 
   I have finished the fused implementation for LSTM and added multi-layer and 
bidirectional support. While refactoring the cuDNN registration to use the NNVM 
interfaces, we observed a segfault in the CI environment. We could not 
reproduce the error on our local server, but it seems to be caused by the 
memory sharing mechanism between forward and backward computation. We have 
therefore removed the NNVM interfaces from this PR and kept both the CPU path 
and the GPU path on the legacy registration method. The performance table has 
been updated. @szha, @piiswrong, @Jerryzcn, @marcoabreu, @pengzhao-intel, 
@TaoLv, could you help review this PR? Any comments are welcome.

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services
