Sherry-Zhang commented on issue #10104: [WIP][MXNET-107] Fused RNN implementation for CPU
URL: https://github.com/apache/incubator-mxnet/pull/10104#issuecomment-379669403

I have finished the fused implementation for LSTM, including multi-layer and bidirectional support. While refactoring the cuDNN registration to use the NNVM interfaces, we observed a segfault in the CI environment. We cannot reproduce the error on our local server, but it seems to be caused by the memory-sharing mechanism between the forward and backward computation. We have therefore removed the NNVM interfaces from this PR and kept both the CPU path and the GPU path on the legacy registration method. The performance update is shown in the table below.

@szha, @piiswrong, @Jerryzcn, @marcoabreu, @pengzhao-intel, @TaoLv, could you help review this PR? Any comments are welcome.
----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at: us...@infra.apache.org

With regards,
Apache Git Services