Operators are stateless, but I remember there is an optimization switch that enables saving data from the forward pass so it can be reused in the backward pass, which makes the computation faster. @azai91 - Can you please help here?
[ Full content available at: https://github.com/apache/incubator-mxnet/issues/10840 ]
