More efficiency is always great. But im having trouble following the abstraction to 2 or 3 axes. Can you elaborate some more on the approach/implementation?
[ Full content available at: https://github.com/apache/incubator-mxnet/pull/12430 ] This message was relayed via gitbox.apache.org for [email protected]
