Now it raises a really weird "CUDA Misaligned Memory Error". I currently have no idea what the root cause is. So far, it happens when we initialize ret_mask to all zeros.
[ Full content available at: https://github.com/apache/incubator-mxnet/pull/12446 ] This message was relayed via gitbox.apache.org for [email protected]
