I asked the same question [here](https://github.com/apache/incubator-mxnet/issues/20531) and found out that the array does get created, but only after roughly 15 min.
The problem can be fixed by setting the environment variable "CUDA_CACHE_MAXSIZE". 1 GiB was a good value, at least for me. After this, the 15 min wait only occurs the first time a GPU is used, and subsequent runs are a lot faster, even if Python is closed in between.
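For anyone who wants to set the variable from within Python instead of the shell, here is a minimal sketch. It assumes the variable has to be set before MXNet (and with it the CUDA driver) is loaded in the process, and that the value is given in bytes, so 1 GiB is written as 1073741824. The MXNet lines are left as comments so the snippet runs without a GPU; the array shape there is only an illustration.

```python
import os

# 1 GiB expressed in bytes; CUDA_CACHE_MAXSIZE is assumed to take a byte count.
ONE_GIB = 1024 ** 3

# Set the variable before importing mxnet, so the CUDA driver sees it
# when it initializes in this process.
os.environ["CUDA_CACHE_MAXSIZE"] = str(ONE_GIB)

print("CUDA_CACHE_MAXSIZE =", os.environ["CUDA_CACHE_MAXSIZE"])

# Only after this point import MXNet and touch the GPU, for example:
# import mxnet as mx
# a = mx.nd.ones((2, 3), ctx=mx.gpu(0))
```

Setting it this way only affects the current process and its children. To make it permanent, set it in the shell profile or in the system environment settings instead. The first GPU use after the change should still be slow, and later runs should then be faster.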
