nacorti commented on issue #15320: Weird C++ Error / Bug when calling asnumpy() or exporting the weight of darknet53 while training URL: https://github.com/apache/incubator-mxnet/issues/15320#issuecomment-532857192 Had that problem already, this is somehow different. This occurs when I try to run `for i, batch in enumerate(train_loader):` on a DataLoader object. I'm using AWS Sagemaker, and this enumerate command succeeds when I'm running in a notebook on an ml.t2.medium instance but fails when I try to run on an ml.p2.xlarge instance, after outputting some malloc lines that don't appear when I'm running on ml.t2.medium `src/operator/nn/mkldnn/mkldnn_base.cc:74: Allocate 22151168 bytes with malloc directly src/operator/nn/mkldnn/mkldnn_base.cc:74: Allocate 73728 bytes with malloc directly` I have a feeling this has something to do with underlying architecture issues, but it also might be an issue of me using a custom dataset.
---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [email protected] With regards, Apache Git Services
