xcgoner commented on issue #11763: When Train SSD, It hold on during read the data URL: https://github.com/apache/incubator-mxnet/issues/11763#issuecomment-408318271 @burness Understood. My concern is that sometimes using multiple worker threads in the dataloader may exhaust the cpu resources and make the training stuck. You can try to set this option to be 1 or even 0 (0 will trigger a single-thread dataloader in a different mode) and check whether it is still stuck or not. So that we can identify what is exactly the problem there. By the way, I noticed that in your snapshot, there is a typo. It should be "num-workers" instead of "num-worker"
---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [email protected] With regards, Apache Git Services
