rahul003 commented on issue #10696: [MXNET-366]Extend MXNet Distributed Training by MPI AllReduce
URL: https://github.com/apache/incubator-mxnet/pull/10696#issuecomment-386408856

About the numbers above: does 'Local Batch Size: 64' refer to a batch of 64 spread across multiple GPUs? Can you run a benchmark with a larger batch size, such as 512 with 8 GPUs, for ResNet-50 on ImageNet and compare the scalability?
