zdaxie commented on issue #11777: Distributed Training: looks like async from the log although setting the kv-store=dist_device_sync
URL: https://github.com/apache/incubator-mxnet/issues/11777#issuecomment-407966926

Are there any solutions for this issue so far? The former version of mxnet works fine in terms of synchronization and accuracy; the only remaining problem is that it still gets stuck during the last epoch.

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
[email protected]

With regards,
Apache Git Services
