This is an automated email from the ASF dual-hosted git repository.
haibin pushed a change to branch master
in repository https://gitbox.apache.org/repos/asf/incubator-mxnet.git.
from fb73a17 Switch to C++17 and modernize toolchain + CI (#17984)
add e796ae9 Integrate Horovod training API as part of MXNet native
distributed training API (#17531)
No new revisions were added by this update.
Summary of changes:
ci/docker/runtime_functions.sh | 5 +-
.../{cifar10_dist.py => cifar10_kvstore_hvd.py} | 243 ++++++++++++---------
python/mxnet/gluon/trainer.py | 1 +
python/mxnet/kvstore/__init__.py | 1 +
python/mxnet/kvstore/horovod.py | 161 ++++++++++++++
python/mxnet/kvstore/kvstore.py | 3 +
tests/nightly/dist_device_sync_kvstore_horovod.py | 80 +++++++
tests/nightly/test_distributed_training-gpu.sh | 11 +-
tools/launch.py | 63 +++---
9 files changed, 429 insertions(+), 139 deletions(-)
copy example/distributed_training/{cifar10_dist.py => cifar10_kvstore_hvd.py}
(52%)
create mode 100644 python/mxnet/kvstore/horovod.py
create mode 100644 tests/nightly/dist_device_sync_kvstore_horovod.py