roywei edited a comment on issue #15152: [CI][nightly] nightly test tutorial
failure: test_tutorials.test_python_kvstore
URL:
https://github.com/apache/incubator-mxnet/issues/15152#issuecomment-499193612
root cause is num of GPUs changed from 2 to 1. `NODE_LINUX_GPU` is G3.8x
wiht 2 GPUs
roywei edited a comment on issue #15152: [CI][nightly] nightly test tutorial
failure: test_tutorials.test_python_kvstore
URL:
https://github.com/apache/incubator-mxnet/issues/15152#issuecomment-498939914
Current conclusion is this only happens on CI machines with
`NODE_LINUX_GPU_P3`
roywei edited a comment on issue #15152: [CI][nightly] nightly test tutorial
failure: test_tutorials.test_python_kvstore
URL:
https://github.com/apache/incubator-mxnet/issues/15152#issuecomment-498939531
However, I was not able to reproduce this on a EC2 P3.8xLarge instance.
All
roywei edited a comment on issue #15152: [CI][nightly] nightly test tutorial
failure: test_tutorials.test_python_kvstore
URL:
https://github.com/apache/incubator-mxnet/issues/15152#issuecomment-498939531
However, I was not able to reproduce this on a EC2 P3.8xLarge instance.
All
roywei edited a comment on issue #15152: [CI][nightly] nightly test tutorial
failure: test_tutorials.test_python_kvstore
URL:
https://github.com/apache/incubator-mxnet/issues/15152#issuecomment-498939531
However, I was not able to reproduce this on a EC2 P3.8xLarge instance.
All
roywei edited a comment on issue #15152: [CI][nightly] nightly test tutorial
failure: test_tutorials.test_python_kvstore
URL:
https://github.com/apache/incubator-mxnet/issues/15152#issuecomment-498938781
This tutorial test was passing when running on 1 GPU machine.