[GitHub] [incubator-mxnet] roywei commented on issue #15152: [CI][nightly] nightly test tutorial failure: test_tutorials.test_python_kvstore

2019-06-05 Thread GitBox
roywei commented on issue #15152: [CI][nightly] nightly test tutorial failure: test_tutorials.test_python_kvstore URL: https://github.com/apache/incubator-mxnet/issues/15152#issuecomment-499193612 root cause is num of GPUs changed from 2 to 1. `NODE_LINUX_GPU` is G3.8x wiht 2 GPUs and

[GitHub] [incubator-mxnet] roywei commented on issue #15152: [CI][nightly] nightly test tutorial failure: test_tutorials.test_python_kvstore

2019-06-05 Thread GitBox
roywei commented on issue #15152: [CI][nightly] nightly test tutorial failure: test_tutorials.test_python_kvstore URL: https://github.com/apache/incubator-mxnet/issues/15152#issuecomment-499149917 able to reproduce now on 3.2xlarge

[GitHub] [incubator-mxnet] roywei commented on issue #15152: [CI][nightly] nightly test tutorial failure: test_tutorials.test_python_kvstore

2019-06-04 Thread GitBox
roywei commented on issue #15152: [CI][nightly] nightly test tutorial failure: test_tutorials.test_python_kvstore URL: https://github.com/apache/incubator-mxnet/issues/15152#issuecomment-498939914 Current conclusion is this only happens on CI machines with `NODE_LINUX_GPU_P3`

[GitHub] [incubator-mxnet] roywei commented on issue #15152: [CI][nightly] nightly test tutorial failure: test_tutorials.test_python_kvstore

2019-06-04 Thread GitBox
roywei commented on issue #15152: [CI][nightly] nightly test tutorial failure: test_tutorials.test_python_kvstore URL: https://github.com/apache/incubator-mxnet/issues/15152#issuecomment-498939531 However, I was not able to reproduce this on a EC2 P3.8xLarge instance. All tutorial test

[GitHub] [incubator-mxnet] roywei commented on issue #15152: [CI][nightly] nightly test tutorial failure: test_tutorials.test_python_kvstore

2019-06-04 Thread GitBox
roywei commented on issue #15152: [CI][nightly] nightly test tutorial failure: test_tutorials.test_python_kvstore URL: https://github.com/apache/incubator-mxnet/issues/15152#issuecomment-498938781 This tutorial test was passing when running on 1 GPU machine.