leezu commented on issue #18025:
URL: https://github.com/apache/incubator-mxnet/pull/18025#issuecomment-617994912


   This does not seem to be a stability issue but a bug. The same failure occurred again:
   
   ```
   [2020-04-22T19:21:39.589Z] Traceback (most recent call last):
   [2020-04-22T19:21:39.589Z]   File "dist_device_sync_kvstore.py", line 130, in <module>
   [2020-04-22T19:21:39.589Z]     test_sync_push_pull()
   [2020-04-22T19:21:39.589Z]   File "dist_device_sync_kvstore.py", line 88, in test_sync_push_pull
   [2020-04-22T19:21:39.589Z]     check_default_keys(kv, my_rank, nworker, nrepeat=3)
   [2020-04-22T19:21:39.589Z]   File "dist_device_sync_kvstore.py", line 72, in check_default_keys
   [2020-04-22T19:21:39.589Z]     check_diff_to_scalar(val, num)
   [2020-04-22T19:21:39.589Z]   File "dist_device_sync_kvstore.py", line 29, in check_diff_to_scalar
   [2020-04-22T19:21:39.589Z]     assert(np.sum(np.abs((A - x).asnumpy())) == 0), (rank, A.asnumpy(), x)
   [2020-04-22T19:21:39.589Z] AssertionError: (None, array([[0.6, 0.6, 0.6],
   [2020-04-22T19:21:39.589Z]        [0.6, 0.6, 0.6]], dtype=float32), 41.0)
   ```
   
   @szha 
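
For readers without the test file at hand, the failing check can be approximated in plain NumPy. This is only a minimal sketch, assuming the helper does nothing beyond the assertion line shown in the traceback. The real test operates on MXNet NDArrays, so the `.asnumpy()` calls are dropped here, and the `rank=None` default is an assumption chosen to match the `None` in the error tuple.

```python
import numpy as np

def check_diff_to_scalar(A, x, rank=None):
    """NumPy stand-in: assert that every element of A equals the scalar x."""
    A = np.asarray(A)
    # Same comparison as in the traceback, minus the NDArray conversion.
    assert np.sum(np.abs(A - x)) == 0, (rank, A, x)

# Values copied from the AssertionError in the log above.
pulled = np.full((2, 3), 0.6, dtype=np.float32)
expected = 41.0

try:
    check_diff_to_scalar(pulled, expected)
    failed = False
except AssertionError:
    failed = True
```

With these values the pulled array differs from the expected scalar in every element, so the assertion fires regardless of timing. That is consistent with the comment's reading that this is a bug rather than a flaky run, although the sketch says nothing about where the wrong value originates.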


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
[email protected]

