haojin2 commented on issue #16951: CentOS GPU tests failing in master
URL: 
https://github.com/apache/incubator-mxnet/issues/16951#issuecomment-560046715
 
 
   For more info, I've made a change to print the cublaserror's error message 
out:
   ```
   terminate called after throwing an instance of 'dmlc::Error'
   
     what():  [05:07:32] /work/mxnet/include/mshadow/./stream_gpu-inl.h:125: 
Check failed: err == CUBLAS_STATUS_SUCCESS (7 vs. 0) : Destory cublas handle 
failed with error CUBLAS_STATUS_INVALID_VALUE
   
   Stack trace:
   
     [bt] (0) 
build/tests/mxnet_unit_tests(dmlc::LogMessageFatal::~LogMessageFatal()+0x32) 
[0x194a5f2]
   
     [bt] (1) 
build/tests/mxnet_unit_tests(mshadow::Stream<mshadow::gpu>::DestroyBlasHandle()+0x14f)
 [0x1985b2f]
   
     [bt] (2) build/tests/mxnet_unit_tests(void 
mshadow::DeleteStream<mshadow::gpu>(mshadow::Stream<mshadow::gpu>*)+0xb7) 
[0x1986617]
   
     [bt] (3) build/tests/mxnet_unit_tests(mshadow::Stream<mshadow::gpu>* 
mshadow::NewStream<mshadow::gpu>(bool, bool, int)+0x30b) [0x1986c4b]
   
     [bt] (4) 
build/tests/mxnet_unit_tests(mxnet::test::op::GPUStreamScope::GPUStreamScope(mxnet::OpContext*)+0xfd)
 [0x198888d]
   
     [bt] (5) 
build/tests/mxnet_unit_tests(std::__shared_ptr<mxnet::test::op::CoreOpExecutor<float,
 float>, 
(__gnu_cxx::_Lock_policy)2>::__shared_ptr<std::allocator<mxnet::test::op::CoreOpExecutor<float,
 float> >, bool, std::vector<mxnet::TShape, std::allocator<mxnet::TShape> > 
>(std::_Sp_make_shared_tag, 
std::allocator<mxnet::test::op::CoreOpExecutor<float, float> > const&, bool&&, 
std::vector<mxnet::TShape, std::allocator<mxnet::TShape> >&&)+0x3c7) [0x19a1e57]
   
     [bt] (6) 
build/tests/mxnet_unit_tests(mxnet::test::OperatorRunner<mxnet::test::op::CoreOpProp,
 mxnet::test::op::CoreOpExecutor<float, float> 
>::RunGenericOperatorForward(bool, std::vector<mxnet::TShape, 
std::allocator<mxnet::TShape> > const&, 
std::vector<std::pair<std::__cxx11::basic_string<char, std::char_traits<char>, 
std::allocator<char> >, std::__cxx11::basic_string<char, 
std::char_traits<char>, std::allocator<char> > >, 
std::allocator<std::pair<std::__cxx11::basic_string<char, 
std::char_traits<char>, std::allocator<char> >, 
std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> > 
> > > const&, unsigned long)+0xb6) [0x19a8546]
   
     [bt] (7) 
build/tests/mxnet_unit_tests(ACTIVATION_PERF_ExecuteBidirectional_Test::TestBody()+0x74e)
 [0x197ebbe]
   
     [bt] (8) build/tests/mxnet_unit_tests(void 
testing::internal::HandleExceptionsInMethodIfSupported<testing::Test, 
void>(testing::Test*, void (testing::Test::*)(), char const*)+0x43) [0x1ab60d3]
   ```
   The error type is `CUBLAS_STATUS_INVALID_VALUE`.

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
[email protected]


With regards,
Apache Git Services

Reply via email to