haojin2 commented on issue #16951: CentOS GPU tests failing in master URL: https://github.com/apache/incubator-mxnet/issues/16951#issuecomment-560046715 For more info, I've made a change to print the cublaserror's error message out: ``` terminate called after throwing an instance of 'dmlc::Error' what(): [05:07:32] /work/mxnet/include/mshadow/./stream_gpu-inl.h:125: Check failed: err == CUBLAS_STATUS_SUCCESS (7 vs. 0) : Destory cublas handle failed with error CUBLAS_STATUS_INVALID_VALUE Stack trace: [bt] (0) build/tests/mxnet_unit_tests(dmlc::LogMessageFatal::~LogMessageFatal()+0x32) [0x194a5f2] [bt] (1) build/tests/mxnet_unit_tests(mshadow::Stream<mshadow::gpu>::DestroyBlasHandle()+0x14f) [0x1985b2f] [bt] (2) build/tests/mxnet_unit_tests(void mshadow::DeleteStream<mshadow::gpu>(mshadow::Stream<mshadow::gpu>*)+0xb7) [0x1986617] [bt] (3) build/tests/mxnet_unit_tests(mshadow::Stream<mshadow::gpu>* mshadow::NewStream<mshadow::gpu>(bool, bool, int)+0x30b) [0x1986c4b] [bt] (4) build/tests/mxnet_unit_tests(mxnet::test::op::GPUStreamScope::GPUStreamScope(mxnet::OpContext*)+0xfd) [0x198888d] [bt] (5) build/tests/mxnet_unit_tests(std::__shared_ptr<mxnet::test::op::CoreOpExecutor<float, float>, (__gnu_cxx::_Lock_policy)2>::__shared_ptr<std::allocator<mxnet::test::op::CoreOpExecutor<float, float> >, bool, std::vector<mxnet::TShape, std::allocator<mxnet::TShape> > >(std::_Sp_make_shared_tag, std::allocator<mxnet::test::op::CoreOpExecutor<float, float> > const&, bool&&, std::vector<mxnet::TShape, std::allocator<mxnet::TShape> >&&)+0x3c7) [0x19a1e57] [bt] (6) build/tests/mxnet_unit_tests(mxnet::test::OperatorRunner<mxnet::test::op::CoreOpProp, mxnet::test::op::CoreOpExecutor<float, float> >::RunGenericOperatorForward(bool, std::vector<mxnet::TShape, std::allocator<mxnet::TShape> > const&, std::vector<std::pair<std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> >, std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> > >, std::allocator<std::pair<std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> >, std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> > > > > const&, unsigned long)+0xb6) [0x19a8546] [bt] (7) build/tests/mxnet_unit_tests(ACTIVATION_PERF_ExecuteBidirectional_Test::TestBody()+0x74e) [0x197ebbe] [bt] (8) build/tests/mxnet_unit_tests(void testing::internal::HandleExceptionsInMethodIfSupported<testing::Test, void>(testing::Test*, void (testing::Test::*)(), char const*)+0x43) [0x1ab60d3] ``` The error type is `CUBLAS_STATUS_INVALID_VALUE`.
---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [email protected] With regards, Apache Git Services
