Neil Conway created MESOS-4739:
----------------------------------
Summary: libprocess CHECK failure in SlaveRecoveryTest/0.ReconnectHTTPExecutor
Key: MESOS-4739
URL: https://issues.apache.org/jira/browse/MESOS-4739
Project: Mesos
Issue Type: Bug
Components: HTTP API, libprocess
Reporter: Neil Conway
{noformat}
[ RUN ] SlaveRecoveryTest/0.ReconnectHTTPExecutor
I0223 09:38:55.434953 11158 executor.cpp:172] Version: 0.28.0
Received a SUBSCRIBED event
Starting task 1
Finishing task 1
Received an ERROR event
Received an ERROR event
E0223 09:38:55.504820 11159 executor.cpp:553] End-Of-File received from agent.
The agent closed the event stream
Received an ERROR event
Received an ERROR event
Received an ERROR event
F0223 09:39:00.535778 22159 process.cpp:1114] Check failed: items.size() > 0
*** Check failure stack trace: ***
Received an ERROR event
Received an ERROR event
@ 0x7f4affd0e754 google::LogMessage::Fail()
Received an ERROR event
Received an ERROR event
Received an ERROR event
Received an ERROR event
@ 0x7f4affd0e6ad google::LogMessage::SendToLog()
@ 0x7f4affd0e0a3 google::LogMessage::Flush()
@ 0x7f4affd10f14 google::LogMessageFatal::~LogMessageFatal()
@ 0x7f4affc618d4 process::HttpProxy::waited()
@ 0x7f4affc8f57f _ZZN7process8dispatchINS_9HttpProxyERKNS_6FutureINS_4http8ResponseEEES5_EEvRKNS_3PIDIT_EEMS9_FvT0_ET1_ENKUlPNS_11ProcessBaseEE_clESI_
@ 0x7f4affcac946 _ZNSt17_Function_handlerIFvPN7process11ProcessBaseEEZNS0_8dispatchINS0_9HttpProxyERKNS0_6FutureINS0_4http8ResponseEEES9_EEvRKNS0_3PIDIT_EEMSD_FvT0_ET1_EUlS2_E_E9_M_invokeERKSt9_Any_dataOS2_
@ 0x7f4affc89961 std::function<>::operator()()
@ 0x7f4affc6ef02 process::ProcessBase::visit()
@ 0x7f4affc74e52 process::DispatchEvent::visit()
@ 0xa3afe8 process::ProcessBase::serve()
@ 0x7f4affc6b073 process::ProcessManager::resume()
@ 0x7f4affc6813b _ZZN7process14ProcessManager12init_threadsEvENKUlRKSt6atomicIbEE_clES4_
@ 0x7f4affc745fa _ZNSt5_BindIFZN7process14ProcessManager12init_threadsEvEUlRKSt6atomicIbEE_St17reference_wrapperIS4_EEE6__callIvJEJLm0EEEET_OSt5tupleIJDpT0_EESt12_Index_tupleIJXspT1_EEE
@ 0x7f4affc745a8 _ZNSt5_BindIFZN7process14ProcessManager12init_threadsEvEUlRKSt6atomicIbEE_St17reference_wrapperIS4_EEEclIJEvEET0_DpOT_
@ 0x7f4affc74556 _ZNSt12_Bind_simpleIFSt5_BindIFZN7process14ProcessManager12init_threadsEvEUlRKSt6atomicIbEE_St17reference_wrapperIS5_EEEvEE9_M_invokeIJEEEvSt12_Index_tupleIJXspT_EEE
@ 0x7f4affc744bf _ZNSt12_Bind_simpleIFSt5_BindIFZN7process14ProcessManager12init_threadsEvEUlRKSt6atomicIbEE_St17reference_wrapperIS5_EEEvEEclEv
@ 0x7f4affc7445e _ZNSt6thread5_ImplISt12_Bind_simpleIFSt5_BindIFZN7process14ProcessManager12init_threadsEvEUlRKSt6atomicIbEE_St17reference_wrapperIS7_EEEvEEE6_M_runEv
@ 0x7f4afa6ddc40 execute_native_thread_routine
@ 0x7f4afadba424 start_thread
@ 0x7f4af9e50cbd __clone
@ (nil) (unknown)
Aborted (core dumped)
{noformat}
This crash was observed in a recent Arch Linux VM (VirtualBox) while
{{stress --cpu 4}} was running concurrently. Reproduced with {{./src/mesos-tests
--gtest_filter="SlaveRecovery*" --gtest_repeat=100 --gtest_break_on_failure}};
it took about 20 iterations to trigger the crash.
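
The reproduction steps above could be scripted roughly as follows. This is only a sketch: the {{repro}} function and the {{MESOS_TESTS}} and {{DRY_RUN}} variables are hypothetical names added for illustration, and it assumes {{stress}} is installed and that the script runs from the Mesos build directory.

```shell
#!/bin/sh
# Sketch: run the SlaveRecovery tests while the CPU is under load.
# MESOS_TESTS and DRY_RUN are hypothetical knobs, not part of Mesos.
MESOS_TESTS="${MESOS_TESTS:-./src/mesos-tests}"

repro() {
  if [ -n "${DRY_RUN:-}" ]; then
    # Only print the commands that would be run.
    echo "stress --cpu 4 &"
    echo "$MESOS_TESTS --gtest_filter=\"SlaveRecovery*\" --gtest_repeat=100 --gtest_break_on_failure"
    return 0
  fi

  # Generate background CPU load, as in the report.
  stress --cpu 4 &
  stress_pid=$!

  # Repeat the tests; --gtest_break_on_failure stops at the first failure.
  "$MESOS_TESTS" --gtest_filter='SlaveRecovery*' --gtest_repeat=100 \
    --gtest_break_on_failure
  status=$?

  # Stop the load generator and report the test exit status.
  kill "$stress_pid" 2>/dev/null
  return "$status"
}
```

Calling {{repro}} after sourcing the script runs the loop; {{DRY_RUN=1 repro}} only prints the commands. The failure is timing dependent, so more than one run of 100 iterations may be needed.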