[jira] [Commented] (MESOS-9560) ContentType/AgentAPITest.MarkResourceProviderGone/1 is flaky
[ https://issues.apache.org/jira/browse/MESOS-9560?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16905214#comment-16905214 ] Benjamin Bannier commented on MESOS-9560: - Reopening since we are still observing similar failures. > ContentType/AgentAPITest.MarkResourceProviderGone/1 is flaky > > > Key: MESOS-9560 > URL: https://issues.apache.org/jira/browse/MESOS-9560 > Project: Mesos > Issue Type: Bug > Components: test >Reporter: Benjamin Bannier >Assignee: Benjamin Bannier >Priority: Critical > Labels: flaky, flaky-test, mesosphere, storage, test > Fix For: 1.9.0 > > Attachments: consoleText.txt > > > We observed a segfault in > {{ContentType/AgentAPITest.MarkResourceProviderGone/1}} on test teardown. > {noformat} > I0131 23:55:59.378453 6798 slave.cpp:923] Agent terminating > I0131 23:55:59.378813 31143 master.cpp:1269] Agent > a27bcaba-70cc-4ec3-9786-38f9512c61fd-S0 at slave(1112)@172.16.10.236:43229 > (ip-172-16-10-236.ec2.internal) disconnected > I0131 23:55:59.378831 31143 master.cpp:3272] Disconnecting agent > a27bcaba-70cc-4ec3-9786-38f9512c61fd-S0 at slave(1112)@172.16.10.236:43229 > (ip-172-16-10-236.ec2.internal) > I0131 23:55:59.378846 31143 master.cpp:3291] Deactivating agent > a27bcaba-70cc-4ec3-9786-38f9512c61fd-S0 at slave(1112)@172.16.10.236:43229 > (ip-172-16-10-236.ec2.internal) > I0131 23:55:59.378891 31143 hierarchical.cpp:793] Agent > a27bcaba-70cc-4ec3-9786-38f9512c61fd-S0 deactivated > F0131 23:55:59.378891 31149 logging.cpp:67] RAW: Pure virtual method called > @ 0x7f633aaaebdd google::LogMessage::Fail() > @ 0x7f633aab6281 google::RawLog__() > @ 0x7f6339821262 __cxa_pure_virtual > @ 0x55671cacc113 > testing::internal::UntypedFunctionMockerBase::UntypedInvokeWith() > @ 0x55671b532e78 > mesos::internal::tests::resource_provider::MockResourceProvider<>::disconnected() > @ 0x7f633978f6b0 process::AsyncExecutorProcess::execute<>() > @ 0x7f633979f218 > _ZN5cpp176invokeIZN7process8dispatchI7NothingNS1_20AsyncExecutorProcessERKSt8functionIFvvEES9_EENS1_6FutureIT_EERKNS1_3PIDIT0_EEMSE_FSB_T1_EOT2_EUlSt10unique_ptrINS1_7PromiseIS3_EESt14default_deleteISP_EEOS7_PNS1_11ProcessBaseEE_JSS_S7_SV_EEEDTclcl7forwardISB_Efp_Espcl7forwardIT0_Efp0_EEEOSB_DpOSX_ > @ 0x7f633a9f5d01 process::ProcessBase::consume() > @ 0x7f633aa1a08a process::ProcessManager::resume() > @ 0x7f633aa1db06 > _ZNSt6thread5_ImplISt12_Bind_simpleIFZN7process14ProcessManager12init_threadsEvEUlvE_vEEE6_M_runEv > @ 0x7f633acc9f80 execute_native_thread_routine > @ 0x7f6337142e25 start_thread > @ 0x7f6336241bad __clone > {noformat} -- This message was sent by Atlassian JIRA (v7.6.14#76016)
[jira] [Commented] (MESOS-9560) ContentType/AgentAPITest.MarkResourceProviderGone/1 is flaky
[ https://issues.apache.org/jira/browse/MESOS-9560?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16783145#comment-16783145 ] Benjamin Bannier commented on MESOS-9560: - Work-in-progress patches posted here, https://github.com/mesosphere/mesos-private/tree/bbannier/t/MESOS-9560. > ContentType/AgentAPITest.MarkResourceProviderGone/1 is flaky > > > Key: MESOS-9560 > URL: https://issues.apache.org/jira/browse/MESOS-9560 > Project: Mesos > Issue Type: Bug > Components: test >Reporter: Benjamin Bannier >Assignee: Benjamin Bannier >Priority: Critical > Labels: flaky, flaky-test, mesosphere, storage, test > Attachments: consoleText.txt > > > We observed a segfault in > {{ContentType/AgentAPITest.MarkResourceProviderGone/1}} on test teardown. > {noformat} > I0131 23:55:59.378453 6798 slave.cpp:923] Agent terminating > I0131 23:55:59.378813 31143 master.cpp:1269] Agent > a27bcaba-70cc-4ec3-9786-38f9512c61fd-S0 at slave(1112)@172.16.10.236:43229 > (ip-172-16-10-236.ec2.internal) disconnected > I0131 23:55:59.378831 31143 master.cpp:3272] Disconnecting agent > a27bcaba-70cc-4ec3-9786-38f9512c61fd-S0 at slave(1112)@172.16.10.236:43229 > (ip-172-16-10-236.ec2.internal) > I0131 23:55:59.378846 31143 master.cpp:3291] Deactivating agent > a27bcaba-70cc-4ec3-9786-38f9512c61fd-S0 at slave(1112)@172.16.10.236:43229 > (ip-172-16-10-236.ec2.internal) > I0131 23:55:59.378891 31143 hierarchical.cpp:793] Agent > a27bcaba-70cc-4ec3-9786-38f9512c61fd-S0 deactivated > F0131 23:55:59.378891 31149 logging.cpp:67] RAW: Pure virtual method called > @ 0x7f633aaaebdd google::LogMessage::Fail() > @ 0x7f633aab6281 google::RawLog__() > @ 0x7f6339821262 __cxa_pure_virtual > @ 0x55671cacc113 > testing::internal::UntypedFunctionMockerBase::UntypedInvokeWith() > @ 0x55671b532e78 > mesos::internal::tests::resource_provider::MockResourceProvider<>::disconnected() > @ 0x7f633978f6b0 process::AsyncExecutorProcess::execute<>() > @ 0x7f633979f218 > _ZN5cpp176invokeIZN7process8dispatchI7NothingNS1_20AsyncExecutorProcessERKSt8functionIFvvEES9_EENS1_6FutureIT_EERKNS1_3PIDIT0_EEMSE_FSB_T1_EOT2_EUlSt10unique_ptrINS1_7PromiseIS3_EESt14default_deleteISP_EEOS7_PNS1_11ProcessBaseEE_JSS_S7_SV_EEEDTclcl7forwardISB_Efp_Espcl7forwardIT0_Efp0_EEEOSB_DpOSX_ > @ 0x7f633a9f5d01 process::ProcessBase::consume() > @ 0x7f633aa1a08a process::ProcessManager::resume() > @ 0x7f633aa1db06 > _ZNSt6thread5_ImplISt12_Bind_simpleIFZN7process14ProcessManager12init_threadsEvEUlvE_vEEE6_M_runEv > @ 0x7f633acc9f80 execute_native_thread_routine > @ 0x7f6337142e25 start_thread > @ 0x7f6336241bad __clone > {noformat} -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Commented] (MESOS-9560) ContentType/AgentAPITest.MarkResourceProviderGone/1 is flaky
[ https://issues.apache.org/jira/browse/MESOS-9560?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16765687#comment-16765687 ] Greg Mann commented on MESOS-9560: -- Observed again with a slightly different stack trace (notice {{connected()}} vs. {{disconnected()}}): {code} PC: @ 0x7f1ed2b30fe6 mesos::v1::resource_provider::Driver::send() *** SIGSEGV (@0x0) received by PID 15010 (TID 0x7f1ec6657700) from PID 0; stack trace: *** @ 0x7f1e9b9765f2 (unknown) @ 0x7f1e9b97ac19 (unknown) @ 0x7f1e9b96dd28 (unknown) @ 0x7f1ecf459390 (unknown) @ 0x7f1ed2b30fe6 mesos::v1::resource_provider::Driver::send() @ 0x563f6934d1ef mesos::internal::tests::resource_provider::MockResourceProvider<>::connectedDefault() @ 0x563f6926cbfe testing::internal::FunctionMockerBase<>::UntypedPerformDefaultAction() @ 0x563f6a788e96 testing::internal::UntypedFunctionMockerBase::UntypedInvokeWith() @ 0x563f692990c4 mesos::internal::tests::resource_provider::MockResourceProvider<>::connected() @ 0x7f1ed2820110 process::AsyncExecutorProcess::execute<>() @ 0x7f1ed282fe58 _ZN5cpp176invokeIZN7process8dispatchI7NothingNS1_20AsyncExecutorProcessERKSt8functionIFvvEES9_EENS1_6FutureIT_EERKNS1_3PIDIT0_EEMSE_FSB_T1_EOT2_EUlSt10unique_ptrINS1_7PromiseIS3_EESt14default_deleteISP_EEOS7_PNS1_11ProcessBaseEE_JSS_S7_SV_EEEDTclcl7forwardISB_Efp_Espcl7forwardIT0_Efp0_EEEOSB_DpOSX_ @ 0x7f1ed3a636c1 process::ProcessBase::consume() @ 0x7f1ed3a85b2a process::ProcessManager::resume() @ 0x7f1ed3a89866 _ZNSt6thread5_ImplISt12_Bind_simpleIFZN7process14ProcessManager12init_threadsEvEUlvE_vEEE6_M_runEv @ 0x7f1ecfc3cc80 (unknown) @ 0x7f1ecf44f6ba start_thread @ 0x7f1ecf18541d (unknown) {code} > ContentType/AgentAPITest.MarkResourceProviderGone/1 is flaky > > > Key: MESOS-9560 > URL: https://issues.apache.org/jira/browse/MESOS-9560 > Project: Mesos > Issue Type: Bug > Components: test >Reporter: Benjamin Bannier >Priority: Critical > Labels: flaky, flaky-test, mesosphere, storage, test > Attachments: consoleText.txt > > > We observed a segfault in > {{ContentType/AgentAPITest.MarkResourceProviderGone/1}} on test teardown. > {noformat} > I0131 23:55:59.378453 6798 slave.cpp:923] Agent terminating > I0131 23:55:59.378813 31143 master.cpp:1269] Agent > a27bcaba-70cc-4ec3-9786-38f9512c61fd-S0 at slave(1112)@172.16.10.236:43229 > (ip-172-16-10-236.ec2.internal) disconnected > I0131 23:55:59.378831 31143 master.cpp:3272] Disconnecting agent > a27bcaba-70cc-4ec3-9786-38f9512c61fd-S0 at slave(1112)@172.16.10.236:43229 > (ip-172-16-10-236.ec2.internal) > I0131 23:55:59.378846 31143 master.cpp:3291] Deactivating agent > a27bcaba-70cc-4ec3-9786-38f9512c61fd-S0 at slave(1112)@172.16.10.236:43229 > (ip-172-16-10-236.ec2.internal) > I0131 23:55:59.378891 31143 hierarchical.cpp:793] Agent > a27bcaba-70cc-4ec3-9786-38f9512c61fd-S0 deactivated > F0131 23:55:59.378891 31149 logging.cpp:67] RAW: Pure virtual method called > @ 0x7f633aaaebdd google::LogMessage::Fail() > @ 0x7f633aab6281 google::RawLog__() > @ 0x7f6339821262 __cxa_pure_virtual > @ 0x55671cacc113 > testing::internal::UntypedFunctionMockerBase::UntypedInvokeWith() > @ 0x55671b532e78 > mesos::internal::tests::resource_provider::MockResourceProvider<>::disconnected() > @ 0x7f633978f6b0 process::AsyncExecutorProcess::execute<>() > @ 0x7f633979f218 > _ZN5cpp176invokeIZN7process8dispatchI7NothingNS1_20AsyncExecutorProcessERKSt8functionIFvvEES9_EENS1_6FutureIT_EERKNS1_3PIDIT0_EEMSE_FSB_T1_EOT2_EUlSt10unique_ptrINS1_7PromiseIS3_EESt14default_deleteISP_EEOS7_PNS1_11ProcessBaseEE_JSS_S7_SV_EEEDTclcl7forwardISB_Efp_Espcl7forwardIT0_Efp0_EEEOSB_DpOSX_ > @ 0x7f633a9f5d01 process::ProcessBase::consume() > @ 0x7f633aa1a08a process::ProcessManager::resume() > @ 0x7f633aa1db06 > _ZNSt6thread5_ImplISt12_Bind_simpleIFZN7process14ProcessManager12init_threadsEvEUlvE_vEEE6_M_runEv > @ 0x7f633acc9f80 execute_native_thread_routine > @ 0x7f6337142e25 start_thread > @ 0x7f6336241bad __clone > {noformat} -- This message was sent by Atlassian JIRA (v7.6.3#76005)