[ 
https://issues.apache.org/jira/browse/MESOS-5023?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15212275#comment-15212275
 ] 

Gilbert Song commented on MESOS-5023:
-------------------------------------

This failure comes from a race that the containers_[containerId] is erased 
accidentally. I am attaching the log which is the same flaky by comment out the 
check `CHECK(containers_.contains(containerId));` in _launch():

{noformat}
[==========] Running 1 test from 1 test case.
[----------] Global test environment set-up.
[----------] 1 test from MesosContainerizerProvisionerTest
[ RUN      ] MesosContainerizerProvisionerTest.DestroyWhileProvisioning
I0325 19:01:02.686094 19729 containerizer.cpp:666] Starting container 
'63a1b3d5-dd19-453f-9953-439d4126c488' for executor 'executor' of framework ''
I0325 19:01:02.687070 19732 containerizer.cpp:1421] Destroying container 
'63a1b3d5-dd19-453f-9953-439d4126c488'
I0325 19:01:02.687110 19732 containerizer.cpp:1424] Waiting for the provisioner 
to complete for container '63a1b3d5-dd19-453f-9953-439d4126c488'
F0325 19:01:02.688181 19731 owned.hpp:110] Check failed: 'get()' Must be non 
NULL 
*** Check failure stack trace: ***
    @     0x7f00ae6fe84d  google::LogMessage::Fail()
    @     0x7f00ae6fdc2e  google::LogMessage::SendToLog()
    @     0x7f00ae6fe51d  google::LogMessage::Flush()
    @     0x7f00ae701968  google::LogMessageFatal::~LogMessageFatal()
    @     0x7f00adce4df4  google::CheckNotNull<>()
    @     0x7f00adc83760  process::Owned<>::operator->()
    @     0x7f00adc7154d  
mesos::internal::slave::MesosContainerizerProcess::_launch()
    @     0x7f00adcec610  
_ZZN7process8dispatchIbN5mesos8internal5slave25MesosContainerizerProcessERKNS1_11ContainerIDERK6OptionINS1_8TaskInfoEERKNS1_12ExecutorInfoERKSsRKS8_ISsERKNS1_7SlaveIDERKNS_3PIDINS3_5SlaveEEEbRKS8_INS3_13ProvisionInfoEES5_SA_SD_SsSI_SL_SQ_bSU_EENS_6FutureIT_EERKNSO_IT0_EEMS10_FSZ_T1_T2_T3_T4_T5_T6_T7_T8_T9_ET10_T11_T12_T13_T14_T15_T16_T17_T18_ENKUlPNS_11ProcessBaseEE_clES1P_
    @     0x7f00adcebf32  
_ZNSt17_Function_handlerIFvPN7process11ProcessBaseEEZNS0_8dispatchIbN5mesos8internal5slave25MesosContainerizerProcessERKNS5_11ContainerIDERK6OptionINS5_8TaskInfoEERKNS5_12ExecutorInfoERKSsRKSC_ISsERKNS5_7SlaveIDERKNS0_3PIDINS7_5SlaveEEEbRKSC_INS7_13ProvisionInfoEES9_SE_SH_SsSM_SP_SU_bSY_EENS0_6FutureIT_EERKNSS_IT0_EEMS14_FS13_T1_T2_T3_T4_T5_T6_T7_T8_T9_ET10_T11_T12_T13_T14_T15_T16_T17_T18_EUlS2_E_E9_M_invokeERKSt9_Any_dataS2_
    @     0x7f00ae67c188  std::function<>::operator()()
    @     0x7f00ae666494  process::ProcessBase::visit()
    @     0x7f00ae6bdbde  process::DispatchEvent::visit()
    @           0x857a71  process::ProcessBase::serve()
    @     0x7f00ae6641bd  process::ProcessManager::resume()
    @     0x7f00ae66c325  
process::ProcessManager::init_threads()::$_1::operator()()
    @     0x7f00ae66c263  
_ZNSt5_BindIFZN7process14ProcessManager12init_threadsEvE3$_1St17reference_wrapperIKSt11atomic_boolEEE6__callIvJEJLm0EEEET_OSt5tupleIJDpT0_EESt12_Index_tupleIJXspT1_EEE
    @     0x7f00ae66c216  
_ZNSt5_BindIFZN7process14ProcessManager12init_threadsEvE3$_1St17reference_wrapperIKSt11atomic_boolEEEclIJEvEET0_DpOT_
    @     0x7f00ae66c1c5  
_ZNSt12_Bind_simpleIFSt5_BindIFZN7process14ProcessManager12init_threadsEvE3$_1St17reference_wrapperIKSt11atomic_boolEEEvEE9_M_invokeIJEEEvSt12_Index_tupleIJXspT_EEE
    @     0x7f00ae66c195  std::_Bind_simple<>::operator()()
    @     0x7f00ae66c16c  std::thread::_Impl<>::_M_run()
    @     0x7f00a9742a60  (unknown)
    @     0x7f00a8f5f182  start_thread
    @     0x7f00a8c8c47d  (unknown)
{noformat}

> MesosContainerizerProvisionerTest.DestroyWhileProvisioning is flaky.
> --------------------------------------------------------------------
>
>                 Key: MESOS-5023
>                 URL: https://issues.apache.org/jira/browse/MESOS-5023
>             Project: Mesos
>          Issue Type: Bug
>            Reporter: Alexander Rukletsov
>            Assignee: Klaus Ma
>              Labels: mesosphere
>
> Observed on the Apache Jenkins.
> {noformat}
> [ RUN      ] MesosContainerizerProvisionerTest.ProvisionFailed
> I0324 13:38:56.284261  2948 containerizer.cpp:666] Starting container 
> 'test_container' for executor 'executor' of framework ''
> I0324 13:38:56.285825  2939 containerizer.cpp:1421] Destroying container 
> 'test_container'
> I0324 13:38:56.285854  2939 containerizer.cpp:1424] Waiting for the 
> provisioner to complete for container 'test_container'
> [       OK ] MesosContainerizerProvisionerTest.ProvisionFailed (7 ms)
> [ RUN      ] MesosContainerizerProvisionerTest.DestroyWhileProvisioning
> I0324 13:38:56.291187  2944 containerizer.cpp:666] Starting container 
> 'c2316963-c6cb-4c7f-a3b9-17ca5931e5b2' for executor 'executor' of framework ''
> I0324 13:38:56.292157  2944 containerizer.cpp:1421] Destroying container 
> 'c2316963-c6cb-4c7f-a3b9-17ca5931e5b2'
> I0324 13:38:56.292179  2944 containerizer.cpp:1424] Waiting for the 
> provisioner to complete for container 'c2316963-c6cb-4c7f-a3b9-17ca5931e5b2'
> F0324 13:38:56.292899  2944 containerizer.cpp:752] Check failed: 
> containers_.contains(containerId)
> *** Check failure stack trace: ***
>     @     0x2ac9973d0ae4  google::LogMessage::Fail()
>     @     0x2ac9973d0a30  google::LogMessage::SendToLog()
>     @     0x2ac9973d0432  google::LogMessage::Flush()
>     @     0x2ac9973d3346  google::LogMessageFatal::~LogMessageFatal()
>     @     0x2ac996af897c  
> mesos::internal::slave::MesosContainerizerProcess::_launch()
>     @     0x2ac996b1f18a  
> _ZZN7process8dispatchIbN5mesos8internal5slave25MesosContainerizerProcessERKNS1_11ContainerIDERK6OptionINS1_8TaskInfoEERKNS1_12ExecutorInfoERKSsRKS8_ISsERKNS1_7SlaveIDERKNS_3PIDINS3_5SlaveEEEbRKS8_INS3_13ProvisionInfoEES5_SA_SD_SsSI_SL_SQ_bSU_EENS_6FutureIT_EERKNSO_IT0_EEMS10_FSZ_T1_T2_T3_T4_T5_T6_T7_T8_T9_ET10_T11_T12_T13_T14_T15_T16_T17_T18_ENKUlPNS_11ProcessBaseEE_clES1P_
>     @     0x2ac996b479d9  
> _ZNSt17_Function_handlerIFvPN7process11ProcessBaseEEZNS0_8dispatchIbN5mesos8internal5slave25MesosContainerizerProcessERKNS5_11ContainerIDERK6OptionINS5_8TaskInfoEERKNS5_12ExecutorInfoERKSsRKSC_ISsERKNS5_7SlaveIDERKNS0_3PIDINS7_5SlaveEEEbRKSC_INS7_13ProvisionInfoEES9_SE_SH_SsSM_SP_SU_bSY_EENS0_6FutureIT_EERKNSS_IT0_EEMS14_FS13_T1_T2_T3_T4_T5_T6_T7_T8_T9_ET10_T11_T12_T13_T14_T15_T16_T17_T18_EUlS2_E_E9_M_invokeERKSt9_Any_dataS2_
>     @     0x2ac997334fef  std::function<>::operator()()
>     @     0x2ac99731b1c7  process::ProcessBase::visit()
>     @     0x2ac997321154  process::DispatchEvent::visit()
>     @           0x9a699c  process::ProcessBase::serve()
>     @     0x2ac9973173c0  process::ProcessManager::resume()
>     @     0x2ac99731445a  
> _ZZN7process14ProcessManager12init_threadsEvENKUlRKSt11atomic_boolE_clES3_
>     @     0x2ac997320916  
> _ZNSt5_BindIFZN7process14ProcessManager12init_threadsEvEUlRKSt11atomic_boolE_St17reference_wrapperIS3_EEE6__callIvIEILm0EEEET_OSt5tupleIIDpT0_EESt12_Index_tupleIIXspT1_EEE
>     @     0x2ac9973208c6  
> _ZNSt5_BindIFZN7process14ProcessManager12init_threadsEvEUlRKSt11atomic_boolE_St17reference_wrapperIS3_EEEclIIEvEET0_DpOT_
>     @     0x2ac997320858  
> _ZNSt12_Bind_simpleIFSt5_BindIFZN7process14ProcessManager12init_threadsEvEUlRKSt11atomic_boolE_St17reference_wrapperIS4_EEEvEE9_M_invokeIIEEEvSt12_Index_tupleIIXspT_EEE
>     @     0x2ac9973207af  
> _ZNSt12_Bind_simpleIFSt5_BindIFZN7process14ProcessManager12init_threadsEvEUlRKSt11atomic_boolE_St17reference_wrapperIS4_EEEvEEclEv
>     @     0x2ac997320748  
> _ZNSt6thread5_ImplISt12_Bind_simpleIFSt5_BindIFZN7process14ProcessManager12init_threadsEvEUlRKSt11atomic_boolE_St17reference_wrapperIS6_EEEvEEE6_M_runEv
>     @     0x2ac9989aea60  (unknown)
>     @     0x2ac999125182  start_thread
>     @     0x2ac99943547d  (unknown)
> make[4]: Leaving directory `/mesos/mesos-0.29.0/_build/src'
> make[4]: *** [check-local] Aborted
> make[3]: *** [check-am] Error 2
> make[3]: Leaving directory `/mesos/mesos-0.29.0/_build/src'
> make[2]: *** [check] Error 2
> make[2]: Leaving directory `/mesos/mesos-0.29.0/_build/src'
> make[1]: *** [check-recursive] Error 1
> make[1]: Leaving directory `/mesos/mesos-0.29.0/_build'
> make: *** [distcheck] Error 1
> Build step 'Execute shell' marked build as failure
> {noformat}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to