Neil Conway created MESOS-7716: ---------------------------------- Summary: Mesos 1.2.0 crashes Mesos 1.4.0 master Key: MESOS-7716 URL: https://issues.apache.org/jira/browse/MESOS-7716 Project: Mesos Issue Type: Bug Reporter: Neil Conway Assignee: Neil Conway
If a Mesos 1.2.0 agent running a task re-registers with a 1.4.0 (-dev) master, the master crashes. Two different crashes: {noformat} F0624 10:50:37.722401 79241216 resources.cpp:1050] Check failed: !resource.has_role() *** Check failure stack trace: *** @ 0x104448a9a google::LogMessage::Fail() @ 0x1044467b5 google::LogMessage::SendToLog() @ 0x104447509 google::LogMessage::Flush() @ 0x104450168 google::LogMessageFatal::~LogMessageFatal() @ 0x104448f85 google::LogMessageFatal::~LogMessageFatal() @ 0x100cf8d5d mesos::Resources::isEmpty() @ 0x100cfb0c3 mesos::Resources::Resource_::isEmpty() @ 0x100cf3791 mesos::Resources::add() @ 0x100d0b2ca mesos::Resources::operator+=() @ 0x100cfbf7b mesos::Resources::operator+=() @ 0x100cfc4ad mesos::Resources::Resources() @ 0x100cfc54d mesos::Resources::Resources() @ 0x10160f143 mesos::internal::master::Slave::addTask() @ 0x101681e3d mesos::internal::master::Slave::Slave() @ 0x101632f84 mesos::internal::master::Slave::Slave() @ 0x101642b3e mesos::internal::master::Master::__reregisterSlave() @ 0x1018cf222 _ZZN7process8dispatchIN5mesos8internal6master6MasterERKNS1_9SlaveInfoERKNS_4UPIDERKNSt3__16vectorINS1_8ResourceENSB_9allocatorISD_EEEERKNSC_INS1_12ExecutorInfoENSE_ISJ_EEEERKNSC_INS1_4TaskENSE_ISO_EEEERKNSC_INS1_13FrameworkInfoENSE_IST_EEEERKNSC_INS2_17Archive_FrameworkENSE_ISY_EEEERKNSB_12basic_stringIcNSB_11char_traitsIcEENSE_IcEEEERKNSC_INS1_20SlaveInfo_CapabilityENSE_IS1A_EEEERKNS_6FutureIbEES5_S8_SG_SL_SQ_SV_S10_S17_S1C_S1G_EEvRKNS_3PIDIT_EEMS1K_FvT0_T1_T2_T3_T4_T5_T6_T7_T8_T9_ET10_T11_T12_T13_T14_T15_T16_T17_T18_T19_ENKUlPNS_11ProcessBaseEE_clES2B_ @ 0x1018cf000 _ZNSt3__128__invoke_void_return_wrapperIvE6__callIJRZN7process8dispatchIN5mesos8internal6master6MasterERKNS5_9SlaveInfoERKNS3_4UPIDERKNS_6vectorINS5_8ResourceENS_9allocatorISG_EEEERKNSF_INS5_12ExecutorInfoENSH_ISM_EEEERKNSF_INS5_4TaskENSH_ISR_EEEERKNSF_INS5_13FrameworkInfoENSH_ISW_EEEERKNSF_INS6_17Archive_FrameworkENSH_IS11_EEEERKNS_12basic_stringIcNS_11char_traitsIcEENSH_IcEEEERKNSF_INS5_20SlaveInfo_CapabilityENSH_IS1D_EEEERKNS3_6FutureIbEES9_SC_SJ_SO_ST_SY_S13_S1A_S1F_S1J_EEvRKNS3_3PIDIT_EEMS1N_FvT0_T1_T2_T3_T4_T5_T6_T7_T8_T9_ET10_T11_T12_T13_T14_T15_T16_T17_T18_T19_EUlPNS3_11ProcessBaseEE_S2E_EEEvDpOT_ @ 0x1018cdab9 _ZNSt3__110__function6__funcIZN7process8dispatchIN5mesos8internal6master6MasterERKNS4_9SlaveInfoERKNS2_4UPIDERKNS_6vectorINS4_8ResourceENS_9allocatorISF_EEEERKNSE_INS4_12ExecutorInfoENSG_ISL_EEEERKNSE_INS4_4TaskENSG_ISQ_EEEERKNSE_INS4_13FrameworkInfoENSG_ISV_EEEERKNSE_INS5_17Archive_FrameworkENSG_IS10_EEEERKNS_12basic_stringIcNS_11char_traitsIcEENSG_IcEEEERKNSE_INS4_20SlaveInfo_CapabilityENSG_IS1C_EEEERKNS2_6FutureIbEES8_SB_SI_SN_SS_SX_S12_S19_S1E_S1I_EEvRKNS2_3PIDIT_EEMS1M_FvT0_T1_T2_T3_T4_T5_T6_T7_T8_T9_ET10_T11_T12_T13_T14_T15_T16_T17_T18_T19_EUlPNS2_11ProcessBaseEE_NSG_IS2E_EEFvS2D_EEclEOS2D_ @ 0x1042934ba std::__1::function<>::operator()() @ 0x1042933fc process::ProcessBase::visit() @ 0x10430063e process::DispatchEvent::visit() @ 0x100918081 process::ProcessBase::serve() @ 0x10428dfd6 process::ProcessManager::resume() @ 0x10436ebb0 process::ProcessManager::init_threads()::$_1::operator()() @ 0x10436e7d2 _ZNSt3__114__thread_proxyINS_5tupleIJZN7process14ProcessManager12init_threadsEvE3$_1EEEEEPvS6_ @ 0x7fffcf49d93b _pthread_body @ 0x7fffcf49d887 _pthread_start @ 0x7fffcf49d08d thread_start {noformat} {noformat} F0623 17:24:17.165977 14985 resources.cpp:1050] Check failed: !resource.has_role() *** Check failure stack trace: *** @ 0x7f2b06c6aedd google::LogMessage::Fail() @ 0x7f2b06c6cc1c google::LogMessage::SendToLog() @ 0x7f2b06c6aacc google::LogMessage::Flush() @ 0x7f2b06c6d519 google::LogMessageFatal::~LogMessageFatal() @ 0x7f2b05df7e76 mesos::Resources::isEmpty() @ 0x7f2b05df7f1c mesos::Resources::add() @ 0x7f2b05df9533 mesos::Resources::operator+=() @ 0x7f2b05df95dd mesos::Resources::operator+=() @ 0x7f2b05df970b mesos::Resources::Resources() @ 0x7f2b060c2651 mesos::internal::master::validation::resource::validateGpus() @ 0x7f2b060c2903 mesos::internal::master::validation::resource::validate() @ 0x7f2b060c330c mesos::internal::master::validation::master::message::reregisterSlave() @ 0x7f2b05fd9812 mesos::internal::master::Master::reregisterSlave() @ 0x7f2b06066a86 ProtobufProcess<>::handler8<>() @ 0x7f2b06008e45 std::_Function_handler<>::_M_invoke() @ 0x7f2b05fb12f3 mesos::internal::master::Master::_visit() @ 0x7f2b05fc9e32 mesos::internal::master::Master::visit() @ 0x7f2b06bef509 process::ProcessManager::resume() @ 0x7f2b06bf9197 _ZNSt6thread5_ImplISt12_Bind_simpleIFZN7process14ProcessManager12init_threadsEvEUt_vEEE6_M_runEv @ 0x7f2b04a8a230 (unknown) @ 0x7f2b04ce5dc5 start_thread @ 0x7f2b041f373d __clone {noformat} -- This message was sent by Atlassian JIRA (v6.4.14#64029)