[
https://issues.apache.org/jira/browse/MESOS-7686?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16051982#comment-16051982
]
Jack Crawford commented on MESOS-7686:
--------------------------------------
after further searching, i see that this is a configuration option. It might be
nice to mention that in the error like "Failed to perform store within 20secs,
consider increasing registry_store_timeout", or similar
> registrar aborting, failed to mark agent causes fatal error
> -----------------------------------------------------------
>
> Key: MESOS-7686
> URL: https://issues.apache.org/jira/browse/MESOS-7686
> Project: Mesos
> Issue Type: Bug
> Reporter: Jack Crawford
>
> Mesos master fails to update registrar and fails.
> Running with mesos 1.2.0, 1 master
> {code}
> Jun 16 02:20:22 mesosmaster mesos-master[9415]: E0616 02:20:22.562098 9422
> registrar.cpp:528] Registrar aborting: Failed to update registry: Failed to
> perform store within 20secs
> Jun 16 02:20:32 mesosmaster mesos-master[9415]: F0616 02:20:32.965498 9419
> master.cpp:6420] Failed to mark agent
> acc02700-53c8-4961-b8f4-d952e58432c3-S742 at slave(1)@10.0.239.60:5051
> (ip-10-0-239-60) un
> Jun 16 02:20:53 mesosmaster mesos-master[9415]: E0616 02:20:36.198673 9426
> process.cpp:2426] Failed to shutdown socket with fd 18: Transport endpoint is
> not connected
> Jun 16 02:20:53 mesosmaster mesos-master[9415]: *** Check failure stack
> trace: ***
> Jun 16 02:21:24 mesosmaster mesos-master[9415]: @ 0x7f696c7923cd
> google::LogMessage::Fail()
> Jun 16 02:21:34 mesosmaster mesos-master[9415]: @ 0x7f696c794180
> google::LogMessage::SendToLog()
> Jun 16 02:21:44 mesosmaster mesos-master[9415]: @ 0x7f696c791fb3
> google::LogMessage::Flush()
> Jun 16 02:21:44 mesosmaster mesos-master[9415]: @ 0x7f696c794ba9
> google::LogMessageFatal::~LogMessageFatal()
> Jun 16 02:21:44 mesosmaster mesos-master[9415]: @ 0x7f696bb48e6d
> mesos::internal::master::Master::_markUnreachable()
> Jun 16 02:21:44 mesosmaster mesos-master[9415]: @ 0x7f696c6f1f0c
> process::ProcessBase::visit()
> Jun 16 02:21:44 mesosmaster mesos-master[9415]: @ 0x7f696c704933
> process::ProcessManager::resume()
> Jun 16 02:21:44 mesosmaster mesos-master[9415]: @ 0x7f696c70f537
> _ZNSt6thread5_ImplISt12_Bind_simpleIFZN7process14ProcessManager12init_threadsEvEUt_vEEE6_M_runEv
> Jun 16 02:21:44 mesosmaster mesos-master[9415]: @ 0x7f696a950c80
> (unknown)
> Jun 16 02:21:44 mesosmaster mesos-master[9415]: @ 0x7f696a1636ba
> start_thread
> Jun 16 02:21:44 mesosmaster mesos-master[9415]: @ 0x7f6969e9982d
> (unknown)
> Jun 16 02:23:40 mesosmaster systemd[1]: mesos-master.service: Main process
> exited, code=killed, status=6/ABRT
> Jun 16 02:23:40 mesosmaster systemd[1]: mesos-master.service: Unit entered
> failed state.
> {code}
--
This message was sent by Atlassian JIRA
(v6.4.14#64029)