Jan Schlicht created MESOS-4673:
-----------------------------------
Summary: Agent fails to register after re-registering period timed-out.
Key: MESOS-4673
URL: https://issues.apache.org/jira/browse/MESOS-4673
Project: Mesos
Issue Type: Bug
Reporter: Jan Schlicht
Assignee: Jan Schlicht
Under certain conditions, when a Mesos agent loses its connection to the master
for an extended period of time (say, because a switch fails), the master
de-registers the agent. When the agent comes back, the master refuses to let it
register: {{Slave asked to shut down by [email protected]:5050 because 'Slave
attempted to re-register after removal'}}.
The agent doesn't seem to be able to shut down properly and remove its running
tasks, which it must do before it can register as a new agent. Hence this
message persists until the situation is resolved by manual intervention.
This seems to be caused by Docker tasks that cannot be shut down cleanly when
the agent is asked to shut down its running tasks so that it can register as a
new agent with the master.
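
To make the suspected sequence concrete, here is a toy Python model of the
behaviour described above. It is only a sketch: the class names, method names
and return strings are hypothetical and do not correspond to Mesos internals,
and a task that "cannot be stopped" is simply represented by a boolean flag.

```python
from dataclasses import dataclass, field


@dataclass
class Master:
    # IDs of agents the master removed after the re-registration timeout.
    removed: set = field(default_factory=set)

    def remove_after_timeout(self, agent_id: str) -> None:
        self.removed.add(agent_id)

    def reregister(self, agent_id: str) -> str:
        # A removed agent is asked to shut down instead of being re-admitted.
        return "shutdown" if agent_id in self.removed else "ok"

    def register_new(self) -> str:
        return "ok"


@dataclass
class Agent:
    agent_id: str
    tasks: dict  # task name -> True if the task can be stopped cleanly

    def shutdown_tasks(self) -> bool:
        # Stop every task that can be stopped; report whether none remain.
        self.tasks = {name: ok for name, ok in self.tasks.items() if not ok}
        return not self.tasks

    def reconnect(self, master: Master) -> str:
        if master.reregister(self.agent_id) == "ok":
            return "re-registered"
        # Asked to shut down: all tasks must go before registering as new.
        if self.shutdown_tasks():
            master.register_new()
            return "registered as new agent"
        # A task that cannot be stopped leaves the agent in the reported state.
        return "stuck"
```

In this model, an agent whose only task is an unstoppable Docker task ends up
in the "stuck" state after the master has removed it, which matches the
symptom reported here. An agent whose tasks all stop cleanly registers as a
new agent.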
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)