Vinod Kone created MESOS-614:
--------------------------------

             Summary: Master should remove checkpointing slave that gets 
disconnected when the new slave tries to register
                 Key: MESOS-614
                 URL: https://issues.apache.org/jira/browse/MESOS-614
             Project: Mesos
          Issue Type: Improvement
            Reporter: Vinod Kone
            Assignee: Vinod Kone
             Fix For: 0.13.0


When a checkpointing slave is not able to recover (for whatever reason) it 
tries to register as a new slave. But if this registration happens before 
master has removed the old slave, the master simply gives the old slave id for 
the new slave. This means the master thinks the slave is running a bunch of 
tasks whereas the slave thinks it is new.

Master should remove the slave from its map (send TASK_LOST updates) when this 
happens and create a new slave entry.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to