Ian Downes created MESOS-2473:
---------------------------------

             Summary: Failure to recover because of freezer timeout should not 
suggest removing meta data
                 Key: MESOS-2473
                 URL: https://issues.apache.org/jira/browse/MESOS-2473
             Project: Mesos
          Issue Type: Improvement
          Components: isolation
    Affects Versions: 0.22.0
            Reporter: Ian Downes


A more appropriate action should be suggested, e.g., manually kill the 
processes in cgroup <xxx> because the slave will still attempt to clean up 
orphans and hit the same code path.

{noformat}
I0310 23:04:23.961019 32342 slave.cpp:3321] Current usage 35.87%. Max allowed 
age: 3.789365411204225days
Failed to perform recovery: Collect failed: Timed out after 1mins
To remedy this do as follows:
Step 1: rm -f /var/lib/mesos/meta/slaves/latest
        This ensures slave doesn't recover old live executors.
Step 2: Restart the slave.
Slave Exit Status: 1
{noformat}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to