Ian Downes created MESOS-2473:
---------------------------------
Summary: Failure to recover because of freezer timeout should not
suggest removing meta data
Key: MESOS-2473
URL: https://issues.apache.org/jira/browse/MESOS-2473
Project: Mesos
Issue Type: Improvement
Components: isolation
Affects Versions: 0.22.0
Reporter: Ian Downes
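When recovery fails because of a freezer timeout, the error message should
suggest a more appropriate action, e.g., manually killing the processes in
cgroup <xxx>. Removing the meta data does not help, because the slave will
still attempt to clean up orphans and hit the same code path.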
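
As a rough illustration of the suggested manual action, the sketch below thaws a freezer cgroup and sends SIGKILL to every process still listed in it. It is not what the slave does and not a proposed message text. The function names are hypothetical, and the cgroup directory is passed in as an argument because the issue only names it as cgroup <xxx>. It assumes a cgroup v1 freezer hierarchy exposing `cgroup.procs` and `freezer.state`. A process stuck in uninterruptible sleep will not exit until it leaves that state, even after SIGKILL.

```shell
#!/bin/sh
# Sketch only: kill the processes left in a freezer cgroup by hand.
# The cgroup directory is supplied by the operator (assumption: cgroup v1).

# Print the PIDs listed in the cgroup's cgroup.procs file, one per line.
list_cgroup_pids() {
    cgroup_dir="$1"
    [ -r "$cgroup_dir/cgroup.procs" ] || return 1
    # Keep only purely numeric lines.
    grep -E '^[0-9]+$' "$cgroup_dir/cgroup.procs"
}

# Thaw the cgroup, then send SIGKILL to each process in it.
kill_cgroup_procs() {
    cgroup_dir="$1"
    # With the v1 freezer, frozen tasks do not act on signals until thawed,
    # so request THAWED first if the state file is writable.
    if [ -w "$cgroup_dir/freezer.state" ]; then
        echo THAWED > "$cgroup_dir/freezer.state"
    fi
    for pid in $(list_cgroup_pids "$cgroup_dir"); do
        kill -KILL "$pid" 2>/dev/null || echo "could not kill $pid" >&2
    done
}
```

An operator would call `kill_cgroup_procs "$CGROUP_DIR"` with `CGROUP_DIR` set to the container's freezer cgroup directory, then restart the slave.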
{noformat}
I0310 23:04:23.961019 32342 slave.cpp:3321] Current usage 35.87%. Max allowed age: 3.789365411204225days
Failed to perform recovery: Collect failed: Timed out after 1mins
To remedy this do as follows:
Step 1: rm -f /var/lib/mesos/meta/slaves/latest
This ensures slave doesn't recover old live executors.
Step 2: Restart the slave.
Slave Exit Status: 1
{noformat}