> On Feb. 20, 2013, 12:54 a.m., Ben Mahler wrote: > > src/slave/cgroups_isolation_module.cpp, line 437 > > <https://reviews.apache.org/r/9408/diff/6/?file=258837#file258837line437> > > > > EXIT is used for messages that the mesos user / operator would read and > > need to take action in regard to. Given this is for orphaned executors, can > > you add more information to the message here to describe what could be > > wrong (so as to guide the user)?
There are multiple reasons a cgroup 'destroy' fails, as you can see in cgroups::destroy(). And, no, its not because another slave is running (see below). Afaict, it is not possible to tell the reason from the call site. Any ideas? > On Feb. 20, 2013, 12:54 a.m., Ben Mahler wrote: > > src/slave/cgroups_isolation_module.cpp, line 444 > > <https://reviews.apache.org/r/9408/diff/6/?file=258837#file258837line444> > > > > ditto, perhaps an indication that it's likely due to another slave > > running on this machine? Its not possible for another slave to be running, because we acquired the exclusive lock. This could fail if 'fd' is invalid/refers to an object other than file/incorrect descriptor. The strerror() would give the correct reason. - Vinod ----------------------------------------------------------- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/9408/#review16763 ----------------------------------------------------------- On Feb. 19, 2013, 10:56 p.m., Vinod Kone wrote: > > ----------------------------------------------------------- > This is an automatically generated e-mail. To reply, visit: > https://reviews.apache.org/r/9408/ > ----------------------------------------------------------- > > (Updated Feb. 19, 2013, 10:56 p.m.) > > > Review request for mesos, Benjamin Hindman and Ben Mahler. > > > Description > ------- > > See summary > > > Diffs > ----- > > src/slave/cgroups_isolation_module.hpp > 669efa14ba2603764aa68ae19a44e79dbfdec192 > src/slave/cgroups_isolation_module.cpp > 14f549edaf1b37a6bca8f75309864333ae775e7c > src/slave/process_based_isolation_module.hpp > f1817192582e3646f8dcf17934ba7998829e8fd6 > src/slave/process_based_isolation_module.cpp > 12a579cba56cd3dac384bc7919b0d5537b0e429d > src/tests/balloon_framework_test.sh > 93a733f64cfde08349b7781eb3d5e13594c74498 > > Diff: https://reviews.apache.org/r/9408/diff/ > > > Testing > ------- > > make check > > > Thanks, > > Vinod Kone > >
