> On June 17, 2014, 12:28 a.m., Vinod Kone wrote: > > src/slave/slave.cpp, lines 1204-1206 > > <https://reviews.apache.org/r/22313/diff/7/?file=607037#file607037line1204> > > > > The slave cannot be in RECOVERING. See the comment on line #1099.
I see! Thank you! > On June 17, 2014, 12:28 a.m., Vinod Kone wrote: > > src/slave/slave.cpp, lines 1219-1224 > > <https://reviews.apache.org/r/22313/diff/7/?file=607037#file607037line1219> > > > > Is this possible? AFAICT, since the task was added to the executor the > > executor shouldn't be removed between _runTask() and __runTask(). Even if > > the executor terminates in between, this task should've been marked > > 'terminated' but not 'completed' (i.e., waiting for an ACK) and hence the > > executor won't be removed from the framework's map. Since there is a > > pending executor, the framework shouldn't be removed. > > > > So this can be a CHECK_NONTULL(framework) with a comment on why it can > > be a check. Good point! I am currently trying to add a test to kill the framework before the containerizer->update() returns to test this. Thanks for pointing out! > On June 17, 2014, 12:28 a.m., Vinod Kone wrote: > > src/tests/slave_tests.cpp, lines 803-804 > > <https://reviews.apache.org/r/22313/diff/7/?file=607038#file607038line803> > > > > I don't see where this test is ensuring that resources are isolated > > before launching tasks? > > > > I would recommend splitting this into 2 tests > > > > 1) ensure resources are updated before task is launched > > > > 2) containerizer failed causes task lost to be sent. > > > > i would also recommend writing a test that ensures that framework and > > executor are not removed between _runTask and __runTask(). Doing! > On June 17, 2014, 12:28 a.m., Vinod Kone wrote: > > src/tests/slave_tests.cpp, lines 864-865 > > <https://reviews.apache.org/r/22313/diff/7/?file=607038#file607038line864> > > > > You should set these expectations just before you launch the second > > task. Also, why "WillRepeatedly"? Are you expecting more than 2 updates? I forgot why I added the WillRepeatedly here, but I guess this is because the first task will send a TASK_LOST at the end. I will try to verify this in the new version. Thanks! Vinod! - Yifan ----------------------------------------------------------- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/22313/#review45858 ----------------------------------------------------------- On June 11, 2014, 9:32 a.m., Yifan Gu wrote: > > ----------------------------------------------------------- > This is an automatically generated e-mail. To reply, visit: > https://reviews.apache.org/r/22313/ > ----------------------------------------------------------- > > (Updated June 11, 2014, 9:32 a.m.) > > > Review request for mesos, Ian Downes and Vinod Kone. > > > Bugs: MESOS-886 > https://issues.apache.org/jira/browse/MESOS-886 > > > Repository: mesos-git > > > Description > ------- > > Added __runTask() to wait for the completion of containerizer->update() and > check the result before sending RunTaskMessage. > > > Diffs > ----- > > src/slave/slave.hpp 34687e5 > src/slave/slave.cpp 643c088 > src/tests/slave_tests.cpp 2c8f183 > > Diff: https://reviews.apache.org/r/22313/diff/ > > > Testing > ------- > > SlaveTest, CancelTaskIfContainerizerFails > > Which tests that if the containerizer->update() return a Failure, the task > won't be launched and the scheduler will get TASK_LOST. > > make check > > > Thanks, > > Yifan Gu > >