Re: Review Request 22313: MESOS-886: Prevented slave from launching tasks before containerize's update completes.

Yifan Gu Tue, 17 Jun 2014 18:59:24 -0700


> On June 17, 2014, 12:28 a.m., Vinod Kone wrote:
> > src/slave/slave.cpp, lines 1204-1206
> > <https://reviews.apache.org/r/22313/diff/7/?file=607037#file607037line1204>
> >
> >     The slave cannot be in RECOVERING. See the comment on line #1099.


I see! Thank you!


> On June 17, 2014, 12:28 a.m., Vinod Kone wrote:
> > src/slave/slave.cpp, lines 1219-1224
> > <https://reviews.apache.org/r/22313/diff/7/?file=607037#file607037line1219>
> >
> >     Is this possible? AFAICT, since the task was added to the executor the 
> > executor shouldn't be removed between _runTask() and __runTask(). Even if 
> > the executor terminates in between, this task should've been marked 
> > 'terminated' but not 'completed' (i.e., waiting for an ACK) and hence the 
> > executor won't be removed from the framework's map. Since there is a 
> > pending executor, the framework shouldn't be removed.
> >     
> >     So this can be a CHECK_NONTULL(framework) with a comment on why it can 
> > be a check.

Good point! I am currently trying to add a test to kill the framework before 
the containerizer->update() returns to test this. Thanks for pointing out!


> On June 17, 2014, 12:28 a.m., Vinod Kone wrote:
> > src/tests/slave_tests.cpp, lines 803-804
> > <https://reviews.apache.org/r/22313/diff/7/?file=607038#file607038line803>
> >
> >     I don't see where this test is ensuring that resources are isolated 
> > before launching tasks?
> >     
> >     I would recommend splitting this into 2 tests
> >     
> >     1) ensure resources are updated before task is launched
> >     
> >     2) containerizer failed causes task lost to be sent.
> >     
> >     i would also recommend writing a test that ensures that framework and 
> > executor are not removed between _runTask and __runTask().

Doing!


> On June 17, 2014, 12:28 a.m., Vinod Kone wrote:
> > src/tests/slave_tests.cpp, lines 864-865
> > <https://reviews.apache.org/r/22313/diff/7/?file=607038#file607038line864>
> >
> >     You should set these expectations just before you launch the second 
> > task. Also, why "WillRepeatedly"? Are you expecting more than 2 updates?

I forgot why I added the WillRepeatedly here, but I guess this is because the 
first task will send a TASK_LOST at the end. I will try to verify this in the 
new version. Thanks! Vinod!


- Yifan


-----------------------------------------------------------
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/22313/#review45858
-----------------------------------------------------------


On June 11, 2014, 9:32 a.m., Yifan Gu wrote:
> 
> -----------------------------------------------------------
> This is an automatically generated e-mail. To reply, visit:
> https://reviews.apache.org/r/22313/
> -----------------------------------------------------------
> 
> (Updated June 11, 2014, 9:32 a.m.)
> 
> 
> Review request for mesos, Ian Downes and Vinod Kone.
> 
> 
> Bugs: MESOS-886
>     https://issues.apache.org/jira/browse/MESOS-886
> 
> 
> Repository: mesos-git
> 
> 
> Description
> -------
> 
> Added __runTask() to wait for the completion of containerizer->update() and 
> check the result before sending RunTaskMessage.
> 
> 
> Diffs
> -----
> 
>   src/slave/slave.hpp 34687e5 
>   src/slave/slave.cpp 643c088 
>   src/tests/slave_tests.cpp 2c8f183 
> 
> Diff: https://reviews.apache.org/r/22313/diff/
> 
> 
> Testing
> -------
> 
> SlaveTest, CancelTaskIfContainerizerFails
> 
> Which tests that if the containerizer->update() return a Failure, the task 
> won't be launched and the scheduler will get TASK_LOST.
> 
> make check
> 
> 
> Thanks,
> 
> Yifan Gu
> 
>

Re: Review Request 22313: MESOS-886: Prevented slave from launching tasks before containerize's update completes.

Reply via email to