Karsten created MESOS-8762:
------------------------------

             Summary: Framework Teardown Leaves Task in Uninterruptible Sleep State D
                 Key: MESOS-8762
                 URL: https://issues.apache.org/jira/browse/MESOS-8762
             Project: Mesos
          Issue Type: Bug
            Reporter: Karsten


Marathon has a test suite that starts a simple Python HTTP server in a task 
group (a pod) in Marathon. After the test run, we call {{/teardown}} and wait 
for the Marathon framework to be completed (see 
[MesosTest|https://github.com/mesosphere/marathon/blob/master/src/test/scala/mesosphere/marathon/integration/setup/MesosTest.scala#L311]).
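
For readers unfamiliar with the teardown flow, here is a minimal Python sketch of the two steps described above. It is not the code in MesosTest, which is Scala. The helper names, the master URL, and the framework ID are hypothetical. It assumes the Mesos master accepts a POST to {{/teardown}} with a {{frameworkId}} form field, and that the master's frameworks listing reports torn-down frameworks under {{completed_frameworks}}.

```python
import urllib.parse
import urllib.request


def build_teardown_request(master_url: str, framework_id: str) -> urllib.request.Request:
    """Build (but do not send) a POST to the master's /teardown endpoint.

    Assumption: the framework ID is passed as a form-encoded
    `frameworkId` field in the request body.
    """
    body = urllib.parse.urlencode({"frameworkId": framework_id}).encode("ascii")
    return urllib.request.Request(
        url=master_url.rstrip("/") + "/teardown",
        data=body,
        method="POST",
    )


def is_framework_completed(frameworks_json: dict, framework_id: str) -> bool:
    """Return True if the framework is listed under `completed_frameworks`.

    `frameworks_json` is the already-parsed JSON body of the master's
    frameworks listing (assumed shape, see the note above).
    """
    completed = frameworks_json.get("completed_frameworks", [])
    return any(fw.get("id") == framework_id for fw in completed)
```

A test harness would send the request with {{urllib.request.urlopen}} and then poll the frameworks listing until {{is_framework_completed}} returns true or a timeout expires.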

 

Our CI checks whether we leak any tasks after all test runs. It turns out we do:

{code}
Will kill:
  root     18084  0.0  0.0  45380 13612 ?        D    07:52   0:00 python src/app_mock.py 35477 resident-pod-16322-fail 2018-04-06T07:52:16.924Z http://www.example.com
Running 'sudo kill -9 18084
Wait for processes being killed...
...
Couldn't kill some leaked processes:
  root     18084  0.0  0.0  45380 13612 ?        D    07:52   0:00 python src/app_mock.py 35477 resident-pod-16322-fail 2018-04-06T07:52:16.924Z http://www.example.com
ammonite.$file.ci.utils$StageException: Stage Compile and Test failed.
{code}
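
The {{D}} in the STAT column of the log above is what makes the process unkillable: a process in uninterruptible sleep does not act on signals, including SIGKILL, until it leaves that state. As a rough illustration (not the actual CI script), the following Python sketch picks such processes out of {{ps aux}}-style lines. The function names are hypothetical, and the column layout is assumed to match the output shown above.

```python
def parse_ps_line(line: str) -> dict:
    """Parse one `ps aux`-style line into the fields we care about.

    Assumed columns:
    USER PID %CPU %MEM VSZ RSS TTY STAT START TIME COMMAND
    """
    # Split into at most 11 fields so the command keeps its own spaces.
    fields = line.split(None, 10)
    return {
        "user": fields[0],
        "pid": int(fields[1]),
        "stat": fields[7],
        "command": fields[10],
    }


def uninterruptible(lines):
    """Return the processes whose state starts with 'D' (uninterruptible sleep)."""
    procs = [parse_ps_line(line) for line in lines if line.strip()]
    return [p for p in procs if p["stat"].startswith("D")]
```

A leak check could report these processes separately, since retrying {{kill -9}} on them will not help while they remain in state D.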





--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
