[ 
https://issues.apache.org/jira/browse/MESOS-7783?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

A. Dukhovniy updated MESOS-7783:
--------------------------------
    Attachment: GroupDeployIntegrationTest.log.zip

Here is a full log from our test suite. Every test suite starts its own 
marathon+master+agent+zk bundle and runs the individual tests sequentially. 
There is a {{cleanUp}} after each test which removes all the apps, ensuring a 
clean slate for the next test.

The interesting part for you starts here:
{code:java}
consoleText211.txt:DEBUG[23:17:07 GroupDeployIntegrationTest-LocalMarathon-32799] INFO [23:17:07 GroupVersioningUtil$] [/group-1/app-with-running-deployment-cannot-be-deleted-without-force]: new app detected
...
{code}


> Framework might not receive status update when a just launched task is killed 
> immediately
> -----------------------------------------------------------------------------------------
>
>                 Key: MESOS-7783
>                 URL: https://issues.apache.org/jira/browse/MESOS-7783
>             Project: Mesos
>          Issue Type: Bug
>          Components: agent
>    Affects Versions: 1.2.0
>            Reporter: Benjamin Bannier
>            Priority: Critical
>         Attachments: GroupDeployIntegrationTest.log.zip, logs
>
>
> Our Marathon team is seeing issues in their integration test suite where 
> Marathon gets stuck in an infinite loop trying to kill a just-launched task. 
> In their test a task is launched and then immediately killed -- the framework 
> does not, e.g., wait for any task status update.
> In this case the launch and kill messages arrive at the agent in the correct 
> order, but neither the launch nor the kill path in the agent reaches the point 
> where a status update is sent to the framework. Since the framework has seen 
> no status update on the task, it re-triggers the kill, causing an infinite loop.
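
The loop described above can be illustrated with a toy model. This is only a sketch and not Mesos or Marathon code: the {{Agent}} class, the {{send_updates}} switch and the {{kill_until_terminal}} helper are hypothetical names invented for illustration, and {{max_attempts}} is an added bound so that the example terminates, whereas the reported loop does not. Only the state names TASK_RUNNING and TASK_KILLED are taken from Mesos.

```python
from dataclasses import dataclass, field


@dataclass
class Agent:
    """Toy agent (hypothetical). `send_updates=False` models the reported
    behaviour where neither the launch nor the kill path emits an update."""
    send_updates: bool
    updates: list = field(default_factory=list)

    def launch(self, task_id):
        if self.send_updates:
            self.updates.append((task_id, "TASK_RUNNING"))

    def kill(self, task_id):
        if self.send_updates:
            self.updates.append((task_id, "TASK_KILLED"))


def kill_until_terminal(agent, task_id, max_attempts=5):
    """Launch a task, kill it right away, and re-send the kill until a
    TASK_KILLED update is seen. Returns the number of kill attempts, or
    None if no update arrived within `max_attempts` (an assumed bound)."""
    agent.launch(task_id)
    for attempt in range(1, max_attempts + 1):
        agent.kill(task_id)
        if (task_id, "TASK_KILLED") in agent.updates:
            return attempt
    return None
```

With {{Agent(send_updates=True)}} the helper returns after the first kill. With {{Agent(send_updates=False)}} every attempt goes unanswered and the helper gives up once the bound is reached; a framework without such a bound would keep re-sending the kill.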



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)
