Maxim Khutornenko created AURORA-669:
----------------------------------------

             Summary: Thermos runner collect_updates() gets stuck in a while 
loop when timeout is 0
                 Key: AURORA-669
                 URL: https://issues.apache.org/jira/browse/AURORA-669
             Project: Aurora
          Issue Type: Bug
          Components: Thermos
            Reporter: Maxim Khutornenko


The following code in runner.py:collect_update() may result in an infinite 
while loop when the provided timeout is passed as 0:

{noformat}
 while True:
       ...
        if timeout and total_time >= timeout:
          break
       ...
{noformat}

We have observed a case when the thermos runner gets stuck in a "deadlocked" 
state not reacting to SIGTERM with the last message in __main__.log as:
{noformat}
D0827 15:35:26.022495 30886 runner.py:856] Run loop: Work to be done within 0.0s
{noformat}



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Reply via email to