[ 
https://issues.apache.org/jira/browse/AURORA-669?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14118388#comment-14118388
 ] 

Bill Farner commented on AURORA-669:
------------------------------------

From catching up on other threads, this seemed pretty urgent to fix.  Should 
this be handed off to someone else or is a fix in the works?

> Thermos runner collect_updates() gets stuck in a while loop when timeout is 0
> -----------------------------------------------------------------------------
>
>                 Key: AURORA-669
>                 URL: https://issues.apache.org/jira/browse/AURORA-669
>             Project: Aurora
>          Issue Type: Bug
>          Components: Thermos
>            Reporter: Maxim Khutornenko
>            Assignee: brian wickman
>
> The following code in runner.py:collect_updates() can loop forever when the 
> timeout is passed as 0, because 0 is falsy and the break is never reached:
> {noformat}
>   while True:
>     ...
>     if timeout and total_time >= timeout:
>       break
>     ...
> {noformat}
> We have observed a case where the thermos runner got stuck in a "deadlocked" 
> state, not reacting to SIGTERM, with the last message in __main__.log being:
> {noformat}
> D0827 15:35:26.022495 30886 runner.py:856] Run loop: Work to be done within 0.0s
> {noformat}
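
To illustrate the failure mode described above, here is a minimal, self-contained
sketch. It is not the actual runner.py code: the helper names, the simulated
clock, and the iteration cap are all hypothetical, and the "explicit" check is
only one possible fix, assuming that a timeout of 0 should mean "no time left"
and that only None should mean "no timeout".

```python
def should_stop_truthy(timeout, total_time):
    # Condition as quoted in the issue. 0 is falsy in Python, so with
    # timeout=0 the expression short-circuits and is never true.
    return bool(timeout and total_time >= timeout)


def should_stop_explicit(timeout, total_time):
    # Hypothetical alternative (an assumption, not the committed patch):
    # treat only None as "no timeout", so 0 expires immediately.
    return timeout is not None and total_time >= timeout


def run_loop(should_stop, timeout, step=1, max_iterations=1000):
    """Simulated polling loop.

    Returns the iteration on which the loop exits, or None if the safety
    cap is reached (standing in for "would spin forever").
    """
    total_time = 0
    for iteration in range(1, max_iterations + 1):
        # ... the real loop would collect updates here ...
        if should_stop(timeout, total_time):
            return iteration
        total_time += step  # simulated elapsed time, no real sleeping
    return None


if __name__ == "__main__":
    print("truthy check, timeout=0:  ", run_loop(should_stop_truthy, 0))
    print("explicit check, timeout=0:", run_loop(should_stop_explicit, 0))
    print("truthy check, timeout=3:  ", run_loop(should_stop_truthy, 3))
```

With the truthiness check, a timeout of 0 behaves exactly like no timeout at
all, which matches the "Work to be done within 0.0s" message followed by a
loop that never exits. Whether 0 should instead return immediately is a design
decision for the real fix.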



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
