jihoonson commented on issue #7828: Supervisor marks succeeded replicas as 
failed too aggressively
URL: 
https://github.com/apache/incubator-druid/issues/7828#issuecomment-498826492
 
 
   I have been looking at more deeply and noticed that this is what's happening.
   
   - The task was finished and unregistered its chatHandler.
   - But the process which was running the task was not terminated immediately. 
For example, `unannouncePropagationDelay` can block the process from being 
terminated for a while.
   - The task status is updated only after the task process terminates 
(`ForkingTaskRunner`).
   - So, while the task process was waiting to be terminated, the task status 
in the metadata store was still `RUNNING` even though the task itself was 
already finished.
   - The supervisor killed the task because it returned an error of `Can't find 
chatHandler` for the shutdown request.

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

---------------------------------------------------------------------
To unsubscribe, e-mail: commits-unsubscr...@druid.apache.org
For additional commands, e-mail: commits-h...@druid.apache.org

Reply via email to