suneet-s commented on PR #16442: URL: https://github.com/apache/druid/pull/16442#issuecomment-2114023021
> The Supervisor will transition to the new UNHEALTHY_TASKS_STOP_CREATING_NEW state only if three conditions are met: "no progress" AND all failed task are due to parsing exceptions AND the count of failed task exceed TaskUnhealthinessThreshold. Under these circumstances, we can attribute the parsing exceptions as the root cause of the Supervisor's lack of progress. Why limit this functionality just to stuck supervisors because of parse exceptions? If a supervisor is failing continuously (because of OOMs or something else) and the offsets to read from are not moving forward, it seems like stopping the supervisor and notifying the operator would be a good thing to do. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
