Gyula Fora created FLINK-30680:
----------------------------------

             Summary: Consider using the autoscaler to detect slow taskmanagers
                 Key: FLINK-30680
                 URL: https://issues.apache.org/jira/browse/FLINK-30680
             Project: Flink
          Issue Type: New Feature
          Components: Autoscaler, Kubernetes Operator
            Reporter: Gyula Fora


We could leverage logic in the autoscaler to detect slow taskmanagers by 
comparing the per-record processing times between them.

If we notice that all subtasks on a single TM are considerably slower than the 
rest (at similar input rates) we should try simply restarting the job instead 
of scaling it up.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to