Gyula Fora created FLINK-30680:
----------------------------------
Summary: Consider using the autoscaler to detect slow taskmanagers
Key: FLINK-30680
URL: https://issues.apache.org/jira/browse/FLINK-30680
Project: Flink
Issue Type: New Feature
Components: Autoscaler, Kubernetes Operator
Reporter: Gyula Fora
We could leverage logic in the autoscaler to detect slow taskmanagers by
comparing the per-record processing times between them.
If we notice that all subtasks on a single TM are considerably slower than the
rest (at similar input rates) we should try simply restarting the job instead
of scaling it up.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)