Gyula Fora created FLINK-30680: ---------------------------------- Summary: Consider using the autoscaler to detect slow taskmanagers Key: FLINK-30680 URL: https://issues.apache.org/jira/browse/FLINK-30680 Project: Flink Issue Type: New Feature Components: Autoscaler, Kubernetes Operator Reporter: Gyula Fora
We could leverage logic in the autoscaler to detect slow taskmanagers by comparing the per-record processing times between them. If we notice that all subtasks on a single TM are considerably slower than the rest (at similar input rates) we should try simply restarting the job instead of scaling it up. -- This message was sent by Atlassian Jira (v8.20.10#820010)