Rui Fan created FLINK-34906:
-------------------------------
Summary: Don't start autoscaling when some tasks are not running
Key: FLINK-34906
URL: https://issues.apache.org/jira/browse/FLINK-34906
Project: Flink
Issue Type: Improvement
Components: Autoscaler
Reporter: Rui Fan
Assignee: Rui Fan
Fix For: 1.9.0
Attachments: image-2024-03-21-17-40-23-523.png
Currently, the autoscaler will scale a job when the JobStatus is RUNNING. But
the JobStatus will be RUNNING once job starts schedule, so it doesn't mean all
tasks are running. Especially, when the resource isn't enough or job recovers
from large state.
The autoscaler will throw exception and generate the AutoscalerError event when
tasks are not ready, such as:
!image-2024-03-21-17-40-23-523.png!
Solution: we only scale job that all tasks are running(some of tasks may be
finished).
--
This message was sent by Atlassian Jira
(v8.20.10#820010)