Gyula Fora created FLINK-26135:
----------------------------------
Summary: Separate job and deployment errors in FlinkDeployment
status
Key: FLINK-26135
URL: https://issues.apache.org/jira/browse/FLINK-26135
Project: Flink
Issue Type: Sub-task
Components: Deployment / Kubernetes
Reporter: Gyula Fora
Assignee: Gyula Fora
At the moment the controller does not validate or tolerate any deployment
errors such as incorrect configurations etc. Those will lead to an exception
loop in the reconcile logic.
There are cases where the job deployment cannot be executed due to incorrect
configuration or other causes. In these cases the job can still be running
correctly so the job status should be OK but we should signal a deployment
error to the user that requires action.
There should be a shared validation logic between the controller and the
webhook that should be applied whenever a new FlinkDeployment update is
received by the controller. If an error is detected in the controller, set the
deployment status to error with a useful message and leave the current job
running.
--
This message was sent by Atlassian Jira
(v8.20.1#820001)