gyfora opened a new pull request, #541:
URL: https://github.com/apache/flink-kubernetes-operator/pull/541
## What is the purpose of the change
The current JM Deployment logic that restarts missing deployments strictly
requires HA metadata event for stateless deployments.
This is inconsistent with how the cluster health check related restarts work
which can cause the operator to delete an unhealthy deployment and potentially
leave it missing if the first deploy attempt fails.
## Brief change log
- Allow JM deployment recovery for stateless jobs
- Require HA meta based on the HA config
- Add hotfix for missing wait for cluster shutdown
## Verifying this change
Extended deployment recovery test to cover the stateless case.
## Does this pull request potentially affect one of the following parts:
- Dependencies (does it add or upgrade a dependency): no
- The public API, i.e., is any changes to the `CustomResourceDescriptors`:
no
- Core observer or reconciler logic that is regularly executed: no
## Documentation
- Does this pull request introduce a new feature? no
- If yes, how is the feature documented? not applicable
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]