zhenyuT opened a new issue, #3423: URL: https://github.com/apache/incubator-streampark/issues/3423
### Search before asking - [X] I had searched in the [issues](https://github.com/apache/incubator-streampark/issues?q=is%3Aissue+label%3A%22bug%22) and found no similar issues. ### Java Version java version "1.8.0_181" Java(TM) SE Runtime Environment (build 1.8.0_181-b13) Java HotSpot(TM) 64-Bit Server VM (build 25.181-b13, mixed mode) ### Scala Version 2.12.x ### StreamPark Version apache-streampark_2.12-2.1.1-incubating ### Flink Version 1.15.3 ### deploy mode kubernetes-application ### What happened flink on k8s高可用采用zk,通过故障注入工具模拟JobManager访问zk节点超时(网络延时30s,持续120s后恢复) 通过命令行方式启动任务,JobManager会自动restart,网络延时故障结束后任务能自动恢复正常 通过streampark方式启动,JobManager重新启动的时候,突然整个deployment都被remove 查看streampark日志,发现是streampark监听到flink任务fail,触发了delete deployment操作   ### Error Exception _No response_ ### Screenshots _No response_ ### Are you willing to submit PR? - [ ] Yes I am willing to submit a PR!(您是否要贡献这个PR?) ### Code of Conduct - [X] I agree to follow this project's [Code of Conduct](https://www.apache.org/foundation/policies/conduct) -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
