zhenyuT opened a new issue, #3423:
URL: https://github.com/apache/incubator-streampark/issues/3423

   ### Search before asking
   
   - [X] I had searched in the 
[issues](https://github.com/apache/incubator-streampark/issues?q=is%3Aissue+label%3A%22bug%22)
 and found no similar issues.
   
   
   ### Java Version
   
   java version "1.8.0_181"
   Java(TM) SE Runtime Environment (build 1.8.0_181-b13)
   Java HotSpot(TM) 64-Bit Server VM (build 25.181-b13, mixed mode)
   
   
   ### Scala Version
   
   2.12.x
   
   ### StreamPark Version
   
   apache-streampark_2.12-2.1.1-incubating
   
   ### Flink Version
   
   1.15.3
   
   ### deploy mode
   
   kubernetes-application
   
   ### What happened
   
   flink on k8s高可用采用zk,通过故障注入工具模拟JobManager访问zk节点超时(网络延时30s,持续120s后恢复)
   通过命令行方式启动任务,JobManager会自动restart,网络延时故障结束后任务能自动恢复正常
   通过streampark方式启动,JobManager重新启动的时候,突然整个deployment都被remove
   
   查看streampark日志,发现是streampark监听到flink任务fail,触发了delete deployment操作
   
![1703233420947_125D8297-C2B0-462f-8A04-170833A89075](https://github.com/apache/incubator-streampark/assets/9477681/7a91cc72-a54b-4ca1-8402-c6430f25728b)
   
![1703233457034_5E41A593-3067-4eb3-A168-2D5A182D96DC](https://github.com/apache/incubator-streampark/assets/9477681/b439ee7b-5357-4f74-af73-d22dd6fab962)
   
   
   ### Error Exception
   
   _No response_
   
   ### Screenshots
   
   _No response_
   
   ### Are you willing to submit PR?
   
   - [ ] Yes I am willing to submit a PR!(您是否要贡献这个PR?)
   
   ### Code of Conduct
   
   - [X] I agree to follow this project's [Code of 
Conduct](https://www.apache.org/foundation/policies/conduct)
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to